What is turkishsuffix?
A Python library for adding complex suffixes to Turkish words. While doing that, the code uses rule set and python's string manipulation to enhance quality. There are similar libraries in both GitHub and PyPi, but most of them are not functioning right, and other's algorithm are too straightforward and because of that their efficiency is less than turkishsuffix.
turkishsuffix uses math and ordered data to dynamically calculate possibilities instead of checking every possibility statically. It has a config, which can easily be edited.
For now, turkishsuffix has 2 main components:
- __init__.py
- config.ini
In future, I might modularize init file to make it more developer friendly. Right now, my main goal is functionality and effectivity.
Which Features are Working Right Now?
These Turkish suffixes are functioning right now:
- çokluk hali
- ilgi hali
- eşitlik hali
- yönelme hali
- belirtme hali
- bulunma hali
- ayrılma hali
There are exceptions for "su" and "ne" words in ilgi hali. So exception system is functioning right now.
There is an option for apostrophe in special names (like human names, city names etc.), that is functioning right now.
What are Missing?
Right now, a complicated suffix named iyelik hali is missing. I am going to add that as a new method if I can find any spare time. Also, making the code more modular is also on my list.
How to Use?
Installing:
pip install turkishsuffix
Importing:
import turkishsuffix
Suffix construction:
suffix = turkishsuffix.Suffix(word, suffix type) # suffix type is optional
Suffix methods:
suffix.get_suffix()
suffix.get_word(special, type) # special = boolean, both are optional.
suffix.get_old_word()
suffix.set_suffix(suffix type)
How Useful is turkishsuffix?
Programmers usually avoid using suffixes while coding in Turkish. The reason behind that is Turkish having a complex suffix system. For example, an easy one, plural form (çokluk hali) is done with lar or ler. If latest vowel of the word is soft, word's suffix must be ler, if not the suffix must be lar. If we delve deeper, it becomes more complicated. Sometimes some situations require extra letters, some require both hard/soft and rounded/unrounded to be calculated. Because of this complexity, as I said at start, programmers avoid using suffixes. Instead of "Bora'nın", "Deniz'in", "Umut'un" and etc., they simply add another word into sentence and add suffix to that word. For example they add "bey" and those became "Bora Bey'in", "Deniz ,Bey'in" and "Umut Bey'in". While this can be used in Turkish, we should encourage people to use formal ones more. If we can encourage people to use this, even Facebook may change how they approach into this issue.
You can check out the examples on GitHub.
(Bilkent Üniversitesi'ne)
Link:
Github
Posted on Utopian.io - Rewarding Open Source Contributors
Thank you for the Contribution. It has been approved.
Since its your new project New Project you should clearly write about your future plans related to the project, how you will make it big. Also you should provide information about how other can get in touch with you to contribute to the project.
You can contact us on Discord.
[utopian-moderator]
I will improve the post quality. Thanks.
Hey @codingdefined, I just gave you a tip for your hard work on moderation. Upvote this comment to support the utopian moderators and increase your future rewards!
Hey @yokunjon I am @utopian-io. I have just upvoted you!
Achievements
Community-Driven Witness!
I am the first and only Steem Community-Driven Witness. Participate on Discord. Lets GROW TOGETHER!
Up-vote this comment to grow my power and help Open Source contributions like this one. Want to chat? Join me on Discord https://discord.gg/Pc8HG9x