Gene Information |
Locus tag | A9601_17761 |
Symbol | thiF |
ID | 4718509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1508150 |
End bp | 1509295 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640079505 |
Product | molybdopterin biosynthesis protein |
Protein accession | YP_001010166 |
Protein GI | 123969308 |
COG category | [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2; [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
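The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the CDS is a single unspliced reading frame ending in a stop codon, which matches the "Is gene spliced | No" field. The variable names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table.
start_bp = 1508150
end_bp = 1509295

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one unspliced reading frame whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1146, matching "Gene Length"
print(gene_length % 3)  # 0, the length is a whole number of codons
print(protein_length)   # 381, matching "Protein Length"
```

Under these assumptions the reported gene length (1146 bp) and protein length (381 aa) are consistent with the start and end coordinates.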
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.624505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAAG ATATAAAATT TAATTTCTTA AACTCTGATG AAGAAGAGAG ATACCAAAAA CATCTTACCC TTAAAGAGAT AGGTTACGAG GGCCAATTAA ATCTTAAAAA CAGCTCAGTA TTATGTATCG GAGCAGGTGG GCTTGGCTCT TCCGTTTTGC TTTATCTAGC TGCAACAGGA ATTGGGAGAA TTGGAATAGT TGATAACGAT CAAGTTGAAA AGTCTAATCT CCAGAGACAG ATAATTCATG AAACAAATAC TATTGGCAAT CTTAAAATTG ATTCTGCTAG GGAAAGAATT AAAAATTTCA ATCCTAATTG TGAAATATTA ACCTTTTCAG AGAGAATTAA TCCTAAAAAT GCTCTTGAAT TAATAAAGGA GTTTGATGTT ATTTGTGATT GCTCTGATAA CTTTGGCACA AGATATTTAA TAAATGATTC ATGCCTGATA TTAGATAAAC CCCTAGTCTT TGGAAGTGTA CAAGGCTTTG AAGGGCAGGT GAGTGTTTTC AATTTATATA AAAATAGTCC AAATTTAAGA GACTTACTTC CAGAATCACC TTCAAAAAAT GCTGCCCCAA GTTGTGCAGA ATTCGGAGTT GTGGGTGTTT CAACAGGTTT AATAGGAATT CTTCAGGTTA ATGAAATTAT CAAAATCATT TTGAAAAAAG GTGAAATTTT AGATGGGAAG ATTTTAATTT TTGATCTATT GAATATGAAT ATGAAAAAAT TACATCTAAA AAGTGATCAG TCAAATAAAC GAATAAAAAA CCTGTCTCAG TTTGAGGGCT TTTATAATAG CGACGAATAT TGTGAAAAAA ATAATGAGAT TAACATTATA AATGCTGATG AATTTAATAG TTTATACAAA GCAAAACCCA ACAAAATTCT TTTAATTGAT GTTAGAGAAA ATGAAGAATT TTCTTCATTT GCAATAGAAG GATCTATCTC AATTCCCTTA AGTCATTTGA AACAAGCATC TGACTTAAAA TTTATTCAAA AAGAAAGTTT AAATAAAGAG GTTTTCACTA TATGTAAATC GGGGAAACGC TCTGAAAAAG CATCAAGAAT CTTATCTAAA TTCAAAATTA AGTCAAGATC TATTGAAGGC GGCATCGAAA AGGTAAAAAA AATATTGGGC AATTAA
|
Protein sequence | MSKDIKFNFL NSDEEERYQK HLTLKEIGYE GQLNLKNSSV LCIGAGGLGS SVLLYLAATG IGRIGIVDND QVEKSNLQRQ IIHETNTIGN LKIDSARERI KNFNPNCEIL TFSERINPKN ALELIKEFDV ICDCSDNFGT RYLINDSCLI LDKPLVFGSV QGFEGQVSVF NLYKNSPNLR DLLPESPSKN AAPSCAEFGV VGVSTGLIGI LQVNEIIKII LKKGEILDGK ILIFDLLNMN MKKLHLKSDQ SNKRIKNLSQ FEGFYNSDEY CEKNNEINII NADEFNSLYK AKPNKILLID VRENEEFSSF AIEGSISIPL SHLKQASDLK FIQKESLNKE VFTICKSGKR SEKASRILSK FKIKSRSIEG GIEKVKKILG N
|
| |
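The sequence fields above are printed in 10-character blocks separated by spaces, so the spacing has to be removed before any analysis. The following minimal sketch shows one way to do that and to inspect the start codon, the stop codon, and the GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only short excerpts of the gene sequence are used here, so the GC value printed applies to the excerpt and not to the whole gene.

```python
def clean_sequence(field: str) -> str:
    """Remove the whitespace that separates the 10-base blocks."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Excerpts copied from the "Gene sequence" field: the first two blocks
# and the final block. Paste the full field to analyse the whole gene.
first_blocks = "ATGTCCAAAG ATATAAAATT"
last_block = "AATTAA"

seq_start = clean_sequence(first_blocks)

print(seq_start[:3])           # ATG, the start codon
print(last_block[-3:])         # TAA, a stop codon in translation table 11
print(gc_fraction(seq_start))  # 0.2 for this 20-base excerpt only
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the "GC content" field.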