Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_17601 |
Symbol | thiF |
ID | 4911805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1481810 |
End bp | 1482955 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640161360 |
Product | molybdopterin biosynthesis protein |
Protein accession | YP_001091984 |
Protein GI | 126697098 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAACG ATATCAAATT TAACTTCTTA AGCTCTGATG AAGAAGAAAG ATATCAAAAA CATTTAACCC TTAAAGAAAT AGGTTATGAG GGTCAACTAA ATCTTAAAAA CAGCTCAGTA TTATGCATTG GAGCAGGTGG ACTTGGGTCT TCGGTTTTGC TTTATCTTGC CGCAACAGGA ATTGGGAGGA TTGGAATCGT AGATAACGAT CAAGTCGAAA AGTCTAATCT CCAGAGACAG ATAATTCATG AAACAAAAAC TATTGGCAAT CTGAAAATTG ATTCTGCCAA GGAAAGAATT AAAAAATTGA ATCCTAATTG TGAAATATTA ACCTTTCCAG AAAGAATTAA TCCTAAAAAT GCCCTTGAAT TAATAAATAA GTTTGATGTT ATTTGTGATT GCTCGGATAA CTTTAGTACA AGATATTTGA TAAATGATTC ATGTCTGATA TTAAATAAAC CCCTTGTATT TGGAAGTGTA CAAGGCTTCG AAGGACAAGT AAGTGTTTTC AATTTATACA AAAATAGTCC TAATTTAAGA GACTTACTTC CAGAATCACC TTCAAAAAAT GCTGCCCCTA GTTGTGCAGA ATACGGCGTT GTAGGCGTTT CAACAGGTTT AATAGGAATT CTTCAGGTTA ATGAAATAAT CAAAATCATT TTGAAAAAAG GTGAAACTTT GGATGGGAAA ATTTTAATTT TTGATCTATT AAAGATGAAT ATAAAAAAAT TAACTCTTAA AAGTGATCAG TTAAATAAAC GAATTAAAAA TCTGTCTCAA AATGAGGACT TCTATAATAG CGATGAATAT TGTGAAAAAA ATAATGAAAT TAATACTATT AATGCTAATG ACTTTAATAA TTTATATAAA GCAAAACGCA ACAAAATTCT TTTAATTGAT GTTAGAGAAA ATGAAGAATT TTCTACTTCT GCAATAGAGG GATCTATATC AATTCCCTTA AGGCATTTGG ACCAAAATTC TGACTTAAAA TTTATTCAGA AAGAAAGTTT AGGCAAAGAG GTTTTCACTA TATGTAAATC GGGGAAACGT TCTGAAGAAG CATCAAGAAT CTTGTCTAAA TTCAAAATTC AGTCAAGATC TATTGAAGGC GGCATTGAAA AGGTAAGAAA AATATTGTGC AACTAA
|
Protein sequence | MSNDIKFNFL SSDEEERYQK HLTLKEIGYE GQLNLKNSSV LCIGAGGLGS SVLLYLAATG IGRIGIVDND QVEKSNLQRQ IIHETKTIGN LKIDSAKERI KKLNPNCEIL TFPERINPKN ALELINKFDV ICDCSDNFST RYLINDSCLI LNKPLVFGSV QGFEGQVSVF NLYKNSPNLR DLLPESPSKN AAPSCAEYGV VGVSTGLIGI LQVNEIIKII LKKGETLDGK ILIFDLLKMN IKKLTLKSDQ LNKRIKNLSQ NEDFYNSDEY CEKNNEINTI NANDFNNLYK AKRNKILLID VRENEEFSTS AIEGSISIPL RHLDQNSDLK FIQKESLGKE VFTICKSGKR SEEASRILSK FKIQSRSIEG GIEKVRKILC N
|
| |