Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_16891 |
Symbol | thiF |
ID | 5730135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1513656 |
End bp | 1514795 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641286070 |
Product | molybdopterin biosynthesis protein |
Protein accession | YP_001551574 |
Protein GI | 159904230 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCTA TAAAAGCCAA TAATATGCAG CTGAGCCCTG AAGAATTTGC TCGGTACTCT CGACATTTAT CATTACCAGA AATTGGAATT AAGGGTCAAA AGAAACTAAA AGGCAGTTCT GTACTTTGCA TTGGCTGTGG AGGGCTTGGT TCTCCAGTAC TGATATATCT TGCTGCGGCT GGCATAGGTA ATCTTGGGAT TGTAGATAAT GATTTGGTAG AAGAGTCAAA CTTACAAAGA CAAATCATTC ACACTCACAA GTCAATTGGT AAGTCAAAAA TAGAATCAGC ACGATCTCGA ATAATTGATA TTAATCCTTA TTGTAAAGTA ACCATTTTTA GTGAATTACT AAATAATGAA AATGCCCTAG AAATTATTAA ACCATATGAT CTAGTATGTG ACTGCACAGA TAACTTTGAA AGCAGATATT TAATAAATGA TGCCTGTGTT TTGCTTGGAA AGCCATATAT ATATGGAGCA ATTTCAAAGT TCGAAGGTCA AGCCTCTGTT TTTAATCTAG ATAAAGATAG TCCAAATTTT AGAGATTTAA TCCCTCAACC TCCCTCAATG GATTTACTAC CTTCATGTAG TGAGTCTGGT GTTATAGGGG TTCTTCCAGG TATTATTGGA TTAATACAGG CTACAGAAAT AATTAAGATT ATAGCTGGGA TAGGTACCAC TTTAAGTGGA AGGATATTGA TATTTAATGC GCTAGAAATG AAGTTTAAAG AGCTAAAGTT AAAGAAGGAT TATGCTGCAA AGCCAATTAC AGAATTGATT GATTATAAGG ATTTTTGTGG TTCTTCTGGT GTAACAAAAG TCGAAGACAG TATTAAAAGT ATTTCGGCTA AAAAGCTTAG AGTTTTACTT GAAGATAATC CCAACAAAAA TATCTTATTA GATGTTCGTA CTGAGGAAGA GTTTAAATTA AATGCAATCA AAGGATCAAT ATTAGTGCCA CTCAAAAATA TTCAAAATGG GCAAGAGATA GATAAAATTC GAAAACTAGC TAGCAATAAA AATATATTTG TTCATTGTAA GACTGGTAAA AGGTCGAGGA AAGCAATACT GAGTTTACGA TCAAATGGAA TTGATGCTGT CAATCTGGAA GGTGGGATAA ATTCTTGGAA TGAAAGCTAA
|
Protein sequence | MKSIKANNMQ LSPEEFARYS RHLSLPEIGI KGQKKLKGSS VLCIGCGGLG SPVLIYLAAA GIGNLGIVDN DLVEESNLQR QIIHTHKSIG KSKIESARSR IIDINPYCKV TIFSELLNNE NALEIIKPYD LVCDCTDNFE SRYLINDACV LLGKPYIYGA ISKFEGQASV FNLDKDSPNF RDLIPQPPSM DLLPSCSESG VIGVLPGIIG LIQATEIIKI IAGIGTTLSG RILIFNALEM KFKELKLKKD YAAKPITELI DYKDFCGSSG VTKVEDSIKS ISAKKLRVLL EDNPNKNILL DVRTEEEFKL NAIKGSILVP LKNIQNGQEI DKIRKLASNK NIFVHCKTGK RSRKAILSLR SNGIDAVNLE GGINSWNES
|
| |