Gene P9211_16891 details


Gene Information

Locus tag: P9211_16891
Symbol: thiF
ID: 5730135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1513656
End bp: 1514795
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 34%
IMG OID: 641286070
Product: molybdopterin biosynthesis protein
Protein accession: YP_001551574
Protein GI: 159904230
COG category: [H] Coenzyme transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
        [COG0607] Rhodanese-related sulfurtransferase
TIGRFAM ID:
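
The coordinate and length fields above are related by simple arithmetic, and a short check can confirm that they agree. The sketch below assumes the coordinates are 1-based and inclusive and that the gene span includes the stop codon; both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the span ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1513656, 1514795)
print(length_bp)                  # 1140
print(protein_length(length_bp))  # 379
```

Under these assumptions the listed gene length (1140 bp) and protein length (379 aa) are consistent with the start and end positions.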


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTCTA TAAAAGCCAA TAATATGCAG CTGAGCCCTG AAGAATTTGC TCGGTACTCT 
CGACATTTAT CATTACCAGA AATTGGAATT AAGGGTCAAA AGAAACTAAA AGGCAGTTCT
GTACTTTGCA TTGGCTGTGG AGGGCTTGGT TCTCCAGTAC TGATATATCT TGCTGCGGCT
GGCATAGGTA ATCTTGGGAT TGTAGATAAT GATTTGGTAG AAGAGTCAAA CTTACAAAGA
CAAATCATTC ACACTCACAA GTCAATTGGT AAGTCAAAAA TAGAATCAGC ACGATCTCGA
ATAATTGATA TTAATCCTTA TTGTAAAGTA ACCATTTTTA GTGAATTACT AAATAATGAA
AATGCCCTAG AAATTATTAA ACCATATGAT CTAGTATGTG ACTGCACAGA TAACTTTGAA
AGCAGATATT TAATAAATGA TGCCTGTGTT TTGCTTGGAA AGCCATATAT ATATGGAGCA
ATTTCAAAGT TCGAAGGTCA AGCCTCTGTT TTTAATCTAG ATAAAGATAG TCCAAATTTT
AGAGATTTAA TCCCTCAACC TCCCTCAATG GATTTACTAC CTTCATGTAG TGAGTCTGGT
GTTATAGGGG TTCTTCCAGG TATTATTGGA TTAATACAGG CTACAGAAAT AATTAAGATT
ATAGCTGGGA TAGGTACCAC TTTAAGTGGA AGGATATTGA TATTTAATGC GCTAGAAATG
AAGTTTAAAG AGCTAAAGTT AAAGAAGGAT TATGCTGCAA AGCCAATTAC AGAATTGATT
GATTATAAGG ATTTTTGTGG TTCTTCTGGT GTAACAAAAG TCGAAGACAG TATTAAAAGT
ATTTCGGCTA AAAAGCTTAG AGTTTTACTT GAAGATAATC CCAACAAAAA TATCTTATTA
GATGTTCGTA CTGAGGAAGA GTTTAAATTA AATGCAATCA AAGGATCAAT ATTAGTGCCA
CTCAAAAATA TTCAAAATGG GCAAGAGATA GATAAAATTC GAAAACTAGC TAGCAATAAA
AATATATTTG TTCATTGTAA GACTGGTAAA AGGTCGAGGA AAGCAATACT GAGTTTACGA
TCAAATGGAA TTGATGCTGT CAATCTGGAA GGTGGGATAA ATTCTTGGAA TGAAAGCTAA
 
Protein sequence
MKSIKANNMQ LSPEEFARYS RHLSLPEIGI KGQKKLKGSS VLCIGCGGLG SPVLIYLAAA 
GIGNLGIVDN DLVEESNLQR QIIHTHKSIG KSKIESARSR IIDINPYCKV TIFSELLNNE
NALEIIKPYD LVCDCTDNFE SRYLINDACV LLGKPYIYGA ISKFEGQASV FNLDKDSPNF
RDLIPQPPSM DLLPSCSESG VIGVLPGIIG LIQATEIIKI IAGIGTTLSG RILIFNALEM
KFKELKLKKD YAAKPITELI DYKDFCGSSG VTKVEDSIKS ISAKKLRVLL EDNPNKNILL
DVRTEEEFKL NAIKGSILVP LKNIQNGQEI DKIRKLASNK NIFVHCKTGK RSRKAILSLR
SNGIDAVNLE GGINSWNES
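
The listings above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before the sequences can be analysed. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It is not the tool that produced this page. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard code (the tables differ in their permitted start codons, which this sketch ignores), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
# ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listing above.
first_line = "ATGAAGTCTA TAAAAGCCAA TAATATGCAG CTGAGCCCTG AAGAATTTGC TCGGTACTCT"
print(translate(first_line))  # MKSIKANNMQLSPEEFARYS
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene listing to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein sequence given in the record.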