Gene P9301_17601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_17601 
SymbolthiF 
ID4911805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1481810 
End bp1482955 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content30% 
IMG OID640161360 
Productmolybdopterin biosynthesis protein 
Protein accessionYP_001091984 
Protein GI126697098 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAACG ATATCAAATT TAACTTCTTA AGCTCTGATG AAGAAGAAAG ATATCAAAAA 
CATTTAACCC TTAAAGAAAT AGGTTATGAG GGTCAACTAA ATCTTAAAAA CAGCTCAGTA
TTATGCATTG GAGCAGGTGG ACTTGGGTCT TCGGTTTTGC TTTATCTTGC CGCAACAGGA
ATTGGGAGGA TTGGAATCGT AGATAACGAT CAAGTCGAAA AGTCTAATCT CCAGAGACAG
ATAATTCATG AAACAAAAAC TATTGGCAAT CTGAAAATTG ATTCTGCCAA GGAAAGAATT
AAAAAATTGA ATCCTAATTG TGAAATATTA ACCTTTCCAG AAAGAATTAA TCCTAAAAAT
GCCCTTGAAT TAATAAATAA GTTTGATGTT ATTTGTGATT GCTCGGATAA CTTTAGTACA
AGATATTTGA TAAATGATTC ATGTCTGATA TTAAATAAAC CCCTTGTATT TGGAAGTGTA
CAAGGCTTCG AAGGACAAGT AAGTGTTTTC AATTTATACA AAAATAGTCC TAATTTAAGA
GACTTACTTC CAGAATCACC TTCAAAAAAT GCTGCCCCTA GTTGTGCAGA ATACGGCGTT
GTAGGCGTTT CAACAGGTTT AATAGGAATT CTTCAGGTTA ATGAAATAAT CAAAATCATT
TTGAAAAAAG GTGAAACTTT GGATGGGAAA ATTTTAATTT TTGATCTATT AAAGATGAAT
ATAAAAAAAT TAACTCTTAA AAGTGATCAG TTAAATAAAC GAATTAAAAA TCTGTCTCAA
AATGAGGACT TCTATAATAG CGATGAATAT TGTGAAAAAA ATAATGAAAT TAATACTATT
AATGCTAATG ACTTTAATAA TTTATATAAA GCAAAACGCA ACAAAATTCT TTTAATTGAT
GTTAGAGAAA ATGAAGAATT TTCTACTTCT GCAATAGAGG GATCTATATC AATTCCCTTA
AGGCATTTGG ACCAAAATTC TGACTTAAAA TTTATTCAGA AAGAAAGTTT AGGCAAAGAG
GTTTTCACTA TATGTAAATC GGGGAAACGT TCTGAAGAAG CATCAAGAAT CTTGTCTAAA
TTCAAAATTC AGTCAAGATC TATTGAAGGC GGCATTGAAA AGGTAAGAAA AATATTGTGC
AACTAA
 
Protein sequence
MSNDIKFNFL SSDEEERYQK HLTLKEIGYE GQLNLKNSSV LCIGAGGLGS SVLLYLAATG 
IGRIGIVDND QVEKSNLQRQ IIHETKTIGN LKIDSAKERI KKLNPNCEIL TFPERINPKN
ALELINKFDV ICDCSDNFST RYLINDSCLI LNKPLVFGSV QGFEGQVSVF NLYKNSPNLR
DLLPESPSKN AAPSCAEYGV VGVSTGLIGI LQVNEIIKII LKKGETLDGK ILIFDLLKMN
IKKLTLKSDQ LNKRIKNLSQ NEDFYNSDEY CEKNNEINTI NANDFNNLYK AKRNKILLID
VRENEEFSTS AIEGSISIPL RHLDQNSDLK FIQKESLGKE VFTICKSGKR SEEASRILSK
FKIQSRSIEG GIEKVRKILC N