Gene A9601_17761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_17761 
SymbolthiF 
ID4718509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1508150 
End bp1509295 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content30% 
IMG OID640079505 
Productmolybdopterin biosynthesis protein 
Protein accessionYP_001010166 
Protein GI123969308 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.624505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAAAG ATATAAAATT TAATTTCTTA AACTCTGATG AAGAAGAGAG ATACCAAAAA 
CATCTTACCC TTAAAGAGAT AGGTTACGAG GGCCAATTAA ATCTTAAAAA CAGCTCAGTA
TTATGTATCG GAGCAGGTGG GCTTGGCTCT TCCGTTTTGC TTTATCTAGC TGCAACAGGA
ATTGGGAGAA TTGGAATAGT TGATAACGAT CAAGTTGAAA AGTCTAATCT CCAGAGACAG
ATAATTCATG AAACAAATAC TATTGGCAAT CTTAAAATTG ATTCTGCTAG GGAAAGAATT
AAAAATTTCA ATCCTAATTG TGAAATATTA ACCTTTTCAG AGAGAATTAA TCCTAAAAAT
GCTCTTGAAT TAATAAAGGA GTTTGATGTT ATTTGTGATT GCTCTGATAA CTTTGGCACA
AGATATTTAA TAAATGATTC ATGCCTGATA TTAGATAAAC CCCTAGTCTT TGGAAGTGTA
CAAGGCTTTG AAGGGCAGGT GAGTGTTTTC AATTTATATA AAAATAGTCC AAATTTAAGA
GACTTACTTC CAGAATCACC TTCAAAAAAT GCTGCCCCAA GTTGTGCAGA ATTCGGAGTT
GTGGGTGTTT CAACAGGTTT AATAGGAATT CTTCAGGTTA ATGAAATTAT CAAAATCATT
TTGAAAAAAG GTGAAATTTT AGATGGGAAG ATTTTAATTT TTGATCTATT GAATATGAAT
ATGAAAAAAT TACATCTAAA AAGTGATCAG TCAAATAAAC GAATAAAAAA CCTGTCTCAG
TTTGAGGGCT TTTATAATAG CGACGAATAT TGTGAAAAAA ATAATGAGAT TAACATTATA
AATGCTGATG AATTTAATAG TTTATACAAA GCAAAACCCA ACAAAATTCT TTTAATTGAT
GTTAGAGAAA ATGAAGAATT TTCTTCATTT GCAATAGAAG GATCTATCTC AATTCCCTTA
AGTCATTTGA AACAAGCATC TGACTTAAAA TTTATTCAAA AAGAAAGTTT AAATAAAGAG
GTTTTCACTA TATGTAAATC GGGGAAACGC TCTGAAAAAG CATCAAGAAT CTTATCTAAA
TTCAAAATTA AGTCAAGATC TATTGAAGGC GGCATCGAAA AGGTAAAAAA AATATTGGGC
AATTAA
 
Protein sequence
MSKDIKFNFL NSDEEERYQK HLTLKEIGYE GQLNLKNSSV LCIGAGGLGS SVLLYLAATG 
IGRIGIVDND QVEKSNLQRQ IIHETNTIGN LKIDSARERI KNFNPNCEIL TFSERINPKN
ALELIKEFDV ICDCSDNFGT RYLINDSCLI LDKPLVFGSV QGFEGQVSVF NLYKNSPNLR
DLLPESPSKN AAPSCAEFGV VGVSTGLIGI LQVNEIIKII LKKGEILDGK ILIFDLLNMN
MKKLHLKSDQ SNKRIKNLSQ FEGFYNSDEY CEKNNEINII NADEFNSLYK AKPNKILLID
VRENEEFSSF AIEGSISIPL SHLKQASDLK FIQKESLNKE VFTICKSGKR SEKASRILSK
FKIKSRSIEG GIEKVKKILG N