Gene NATL1_09901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_09901 
SymbolribB 
ID4781257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp908462 
End bp910147 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content35% 
IMG OID640084268 
Productbifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II/unknown domain fusion protein 
Protein accessionYP_001014813 
Protein GI124025697 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.089208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.190435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAATCAG AAGATTGTTA TGAAATTGAA TTTGATGATA TTGCAGATGC ACTAGCTGCT 
ATCAGAAATG GTGAATGTGT TGTTGTGGTT GATGATGAAA AGAGAGAAAA CGAAGGCGAT
TTAATATGTG CTGCTCAATT TGCGACTCCC CAGCAAATAA ATTTCATGGC AACAGAGGCT
AGAGGTTTGA TATGTCTAGC AATGCAAGGG GAAAGATTAG ACGAATTAGA TCTTCCTTTA
ATGGTTGACA GAAATACCGA TTCAAACCAA ACAGCATTCA CTGTAAGTAT CGATGCAGGT
CCTGAATTTG GAGTATCTAC GGGCATTTCA GCTGAAGATA GAGCTAAAAC AATTCAAGTT
GCTCTTAATA GTCAAACAAA ACCAATTGAT TTAAGAAGAC CCGGTCATAT TTTCCCTTTA
AGAGCAAAAA TTGGAGGAGT ATTAAAAAGG GCTGGACATA CGGAAGCGGC AGTAGACCTA
TCTTTGTTAG CAGGCTTATC TCCTGCGGGC GTTATCTGTG AAATTCAAAA TCTGGATGGC
TCAATGGCAA GATTGCCTGA GTTAAAAAAA TATGCGCAAG AAAGAAAATT AAAGTTGATC
AGTATTGCAG ATTTAATTCA CTACAGACTT GAAAATGAGA GATTTGTCTA CAGACAAGCA
GTAGCAAAGT TGCCTAGCCT ATTTGGAGAT TTCAAGGCAA TCGGTTACAA GAATGAATTG
GATGGATCAG AACATGTCGC GATAATAAAA GGAAATCCAG AAAATTTAAA AGAGCCGGTA
TTGGTAAGAA TGCACTCAGA GTGTCTAACA GGAGACGCAT TTGGATCATT AAGGTGTGAT
TGTCGGCCTC AATTAGAAGC TGCCTTGTCA AGAATTTCAG AAGAAGGAGA AGGAGTTGTT
GTCTATCTAA GGCAGGAAGG TAGAGGTATA GGTTTAGTTA ACAAATTAAA AGCCTATAAT
CTTCAAGATG GAGGATTAGA TACTGTTGAA GCCAATGAGA AATTAGGTTT TCCTGCAGAT
TTGAGAAACT ATGGAGTGGG GGCGCAAATA TTGACAGATT TAGGAATAAA TAGACTTAAA
TTACTAACAA ATAATCCTAG AAAAATAGCT GGTCTTGGTG GATATGGTCT TCAAGTTGAA
TCTAGAGTTC CATTAGTTAT TTGCCCCGGA GATCATAATG CGGCTTATCT TGAGGTAAAA
AGAGAAAAAC TTGGACACTT AATTGATAAT AATATTCAGA CAAATTTAAC CAATGAAAGA
CAAAATATTG TTGTCTATTG GGATGGAAAA GTTAACGACA GTGAATTAAA ACATTTTGAA
AATAAAGCAT GTAAGTGGTC AGAAAACCAT TTTTTAAATA TTTCTATTCA AACAGCTCCA
AGGTTAATAG CTCTATGTGA AAACCCATTA TTCATTTGGA ATGTTAGACA TAGAGATATC
AAAACACATT TGGAAGGTAA CTTTATAGAT AAAAGATTGC TTGAGTCACT ACTTAAGGAG
CTTAGCAATT GGAAAAATAC AGAAAGAGTT GGAATAATTA AAACTGACAA TTATGAAAGG
CTTTTACATC CATCTTCAAA TATAACTATA GAGTCAAAAA AAATAAGCGA ACTTTCAAAT
TTTGAAAATT CGCCATTATT TGATTGGAAT TTAAAAGATA AGACCAGTAC TTTCGAATGG
AGTTAA
 
Protein sequence
MKSEDCYEIE FDDIADALAA IRNGECVVVV DDEKRENEGD LICAAQFATP QQINFMATEA 
RGLICLAMQG ERLDELDLPL MVDRNTDSNQ TAFTVSIDAG PEFGVSTGIS AEDRAKTIQV
ALNSQTKPID LRRPGHIFPL RAKIGGVLKR AGHTEAAVDL SLLAGLSPAG VICEIQNLDG
SMARLPELKK YAQERKLKLI SIADLIHYRL ENERFVYRQA VAKLPSLFGD FKAIGYKNEL
DGSEHVAIIK GNPENLKEPV LVRMHSECLT GDAFGSLRCD CRPQLEAALS RISEEGEGVV
VYLRQEGRGI GLVNKLKAYN LQDGGLDTVE ANEKLGFPAD LRNYGVGAQI LTDLGINRLK
LLTNNPRKIA GLGGYGLQVE SRVPLVICPG DHNAAYLEVK REKLGHLIDN NIQTNLTNER
QNIVVYWDGK VNDSELKHFE NKACKWSENH FLNISIQTAP RLIALCENPL FIWNVRHRDI
KTHLEGNFID KRLLESLLKE LSNWKNTERV GIIKTDNYER LLHPSSNITI ESKKISELSN
FENSPLFDWN LKDKTSTFEW S