Gene PMN2A_0317 details

Gene Information

Locus tag: PMN2A_0317
Symbol:
ID: 3605690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 854756
End bp: 856441
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 35%
IMG OID: 637687176
Product: bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II/unknown domain fusion protein
Protein accession: YP_291512
Protein GI: 72382157
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
TIGRFAM ID: [TIGR00505] GTP cyclohydrolase II
            [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
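
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not count as a residue; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 854756
end_bp = 856441

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the last codon is assumed to be the stop codon,
# so it is subtracted from the residue count.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1686
print(protein_length)  # 561
```

Both values agree with the Gene Length (1686 bp) and Protein Length (561 aa) fields.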


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0930503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAATCAG AAGATTGTTA TGAAATTGAA TTTGATGATA TTGCAGATGC ACTAGCTGCT 
ATCAGAAATG GTGAATGTGT TGTTGTGGTT GATGATGAAA AGAGAGAAAA CGAAGGCGAT
TTAATATGTG CTGCTCAATT TGCGACTCCC CAGCAAATAA ATTTTATGGC AACAGAAGCT
AGAGGTTTGA TATGTCTAGC AATGCAAGGG GAAAGGTTAG ATGAATTAGA TCTTCCTTTA
ATGGTTGACA GAAATACCGA CTCAAACCAA ACAGCATTCA CTGTAAGTAT CGATGCAGGT
CCAGAATTTG GAGTATCTAC TGGTATTTCA GCTGAAGATA GAGCTAAAAC AATTCAAGTT
GCTCTTAACA GTCAAACAAA ACCAATTGAT TTAAGAAGAC CCGGTCATAT TTTCCCTTTA
AGAGCAAAAA TTGGAGGAGT ATTAAAAAGG GCTGGACATA CGGAAGCGGC AGTAGACCTA
TCTTTGTTAG CAGGCTTATC TCCTGCAGGC GTTATCTGTG AAATTCAAAA TCTGGATGGC
TCAATGGCAA GATTACCAGA GTTAAAAAAA TATGCGAGAG AAAGAAATTT AAAATTGATC
AGTATTGCAG ATTTAATTCA CTACAGACTT GAAAATGAGA GATTTGTGTA CAGACAAGCA
GTAGCAAAGT TGCCTAGCCT ATTTGGAGAT TTCAAGGCAA TCGGTTACAA GAATGAATTG
GATGGATCAG AACATGTCGC GATAATAAAA GGAAATCCAG AAAATTTAAA AGAGCCGGTA
TTGGTAAGAA TGCACTCAGA GTGTCTTACA GGAGATGCAT TTGGATCATT AAGGTGCGAT
TGTCGGCCTC AATTAGAAGC TGCCTTGTCA AGAATTTCAG AAGAAGGAGA AGGAGTTGTT
GTCTATCTAA GGCAGGAAGG GAGAGGTATA GGTTTAGTTA ACAAATTAAA AGCCTATAAT
CTACAAGATG GAGGATTAGA TACTGTTGAA GCCAATGAGA AGTTAGGTTT TCCTGCAGAT
TTAAGAAACT ATGGAGTAGG GGCGCAAATA TTGACAGATT TAGGAATAAA TAGACTTAAA
TTACTAACAA ATAATCCTAG AAAAATAGCT GGTCTTGGTG GATATGGTCT TCAGGTTGAA
TCTAGAGTTC CATTAGTTAT TTGTCCCGGA GATCATAATG CGGCTTATCT TGAGGTGAAA
AGAGAAAAAC TTGGACACTT AATTGATAAT AATATTCAGA GGAATTTAAC AAATGAAAGA
CAAAATATTG TTGTCTATTG GGATGGAAAA GTTAACAACA GTGAATTGAA GCATTTTGAA
GATAAAGCAT GTAAGTGGTC AGAAAACCAT TTTTTAAATA TTTCTATTCA AACAGCTCCA
AGGTTGATAG CTCTATGTGA AAACCCATTA TTTATTTGGA ATGTTAGACA TAGAGATATC
AAAACACATT TAGAAGGTAA CTTGATAGAT AAAAGATTGC TTGAGTCACT ACTTAAGGAG
TTGAGCAATT GGGAAAATAC AGAAAGAATA GGAATCATTA AAACTGAGAA TTATGAAAGA
CTTTTACATC CTTCTTCAAA TATATCTATA GAGTCAAAAA AAATAAGCGA ACTTTCAAAT
TTTGAAAATT CGCCATTATT TGACTGGAAT TTAAAAGATA AGACGAGTAC TATTGAATGG
AGTTAA
 
Protein sequence
MKSEDCYEIE FDDIADALAA IRNGECVVVV DDEKRENEGD LICAAQFATP QQINFMATEA 
RGLICLAMQG ERLDELDLPL MVDRNTDSNQ TAFTVSIDAG PEFGVSTGIS AEDRAKTIQV
ALNSQTKPID LRRPGHIFPL RAKIGGVLKR AGHTEAAVDL SLLAGLSPAG VICEIQNLDG
SMARLPELKK YARERNLKLI SIADLIHYRL ENERFVYRQA VAKLPSLFGD FKAIGYKNEL
DGSEHVAIIK GNPENLKEPV LVRMHSECLT GDAFGSLRCD CRPQLEAALS RISEEGEGVV
VYLRQEGRGI GLVNKLKAYN LQDGGLDTVE ANEKLGFPAD LRNYGVGAQI LTDLGINRLK
LLTNNPRKIA GLGGYGLQVE SRVPLVICPG DHNAAYLEVK REKLGHLIDN NIQRNLTNER
QNIVVYWDGK VNNSELKHFE DKACKWSENH FLNISIQTAP RLIALCENPL FIWNVRHRDI
KTHLEGNLID KRLLESLLKE LSNWENTERI GIIKTENYER LLHPSSNISI ESKKISELSN
FENSPLFDWN LKDKTSTIEW S
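
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it assumes that translation table 11 uses the standard amino acid assignments and accepts TTG (among others) as an initiation codon that is read as methionine. The helper names `translate` and `gc_fraction` are hypothetical. Only the first 60 bp of the gene are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: initiation codons accepted by table 11; each is read as M
# when it occupies the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from this record.
first_60_bp = "TTGAAATCAG AAGATTGTTA TGAAATTGAA TTTGATGATA TTGCAGATGC ACTAGCTGCT"

print(translate(first_60_bp))  # MKSEDCYEIEFDDIADALAA
```

The result matches the first 20 residues of the protein sequence above, including the leading M encoded by the TTG start codon. Calling `gc_fraction` on the complete gene sequence gives a value to compare against the GC content field.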