Gene P9211_00231 details


Gene Information

Locus tag: P9211_00231
Symbol: gap2
ID: 5730606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 24834
End bp: 25856
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 38%
IMG OID: 641284365
Product: glyceraldehyde 3-phosphate dehydrogenase(NADP+)(phosphorylating)
Protein accession: YP_001549908
Protein GI: 159902564
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
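
The start, end, gene length, and protein length fields are consistent with one another, and the relationship can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(24834, 25856)
print(length, protein_length_aa(length))  # 1023 340
```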


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.148459
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.106886
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTAC GTGTAGCGAT TAATGGATTT GGCCGAATTG GTCGTAACTT CATGCGTTGT 
TGGTTGAGTC GAGGCTCTAA TACTGGCTTA GAAGTTGTTG GTGTCAATGT AACTTCTGAC
CCTAAGACCA ATGCCCATTT GCTTAGATAT GACTCTATTC TTGGCGAACT CAAGGACACT
GAAATTGGTT ATACAGATGA CAATTTTATA ATCAATGGAA AAGAGATCAA ATGCTTTTCT
GATAGGAACC CTTTAAATTT GCCTTGGAAG GATTGGGGAG TAGATCTTGT AATTGAGTCA
ACTGGTGTTT TTAATACTTA TGAAGGGGCC AGTAAGCATT TAGCTATAGG AGCTAAGAAA
GTTATTCTTA CAGCTCCTGG TAAAGGTGAT GGCGTTGGTA CTTTCGTTGT TGGAGTGAAT
GCAGATCAAT ATAATCATTC AGATTTTAAT GTTCTTAGTA ATGCGAGTTG TACGACGAAC
TGTCTTGCAC CAGTAGTGAA GGTTTTAGAT CAAACTTTTG GAATTAACAA AGGTTTGATG
ACTACAATCC ATAGTTATAC AGGTGATCAA AGAATTCTTG ATAATAGTCA CCGTGACCTT
AGAAGAGCTA GAGCTGCAGC AATGAACATA GTGCCTACTT CCACTGGAGC AGCTAAAGCT
GTGGCGTTAG TTTATCCGGA AATGAAGGGC AAGTTAACTG GAATTGCAAT GAGAGTTCCT
ACTCCTAATG TTTCTGCAGT TGATATAGTT TTTGAAGCTG GTTGTTCAAT TACTGCAGAA
GATATTAATG CTGCTATGAA AACTGCTTCT GAGGGGTCTA TGAAGGGAAT TATTAAATAT
GGAGATCTTC CATTAGTCTC TAGTGATTAT GCCGGAACTA ATGAATCTTC TATTATTGAT
ACTGATTTGA CTATGGCTAT TGGTAATAAC ATGGGCAAAG TAGTTGCTTG GTACGATAAT
GAGTGGGGAT ATAGTCAAAG GGTTGTAGAT TTAGCAGAAA TTGTTGCTAA GAATTGGAAG
TAA
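
The GC content field can be recomputed from a sequence displayed in this layout. The sketch below strips the spaces that group bases in blocks of ten and returns the fraction of G and C bases. The helper names are hypothetical, and how the database rounds its reported percentage is not stated in this record, so the result may differ slightly from the listed value.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence; paste the full sequence
# here to recompute the value for the whole gene.
first_line = "ATGACCTTAC GTGTAGCGAT TAATGGATTT GGCCGAATTG GTCGTAACTT CATGCGTTGT"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```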
 
Protein sequence
MTLRVAINGF GRIGRNFMRC WLSRGSNTGL EVVGVNVTSD PKTNAHLLRY DSILGELKDT 
EIGYTDDNFI INGKEIKCFS DRNPLNLPWK DWGVDLVIES TGVFNTYEGA SKHLAIGAKK
VILTAPGKGD GVGTFVVGVN ADQYNHSDFN VLSNASCTTN CLAPVVKVLD QTFGINKGLM
TTIHSYTGDQ RILDNSHRDL RRARAAAMNI VPTSTGAAKA VALVYPEMKG KLTGIAMRVP
TPNVSAVDIV FEAGCSITAE DINAAMKTAS EGSMKGIIKY GDLPLVSSDY AGTNESSIID
TDLTMAIGNN MGKVVAWYDN EWGYSQRVVD LAEIVAKNWK
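
The protein sequence follows from the gene sequence under translation table 11. The sketch below is a minimal illustration, not the database's own method. It relies on table 11 sharing its codon-to-amino-acid assignments with the standard code and differing only in the set of permitted start codons. Alternative start codons are not handled, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code and differs only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate("ATGACCTTAC GTGTAGCGAT TAATGGATTT"))  # MTLRVAINGF
```

The printed residues match the first ten residues of the protein sequence listed above.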