Gene A9601_00221 details

Gene Information

Locus tag: A9601_00221
Symbol: gap2
ID: 4716704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 24082
End bp: 25104
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 37%
IMG OID: 640077719
Product: glyceraldehyde 3-phosphate dehydrogenase (NADP+) (phosphorylating)
Protein accession: YP_001008417
Protein GI: 123967559
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
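
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 24082
END_BP = 25104
GENE_LENGTH_BP = 1023
PROTEIN_LENGTH_AA = 340


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = coding_codons(GENE_LENGTH_BP)

print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span from 24082 to 25104 covers 1023 bp, and 1023 bp corresponds to 341 codons, of which 340 encode amino acids.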


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0664623
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTTGC GTGTTGCAAT TAACGGCTTT GGCAGAATTG GTCGAAACTT TATGCGTTGT 
TGGCTTAGTA GAGGGGCTTA CACCAATATT GAGGTAGTTG GGATTAATGT TACCTCTGAT
CCTAAGACTA ATGCTCATTT ATTGAAATAT GACTCAGTCC TTGGTCAACT GGATGGTGTT
GATATTCAAT ACACTGACGA TACTTTTGTA ATTAATAACA AGACAATTAA ATGTTTCTCT
GATAGAAACC CTATGAACCT CCCTTGGAAA GAATGGGGTG TAGATTTGGT TATTGAATCT
ACTGGAGTAT TTAATACAGA CGTAGGTGCA AGTAAGCACT TAGAGGTAGG AGCAAAAAAA
GTAATCTTAA CTGCTCCCGG TAAAGGCGAT GGTGTTGGTA CTTATGTAGT TGGAGTTAAT
GCTGATACAT ATAAACATAA AGATTATGAT ATTTTGAGTA ATGCTAGTTG TACAACGAAC
TGTTTAGCTC CAGTAGTTAA AGTTTTAGAC CAAACTTTTG GGATTAATAA AGGTTTGATG
ACTACAATTC ATAGTTATAC AGGGGATCAA AGAATTTTAG ATAATAGTCA TAGAGATCTA
AGAAGGGCTA GAGCCGCAGC TACAAACATT GTTCCTACTT CTACAGGAGC TGCAAAAGCA
GTAGCTCTGG TATACCCAGA AATGAAAGGC AAATTAACAG GAATTGCAAT GAGAGTTCCA
ACTCCTAATG TTTCAGCAGT AGATTTTGTT TTTGAATCTT CTAAATCTGT CACAGCTGAA
GAAGTCAACA CTGCTCTCAA GGAAGCATCT TTAGGCTCAA TGAAAGGAAT TATTAAGTAT
GGAGATGAAC CATTAGTTTC AAGCGATTAT GCAGGTACCA ATGAATCATC AATTGTAGAT
AGCGACCTCA CTATGTGTAT CGGCGACAAT CTTGTTAAGG TCCTTGCTTG GTATGACAAC
GAGTGGGGCT ATAGTCAAAG GGTTGTTGAT TTAGCAGAGA TTGTTGCTAA AAATTGGGAA
TAA
 
Protein sequence
MTLRVAINGF GRIGRNFMRC WLSRGAYTNI EVVGINVTSD PKTNAHLLKY DSVLGQLDGV 
DIQYTDDTFV INNKTIKCFS DRNPMNLPWK EWGVDLVIES TGVFNTDVGA SKHLEVGAKK
VILTAPGKGD GVGTYVVGVN ADTYKHKDYD ILSNASCTTN CLAPVVKVLD QTFGINKGLM
TTIHSYTGDQ RILDNSHRDL RRARAAATNI VPTSTGAAKA VALVYPEMKG KLTGIAMRVP
TPNVSAVDFV FESSKSVTAE EVNTALKEAS LGSMKGIIKY GDEPLVSSDY AGTNESSIVD
SDLTMCIGDN LVKVLAWYDN EWGYSQRVVD LAEIVAKNWE