Gene P9303_11051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_11051 
Symbolgap3 
ID4778291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp987357 
End bp988412 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content45% 
IMG OID640086614 
Productputative glyceraldehyde 3-phosphate dehydrogenase 
Protein accessionYP_001017119 
Protein GI124022812 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.853518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGC AAGGGCCAAT GCGTATTGGC ATCAACGGTT TTGGACGTAT CGGGCGGCTG 
GTCTTTCGGG CTCTTTGGGG GAGACCAGGG ATCGAGATCT GCCATGTGAA CGACCCATCT
GGGGAGGCTG CCACTGCAGC ACACCTGCTT GAGTTCGATT CTGTACACGG CCGCTGGGAT
CAGCTTGTTG GCGGCAGCGA GAACCAACTG TTGGTGAATC AACAAACCAT TAGTTACTCC
CAAGGAAGCA CTCCTGGCGC TGTGCCTTGG AATGACGCTG GCATTGACAT CGTTTTGGAA
TGCAGCGGTA AATTTAAATC ACCAGAAACT CTCAGCGTTT ACTTCGATCA ACATGCTTTA
AAACGTGTGA TTGTTGCTTG CCCAGTTAAA GGAAAACTAG CTGGCAAAGA AGTGTTGAAT
ATCGTTTATG GGATCAATCA TCATCTTTAT GACCCAAAGA TTGATCATTT AGTGACAGCA
GCATCATGTA CGACAAACTG TTTGGCTCCG TTGGTAAAGG TGGTGCATGA AAACTTTGGT
ATTTCACATG GCTCCATAAC AACTCTTCAT GATATTACTA ACACACAAGT TCCGATTGAT
AGCTTCCAAA AGGACTTACG TAGGGCTAGG AGTTGTCTAC AAAGTCTGAT CCCAACGACG
ACAGGCTCTG CCAAAGCAAT CGCAATGATC TTTCCAGATC TAGAGGGAAA ACTCAATGGA
CATGCTGTAA GAGTACCTTT GCTCAATGCT TCATTGACCG ATGCAGTTTT TGAGCTAAAG
CGCTCAATTA CGGTGAAGGA TGCCAATGAA GCGTTTAGGC ATGCAGCAGA ACATCAGCTA
AAGGGAATCC TTGGCTATGA AGACAAGCCT CTTGTCTCTA TTGATTACGT TAACGATTCT
CGAAGCTCAA TTATCGATGG ATTATCAACG ATGGTTGTTA ACGGCTCTCA GCTAAAGGTT
TACGCATGGT ACGACAATGA ATGGGGATAT AGCAGTCGAA TGGCAGATCT TACGTGTTAT
ATTTCTAATC TTGAAAATAC TAGTAATACC TTTTAA
 
Protein sequence
MAKQGPMRIG INGFGRIGRL VFRALWGRPG IEICHVNDPS GEAATAAHLL EFDSVHGRWD 
QLVGGSENQL LVNQQTISYS QGSTPGAVPW NDAGIDIVLE CSGKFKSPET LSVYFDQHAL
KRVIVACPVK GKLAGKEVLN IVYGINHHLY DPKIDHLVTA ASCTTNCLAP LVKVVHENFG
ISHGSITTLH DITNTQVPID SFQKDLRRAR SCLQSLIPTT TGSAKAIAMI FPDLEGKLNG
HAVRVPLLNA SLTDAVFELK RSITVKDANE AFRHAAEHQL KGILGYEDKP LVSIDYVNDS
RSSIIDGLST MVVNGSQLKV YAWYDNEWGY SSRMADLTCY ISNLENTSNT F