Gene P9301_12361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_12361 
Symbolgap3 
ID4911458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1048311 
End bp1049333 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content33% 
IMG OID640160825 
Productputative glyceraldehyde 3-phosphate dehydrogenase 
Protein accessionYP_001091460 
Protein GI126696574 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.257201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTG GAATTAATGG TTTTGGAAGA ATTGGCAGAT TAGTTTTCAG AGCATTATGG 
GATAGAGCTG ATATAGAAAT AACTCATATA AATGAGATAG CAGGAGATTC GAATGCTGCT
GCGCATTTAC TCGAATTCGA TTCAGTCCAT GGTAGATGGG TGAAAGATAT AAAGGTAAAA
GAAGAAGAAA TAATAATTGA TGGAAAGAAA TTAACCTACA CATCTTTTAA AAATTACCTT
GATATTCCTT GGGAAAAATC TTCTGTAGAT ATTATTTTGG AATGTACAGG AAAGAATAAA
AAGCCAGACA AACTAAATCC CTATTTTGAT ACTCTTGGGA TGAAAAGAGT AATAGTAGCT
TGTCCAGTCA AAGGAATTGT TGCAGAAGCT GAATCACTGA ATATTGTTTA CGGTATAAAT
CAAAGTCTTT ATGACCCTAC CAAACATAAA TTAGTAACTG CAGCATCCTG CACTACAAAT
TGTTTAGCTC CGATAGTAAA GGTAATTAAT GAAAATTTTT CTATTAAACA CGGTGCTATT
ACAACTATTC ACGATGTAAC GAACACTCAA GTTCCTGTAG ATTTTTATAA AAGTGATCTG
AGGAGAGCAA GAGGATGTAT GCAAAGTTTA ATACCTACTA CCACTGGATC TGCTAAAGCT
ATCGCTGAGA TCTTTCCAGA ATTAAAAGGA AAATTAAATG GACATGCAGT AAGAGTTCCT
CTACTTAATG GTTCTTTAAC AGATGCAGTT TTTGAATTAA ATAAAGAAGT GACAACTGAA
CAAGTGAATA TGGCACTAAA GGAAGCTTCA GAAACTTATT TAAAAGGAAT TCTTGGCTAC
GAAGAAAGAC CTTTAGTTTC TGCAGATTAT GTAAATGACT CTAGAAGTTC AATAGTTGAT
AGTTTATCAA CGATGGTTGT TAATTCAAAT TTATTAAAGA TATACGCTTG GTATGACAAC
GAGTGGGGTT ACAGCTGCAG ACTTGCAGAT CTTACTGAAT ATGTAATCAA AAAAGAGATT
TAA
 
Protein sequence
MKIGINGFGR IGRLVFRALW DRADIEITHI NEIAGDSNAA AHLLEFDSVH GRWVKDIKVK 
EEEIIIDGKK LTYTSFKNYL DIPWEKSSVD IILECTGKNK KPDKLNPYFD TLGMKRVIVA
CPVKGIVAEA ESLNIVYGIN QSLYDPTKHK LVTAASCTTN CLAPIVKVIN ENFSIKHGAI
TTIHDVTNTQ VPVDFYKSDL RRARGCMQSL IPTTTGSAKA IAEIFPELKG KLNGHAVRVP
LLNGSLTDAV FELNKEVTTE QVNMALKEAS ETYLKGILGY EERPLVSADY VNDSRSSIVD
SLSTMVVNSN LLKIYAWYDN EWGYSCRLAD LTEYVIKKEI