Gene EcSMS35_1756 details


Gene Information

Locus tag: EcSMS35_1756
Symbol: gapC
ID: 6143907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1760214
End bp: 1761215
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 48%
IMG OID: 641616632
Product: glyceraldehyde-3-phosphate dehydrogenase (phosphorylating)
Protein accession: YP_001743810
Protein GI: 170680304
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
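
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the CDS is complete, unspliced, and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1760214
END_BP = 1761215
GENE_LENGTH_BP = 1002
PROTEIN_LENGTH_AA = 333


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose last codon is a stop.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1002
    print(expected_protein_length(GENE_LENGTH_BP))  # 333
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.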


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000000557721
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTAAAG TTGGTATTAA CGGTTTTGGT CGTATCGGTC GACTGGTGTT GCGTCGATTA 
CTTGAAGTCA AAAGCAACAT AGACATTGTT GCTATTAATG ATCTCACTTC CCCAAAAATT
CTCGCCTACC TGTTGAAACA TGATTCAAAC TACGGACCGT TCCCCTGGAG CGTTGATTTT
ACGGAAGATT CACTTATCGT TGATGGAAAA AGTATCGCGG TTTACGCCGA AAAAGAGGCT
AAAAATATTC CATGGAAAGC GAAAGGCGCA GAAATCATTG TCGAATGTAC TGGCTTTTAT
ACCTCCGCCG AGAAATCGCA GGCGCACCTT GATGCTGGTG CGAAGAAGGT GTTGATTTCC
GCCCCTGCCG GTGAAATGAA AACCATCGTT TATAACGTCA ATGACGACAC ACTGGATGGC
AACGACACCA TTGTTTCCGT GGCGTCATGC ACCACTAACT GTCTTGCACC GATGGCCAAA
GCCTTGCACG ACAGTTTCGG AATAGAAGTC GGCACGATGA CGACCATTCA TGCCTATACC
GGCACTCAGT CACTGGTGGA TGGTCCGCGA GGTAAAGATC TACGCGCTTC ACGTGCAGCG
GCAGAAAATA TCATTCCCCA CACTACAGGT GCGGCAAAAG CCATTGGTCT GGTGATCCCG
GAACTGAGCG GCAAACTGAA AGGTCATGCG CAACGCGTGC CGGTGAAAAC AGGTTCGGTC
ACTGAGCTGG TGTCCATTCT CGGAAAAAAA GTGACTGCCG AAGAGGTGAA TAACGCACTT
AAACAAGCAA CCACCAATAA CGAGTCATTT GGTTATACCG ATGAAGAAAT AGTCTCTTCC
GATATCATTG GCAGCCATTT CGGTTCGGTG TTTGATGCCA CGCAAACGGA AATTACCGCC
GTGGGCGATT TACAACTGGT GAAAACGGTC GCCTGGTACG ATAACGAATA TGGCTTCGTC
ACACAGCTCA TTCGCACCCT CGAAAAATTC GCTAAACTCT GA
 
Protein sequence
MSKVGINGFG RIGRLVLRRL LEVKSNIDIV AINDLTSPKI LAYLLKHDSN YGPFPWSVDF 
TEDSLIVDGK SIAVYAEKEA KNIPWKAKGA EIIVECTGFY TSAEKSQAHL DAGAKKVLIS
APAGEMKTIV YNVNDDTLDG NDTIVSVASC TTNCLAPMAK ALHDSFGIEV GTMTTIHAYT
GTQSLVDGPR GKDLRASRAA AENIIPHTTG AAKAIGLVIP ELSGKLKGHA QRVPVKTGSV
TELVSILGKK VTAEEVNNAL KQATTNNESF GYTDEEIVSS DIIGSHFGSV FDATQTEITA
VGDLQLVKTV AWYDNEYGFV TQLIRTLEKF AKL
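
The gene and protein sequences above can be compared by translating the nucleotide sequence, and the GC content can be recomputed from it. The sketch below is one way to do this in plain Python; it is not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons), and it assumes the sequence contains only A, C, G, and T. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycle T, C, A, G,
# with the third base varying fastest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence above.
    print(translate("ATGAGTAAAG TTGGTATTAA CGGTTTTGGT"))  # MSKVGINGFG
    print(translate("GCTAAACTCT GA"))                     # AKL
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the protein sequence and the GC content field of the record.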