Gene EcolC_2242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2242 
Symbol 
ID6064371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2460629 
End bp2461630 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content48% 
IMG OID641601647 
Productglyceraldehyde-3-phosphate dehydrogenase, type I 
Protein accessionYP_001725206 
Protein GI170020252 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAG TTGGTATTAA CGGTTTTGGT CGTATCGGTC GACTGGTGTT GCGTCGATTA 
CTTGAAGTCA AAAGCAACAT AGACGTTGTC GCTATTAATG ATCTCACTTC CCCAAAAATT
CTCGCCTACC TGCTGAAACA TGATTCAAAC TACGGACCAT TCCCCTGGAG CGTTGATTTT
ACGGAAGATT CACTTATCGT TGATGGGAAA AGTATCGCGG TTTACGCCGA AAAAGAGGCT
AAAAATATTC CGTGGAAAGC GAAAGGTGCA GAAATCATTG TCGAATGTAC TGGCTTTTAT
ACCTCCGCCG AGAAATCGCA GGCGCATCTT GATGCTGGTG CGAAGAAGGT GTTGATTTCC
GCCCCTGCCG GTGAAATGAA AACTATCGTT TATAACGTCA ATGACGACAC TCTGGATGGC
AACGACACCA TTGTTTCCGT GGCGTCATGC ACCACTAACT GTCTTGCGCC GATGGCCAAA
GCCTTGCATG ACAGTTTCGG GATAGAAGTC GGCACGATGA CGACCATTCA TGCCTATACT
GGCACCCAGT CACTGGTGGA TGGCCCGCGT GGTAAAGATT TACGTGCTTC ACGCGCAGCG
GCAGAAAATA TCATTCCCCA CACTACGGGG GCGGCAAAAG CCATTGGTCT GGTGATCCCG
GAACTGAGCG GCAAACTGAA AGGTCATGCG CAACGCGTGC CGGTGAAAAC AGGTTCGGTC
ACTGAACTGG TATCGATTCT CGGAAAAAAA GTGACTGCCG AAGAGGTGAA TAACGCACTT
AAACAAGCAA CCACCAATAA CGAGTCATTT GGTTATACCG ATGAAGAAAT AGTCTCTTCC
GATATCATTG GCAGCCATTT CGGTTCGGTG TTTGATGCCA CGCAAACGGA AATTACCGCC
GTGGGCGATT TACAACTGGT GAAAACGGTC GCCTGGTACG ATAACGAATA TGGCTTCGTC
ACACAGCTTA TTCGCACCCT CGAAAAATTC GCTAAACTCT GA
 
Protein sequence
MSKVGINGFG RIGRLVLRRL LEVKSNIDVV AINDLTSPKI LAYLLKHDSN YGPFPWSVDF 
TEDSLIVDGK SIAVYAEKEA KNIPWKAKGA EIIVECTGFY TSAEKSQAHL DAGAKKVLIS
APAGEMKTIV YNVNDDTLDG NDTIVSVASC TTNCLAPMAK ALHDSFGIEV GTMTTIHAYT
GTQSLVDGPR GKDLRASRAA AENIIPHTTG AAKAIGLVIP ELSGKLKGHA QRVPVKTGSV
TELVSILGKK VTAEEVNNAL KQATTNNESF GYTDEEIVSS DIIGSHFGSV FDATQTEITA
VGDLQLVKTV AWYDNEYGFV TQLIRTLEKF AKL