Gene ECH74115_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2022 
SymbolgapC 
ID6967263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1921811 
End bp1922812 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content48% 
IMG OID643385939 
Productglyceraldehyde-3-phosphate dehydrogenase 
Protein accessionYP_002270428 
Protein GI209399787 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00000572719 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAAAG TTGGTATTAA CGGTTTTGGT CGTATCGGTC GACTGGTGTT GCGTCGATTA 
CTTGAAGTCA AAAGCAACAT AGACGTTGTC GCTATTAATG ATCTTACTTC CCCAAAAATT
CTCGCCTATC TGCTGAAACA TGATTCAAAC TATGGTCCGT TCCCCTGGAG CGTTGATTTT
ACGGAAGATT CACTTATCGT TGATGGGAAA AGTATCGCGG TTTACGCCGA AAAAGAGGCT
AAAAATATTC CGTGGAAAGC GAAAGGTGCA GAAATCATTG TCGAATGTAC TGGCTTTTAT
ACCTCCGCCG AGAAATCGCA GGCGCATCTT GATGCTGGCG CGAAGAAGGT GTTGATTTCC
GCCCCTGCAG GTGAAATGAA AACTATCGTT TATAAAGTCA ATGACGACAC TCTGGATGGC
AACGACACCA TTGTTTCCGT GGCATCATGC ACTACTAACT GTCTTGCGCC GATGGCCAAA
GCCTTGCATG ACAGTTTCGG GATAGAAGTC GGCACGATGA CGACCATTCA TGCCTATACT
GGCACCCAGT CACTGGTGGA TGGCCCGCGT GGTAAAGATT TACGTGCTTC ACGCGCCGCG
GCAGAAAATA TCATTCCCCA CACTACGGGC GCGGCAAAAG CCATTGGTCT GGTGATCCCG
GAACTGAGCG GCAAACTGAA AGGTCATGCG CAACGCGTGC CGGTGAAAAC AGGTTCGGTC
ACTGAACTGG TATCGATTCT CGGAAAAAAA GTGACTGCCG AAGAGGTGAA TAACGCACTT
AAACAAGCAA CCACCAATAA CGAGTCATTT GGTTATACCG ATGAAGAAAT AGTCTCTTCC
GATATCATTG GCAGCCATTT CGGTTCGGTG TTTGATGCCA CGCAAACGGA AATTACCGCC
GTGGGCGATT TACAACTGGT GAAAACGGTC GCCTGGTACG ACAACGAATA TGGCTTCGTC
ACGCAGCTTA TTCGCACCCT CGAAAAATTC GCTAAACTCT GA
 
Protein sequence
MSKVGINGFG RIGRLVLRRL LEVKSNIDVV AINDLTSPKI LAYLLKHDSN YGPFPWSVDF 
TEDSLIVDGK SIAVYAEKEA KNIPWKAKGA EIIVECTGFY TSAEKSQAHL DAGAKKVLIS
APAGEMKTIV YKVNDDTLDG NDTIVSVASC TTNCLAPMAK ALHDSFGIEV GTMTTIHAYT
GTQSLVDGPR GKDLRASRAA AENIIPHTTG AAKAIGLVIP ELSGKLKGHA QRVPVKTGSV
TELVSILGKK VTAEEVNNAL KQATTNNESF GYTDEEIVSS DIIGSHFGSV FDATQTEITA
VGDLQLVKTV AWYDNEYGFV TQLIRTLEKF AKL