Gene ECH74115_3679 details

Gene Information

Locus tag: ECH74115_3679
Symbol: eutD
ID: 6970108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3392802
End bp: 3393818
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 58%
IMG OID: 643387473
Product: phosphotransacetylase
Protein accession: YP_002271926
Protein GI: 209400708
COG category: [C] Energy production and conversion
COG ID: [COG0280] Phosphotransacetylase
TIGRFAM ID: [TIGR00651] phosphate acetyltransferase
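
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention, and the variable names are only illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 3392802
end_bp = 3393818
gene_length_bp = 1017
protein_length_aa = 338

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons (three bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
residues = codons - 1

print(span, remainder, residues)  # prints: 1017 0 338
```

Under these assumptions the coordinates, the gene length and the protein length in the record agree with one another.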


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATTATTG AACGTTGTTG TGAACTGGCG TTGCGAGCGC CCGCCAGAGT GGTTTTTCCG 
GATGCGTTAG ATCAACGTGT GCTGAAAGCT GCGCAATATT TACATCAACA AGGTCTGGCA
ACGCCCATTC TGGTCGCCAA TCCGTTTGAA CTTCGTCAGT TTGCGCTCAG TCACGGCGTG
GCGATGGACG GGCTACAGGT GATAGATCCG CATGGCAACC TCGCAATGCG GGAAGAATTT
GCTCATCGCT GGCTGGCCCG CGCGGGCGAA AAAACGCCGC CGGATGCGCT GGAAAAACTT
ACCGATCCGC TGATGTTCGC CGCCGCAATG GTCAGCGCCG GTAAAGCGGA TGTCTGTATC
GCGGGCAACC TCTCTTCCAC GGCGAATGTG CTGCGTGCCG GATTACGCAT TATCGGCTTG
CAGCCAGGCT GTAAAACGCT CTCATCCATT TTCCTGATGC TGCCACAGTA CAGCGGTCCG
GCGTTGGGCT TTGCCGATTG CAGCGTGGTA CCACAGCCGA CGGCGGCGCA GCTGGCGGAT
ATCGCGCTTG CCAGCGCCGA AACCTGGCGC GCCATCACCG GAGAAGAGCC GCGCGTGGCG
ATGCTGTCGT TTTCCAGTAA CGGTAGCGCC CGTCACCCCT GCGTTGCCAA TGTCCAGCAG
GCGACAGAAA TCGTCCGTGA GCGCGCACCA AAGCTGGTAG TCGATGGCGA GTTGCAGTTT
GACGCCGCCT TCGTGCCGGA AGTGGCGGCG CAAAAAGCGC CTGCCAGCCC GCTACAGGGC
AAGGCCAATG TGATGGTTTT TCCGTCGCTG GAAGCCGGAA ATATTGGCTA CAAAATCGCA
CAACGACTCG GCGGATATCG TGCCGTCGGG CCATTGATAC AAGGACTTGC CGCGCCGATG
CACGATCTCT CTCGTGGTTG TAGTGTGCAG GAAATTATCG AGCTGGCGCT GGTGGCAGCT
GTGCCGCGTC AGACAGAAGT GAACCGCGAA AGCAGTTTAC AAACACTGGT TGAATGA
 
Protein sequence
MIIERCCELA LRAPARVVFP DALDQRVLKA AQYLHQQGLA TPILVANPFE LRQFALSHGV 
AMDGLQVIDP HGNLAMREEF AHRWLARAGE KTPPDALEKL TDPLMFAAAM VSAGKADVCI
AGNLSSTANV LRAGLRIIGL QPGCKTLSSI FLMLPQYSGP ALGFADCSVV PQPTAAQLAD
IALASAETWR AITGEEPRVA MLSFSSNGSA RHPCVANVQQ ATEIVRERAP KLVVDGELQF
DAAFVPEVAA QKAPASPLQG KANVMVFPSL EAGNIGYKIA QRLGGYRAVG PLIQGLAAPM
HDLSRGCSVQ EIIELALVAA VPRQTEVNRE SSLQTLVE
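
Both sequences above are printed in space-separated blocks of ten characters, so the spaces and line breaks have to be removed before the sequences can be measured or analysed. The following is a minimal sketch of that clean-up together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence; paste every line to process the whole gene.
first_line = "ATGATTATTG AACGTTGTTG TGAACTGGCG TTGCGAGCGC CCGCCAGAGT GGTTTTTCCG"

seq = clean_sequence(first_line)
print(len(seq))                   # number of bases in the sample line
print(seq[:3])                    # first codon of the sample
print(f"{gc_fraction(seq):.2f}")  # GC fraction of the sample line only
```

Run on all lines of the gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field of the record.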