Gene ECH74115_3821 details


Gene Information

Locus tag: ECH74115_3821
Symbol:
ID: 6972055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3544267
End bp: 3546927
Gene Length: 2661 bp
Protein Length: 886 aa
Translation table: 11
GC content: 55%
IMG OID: 643387606
Product: acetyltransferase, GNAT family protein
Protein accession: YP_002272059
Protein GI: 209400567
COG category: [C] Energy production and conversion; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1042] Acyl-CoA synthetase (NDP forming); [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins
TIGRFAM ID:
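
The coordinate and length fields in this record agree with one another, which a short check can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3544267
end_bp = 3546927
reported_gene_length = 2661      # bp
reported_protein_length = 886    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the reported Gene Length and Protein Length, and the gene length is a whole number of codons.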


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.126569
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGC GAGGACTGGA AGCACTACTG CGACCAAAAT CGATAGCGGT AATTGGCGCG 
TCGATGAAAC CCAATCGCGC AGGTTACCTG ATGATGCGTA ACCTGCTGGC GGGAGGCTTT
AACGGACCGG TACTCCCGGT GACGCCAGCC TGGAAAGCGG TGTTGGGTGT GTTGGCCTGG
CCGGATATTG CCAGCTTGCC CTTTACACCC GACCTTGCGG TTTTATGTAC CAATGCCAGC
CGTAATCTTG CTCTTCTGGA AGAGCTCGGC GAGAAAGGCT GTAAAACCTG CATTATTCTT
TCCGCCCCGG CATCGCAACA CGAAGATCTC CGCGCCTGCG CCCTGCGCCA TAACATGCGC
CTGCTTGGAC CAAACAGTCT GGGTTTACTG GCTCCCTGGC AAGGTCTGAA TGCCAGCTTT
TCGCCTGTGC CGATTAAACG CGGCAAGCTG GCGTTTATTT CACAGTCGGC TGCCGTCTCC
AACACCATCC TCGACTGGGC GCAACAGCGT GAGATGGGCT TTTCCTACTT TATTGCGCTC
GGCGACAGCC TAGATATCGA CGTTGATGAA TTGCTTGACT ATCTGGCACG CGACAGTAAA
ACCAGCGCCA TCCTGCTCTA TCTCGAACAG TTAAGCGACG CGCGACGCTT TGTTTCGGCG
GCCCGTAGTG CCTCGCGTAA TAAACCGATT CTGGTGATTA AAAGCGGACG TAGCCCGGCA
GCACAGCGAC TGCTCAACAC GACGGCAGGA ATGGATCCGG CGTGGGATGC GGCGATTCAG
CGTGCCGGTT TGTTGCGGGT GCAGGATACC CACGAGCTGT TTTCGGCGGT GGAAACCCTC
AGCCATATGC GCCCGCTGCG TGGCGACCGG TTGATGATTA TCAGCAACGG TGCTGCGCCT
GCCGCGCTGG CGCTGGATGC CTTATGGTCA CGCAATGGCA AGCTGGCAAC GCTAAGCGAA
GAAACCTGCC AGAAACTGCG CGATGCACTG CCAGGACATG TGGCAATCTC TAACCCGCTC
GATCTGCGCG ATGACGCCAG CAGCGAGCAT TATGTCAAAA CGCTGGACAT TCTGCTCCAC
AGCCAGGATT TTGATGCGCT GATGGTTATT CATTCGCCCA GCGCCGCTGC TCCCGCAACA
GAAATCGCGC AAGTATTAAT TGAAGCGGTA AAGCATCATC CCCGCAGCAA GTATGTTTCT
CTGCTGACGA ACTGGTGCGG CGAGCACTCC TCGCAAGAGG CACGACGTTT ATTCAGCGAA
GCCGGGCTGC CGACCTACCG TACCCCGGAA GGAACCATCA CTGCTTTTAT GCATATGGTG
GAGTACCGGC GTAATCAGAA GCAACTACGC GAAACGCCGG CGTTGCCCAG CAATCTGACT
TCCAATACCG CAAAAGCGCA TCTTCTGTTG CAACAGGCGA TTGCCGAAGG GGCTACGTCG
CTCGATACCC ATGAAGTTCA GCCCATCCTG CAATCGTATG GGATGAACAC GCTCCCTACC
TGGATTGCCA GCGATAGCAC CGAAGCGGTG CATATTGCCG AACAGATTGG TTATCCGGTG
GCGCTGAAAT TGCGTTCGCC GGATATTCCA CATAAATCGG AAGTTCAGGG CGTCATGCTG
TACCTGCGTA CAGCCAATGA AGTCCAGCAA GCGGCAAACG CTATTTTCGA TCGCGTAAAA
ATGACCTGGC CGCAGGCGCG GGTCCACGGC CTGTTGGTGC AAAGTATGGC TAACCGTGCT
GGCGCTCAGG AGTTGCGGGT TGTGGTTGAG CACGATCCGG TTTTTGGGCC GTTGATCATG
CTGGGTGAAG GCGGTGTGGA GTGGCGTCCT GAAGATCAAG CCGTCGTCGC ACTGCCGCCG
CTGAATATGA ACCTGGCCCG CTATCTGGTT ATTCAGGGGA TCAAAAGTAA AAAGATTCGT
GCGCGCAGTG CGCTACGCCC ATTGGATGTT GCAGGCTTGA GCCAGCTTCT GGTGCAGGTT
TCCAACTTGA TTGTCGATTG CCCGGAAATT CAGCGTCTGG ATATTCATCC TTTGCTGGCT
TCTGGCAGTG AATTTACCGC GCTGGATGTC ACTCTGGATA TCGCGCCGTT TGAAGGCGAT
AACGAGAGTC GGCTGGCAGT GCGCCCTTAT CCGCATCAGC TGGAAGAATG GGTAGAATTG
AAAAACGGTG AACGCTGCTT GTTCCGCCCG ATTTTGCCAG AAGATGAGCC ACAACTTCAG
CAATTCATTT CGCGGGTCAC CAAAGAAGAT CTTTATTACC GCTACTTTAG CGAGATCAAC
GAATTTACCC ATGAAGATTT AGCCAACATG ACGCAGATCG ACTACGATCG GGAAATGGCG
TTTGTAGCGG TACGACGAAT TGATCAAACG GAAGAGATCC TCGGCGTCAC GCGTGCGATC
TCCGACCCTG ATAACATCGA TGCCGAATTT GCCGTGCTGG TTCGCTCGGA TCTCAAAGGG
TTAGGCTTGG GTCGACGCTT AATGGAAAAG TTGATTACCT ATACGCGAGA TCACGGACTA
CAACGTCTGA ATGGTATTAC GATGCCAAAC AATCGTGGCA TGGTGGCGCT GGCCCGCAAG
CTCGGGTTTA ACGTTGATAT CCAGCTCGAA GAGGGGATCG TTGGGCTTAC GCTAAATCTT
GCCCAACGCG AGGAATCATG A
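
The GC content reported under Gene Information can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is supplied as text in space-separated 10-base blocks as printed here; the function name `gc_fraction` is hypothetical, and the short input used in the example is a toy string, not data from this record.

```python
def gc_fraction(sequence_text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between blocks) is ignored,
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# Toy input (not from this record): 6 of the 8 bases are G or C.
toy_fraction = gc_fraction("ATGC GGCC")
print(f"{toy_fraction:.0%}")
```

To apply it to this gene, the full Gene sequence block would be passed as a single string and the result rounded to a whole percentage for comparison with the reported value.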
 
Protein sequence
MSQRGLEALL RPKSIAVIGA SMKPNRAGYL MMRNLLAGGF NGPVLPVTPA WKAVLGVLAW 
PDIASLPFTP DLAVLCTNAS RNLALLEELG EKGCKTCIIL SAPASQHEDL RACALRHNMR
LLGPNSLGLL APWQGLNASF SPVPIKRGKL AFISQSAAVS NTILDWAQQR EMGFSYFIAL
GDSLDIDVDE LLDYLARDSK TSAILLYLEQ LSDARRFVSA ARSASRNKPI LVIKSGRSPA
AQRLLNTTAG MDPAWDAAIQ RAGLLRVQDT HELFSAVETL SHMRPLRGDR LMIISNGAAP
AALALDALWS RNGKLATLSE ETCQKLRDAL PGHVAISNPL DLRDDASSEH YVKTLDILLH
SQDFDALMVI HSPSAAAPAT EIAQVLIEAV KHHPRSKYVS LLTNWCGEHS SQEARRLFSE
AGLPTYRTPE GTITAFMHMV EYRRNQKQLR ETPALPSNLT SNTAKAHLLL QQAIAEGATS
LDTHEVQPIL QSYGMNTLPT WIASDSTEAV HIAEQIGYPV ALKLRSPDIP HKSEVQGVML
YLRTANEVQQ AANAIFDRVK MTWPQARVHG LLVQSMANRA GAQELRVVVE HDPVFGPLIM
LGEGGVEWRP EDQAVVALPP LNMNLARYLV IQGIKSKKIR ARSALRPLDV AGLSQLLVQV
SNLIVDCPEI QRLDIHPLLA SGSEFTALDV TLDIAPFEGD NESRLAVRPY PHQLEEWVEL
KNGERCLFRP ILPEDEPQLQ QFISRVTKED LYYRYFSEIN EFTHEDLANM TQIDYDREMA
FVAVRRIDQT EEILGVTRAI SDPDNIDAEF AVLVRSDLKG LGLGRRLMEK LITYTRDHGL
QRLNGITMPN NRGMVALARK LGFNVDIQLE EGIVGLTLNL AQREES