Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3821 |
Symbol | |
ID | 6972055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3544267 |
End bp | 3546927 |
Gene Length | 2661 bp |
Protein Length | 886 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387606 |
Product | acetyltransferase, GNAT family protein |
Protein accession | YP_002272059 |
Protein GI | 209400567 |
COG category | [C] Energy production and conversion [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.126569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGC GAGGACTGGA AGCACTACTG CGACCAAAAT CGATAGCGGT AATTGGCGCG TCGATGAAAC CCAATCGCGC AGGTTACCTG ATGATGCGTA ACCTGCTGGC GGGAGGCTTT AACGGACCGG TACTCCCGGT GACGCCAGCC TGGAAAGCGG TGTTGGGTGT GTTGGCCTGG CCGGATATTG CCAGCTTGCC CTTTACACCC GACCTTGCGG TTTTATGTAC CAATGCCAGC CGTAATCTTG CTCTTCTGGA AGAGCTCGGC GAGAAAGGCT GTAAAACCTG CATTATTCTT TCCGCCCCGG CATCGCAACA CGAAGATCTC CGCGCCTGCG CCCTGCGCCA TAACATGCGC CTGCTTGGAC CAAACAGTCT GGGTTTACTG GCTCCCTGGC AAGGTCTGAA TGCCAGCTTT TCGCCTGTGC CGATTAAACG CGGCAAGCTG GCGTTTATTT CACAGTCGGC TGCCGTCTCC AACACCATCC TCGACTGGGC GCAACAGCGT GAGATGGGCT TTTCCTACTT TATTGCGCTC GGCGACAGCC TAGATATCGA CGTTGATGAA TTGCTTGACT ATCTGGCACG CGACAGTAAA ACCAGCGCCA TCCTGCTCTA TCTCGAACAG TTAAGCGACG CGCGACGCTT TGTTTCGGCG GCCCGTAGTG CCTCGCGTAA TAAACCGATT CTGGTGATTA AAAGCGGACG TAGCCCGGCA GCACAGCGAC TGCTCAACAC GACGGCAGGA ATGGATCCGG CGTGGGATGC GGCGATTCAG CGTGCCGGTT TGTTGCGGGT GCAGGATACC CACGAGCTGT TTTCGGCGGT GGAAACCCTC AGCCATATGC GCCCGCTGCG TGGCGACCGG TTGATGATTA TCAGCAACGG TGCTGCGCCT GCCGCGCTGG CGCTGGATGC CTTATGGTCA CGCAATGGCA AGCTGGCAAC GCTAAGCGAA GAAACCTGCC AGAAACTGCG CGATGCACTG CCAGGACATG TGGCAATCTC TAACCCGCTC GATCTGCGCG ATGACGCCAG CAGCGAGCAT TATGTCAAAA CGCTGGACAT TCTGCTCCAC AGCCAGGATT TTGATGCGCT GATGGTTATT CATTCGCCCA GCGCCGCTGC TCCCGCAACA GAAATCGCGC AAGTATTAAT TGAAGCGGTA AAGCATCATC CCCGCAGCAA GTATGTTTCT CTGCTGACGA ACTGGTGCGG CGAGCACTCC TCGCAAGAGG CACGACGTTT ATTCAGCGAA GCCGGGCTGC CGACCTACCG TACCCCGGAA GGAACCATCA CTGCTTTTAT GCATATGGTG GAGTACCGGC GTAATCAGAA GCAACTACGC GAAACGCCGG CGTTGCCCAG CAATCTGACT TCCAATACCG CAAAAGCGCA TCTTCTGTTG CAACAGGCGA TTGCCGAAGG GGCTACGTCG CTCGATACCC ATGAAGTTCA GCCCATCCTG CAATCGTATG GGATGAACAC GCTCCCTACC TGGATTGCCA GCGATAGCAC CGAAGCGGTG CATATTGCCG AACAGATTGG TTATCCGGTG GCGCTGAAAT TGCGTTCGCC GGATATTCCA CATAAATCGG AAGTTCAGGG CGTCATGCTG TACCTGCGTA CAGCCAATGA AGTCCAGCAA GCGGCAAACG CTATTTTCGA TCGCGTAAAA ATGACCTGGC CGCAGGCGCG GGTCCACGGC CTGTTGGTGC AAAGTATGGC TAACCGTGCT GGCGCTCAGG AGTTGCGGGT TGTGGTTGAG CACGATCCGG TTTTTGGGCC GTTGATCATG CTGGGTGAAG GCGGTGTGGA GTGGCGTCCT GAAGATCAAG CCGTCGTCGC ACTGCCGCCG CTGAATATGA ACCTGGCCCG CTATCTGGTT ATTCAGGGGA TCAAAAGTAA AAAGATTCGT GCGCGCAGTG CGCTACGCCC ATTGGATGTT GCAGGCTTGA GCCAGCTTCT GGTGCAGGTT TCCAACTTGA TTGTCGATTG CCCGGAAATT CAGCGTCTGG ATATTCATCC TTTGCTGGCT TCTGGCAGTG AATTTACCGC GCTGGATGTC ACTCTGGATA TCGCGCCGTT TGAAGGCGAT AACGAGAGTC GGCTGGCAGT GCGCCCTTAT CCGCATCAGC TGGAAGAATG GGTAGAATTG AAAAACGGTG AACGCTGCTT GTTCCGCCCG ATTTTGCCAG AAGATGAGCC ACAACTTCAG CAATTCATTT CGCGGGTCAC CAAAGAAGAT CTTTATTACC GCTACTTTAG CGAGATCAAC GAATTTACCC ATGAAGATTT AGCCAACATG ACGCAGATCG ACTACGATCG GGAAATGGCG TTTGTAGCGG TACGACGAAT TGATCAAACG GAAGAGATCC TCGGCGTCAC GCGTGCGATC TCCGACCCTG ATAACATCGA TGCCGAATTT GCCGTGCTGG TTCGCTCGGA TCTCAAAGGG TTAGGCTTGG GTCGACGCTT AATGGAAAAG TTGATTACCT ATACGCGAGA TCACGGACTA CAACGTCTGA ATGGTATTAC GATGCCAAAC AATCGTGGCA TGGTGGCGCT GGCCCGCAAG CTCGGGTTTA ACGTTGATAT CCAGCTCGAA GAGGGGATCG TTGGGCTTAC GCTAAATCTT GCCCAACGCG AGGAATCATG A
|
Protein sequence | MSQRGLEALL RPKSIAVIGA SMKPNRAGYL MMRNLLAGGF NGPVLPVTPA WKAVLGVLAW PDIASLPFTP DLAVLCTNAS RNLALLEELG EKGCKTCIIL SAPASQHEDL RACALRHNMR LLGPNSLGLL APWQGLNASF SPVPIKRGKL AFISQSAAVS NTILDWAQQR EMGFSYFIAL GDSLDIDVDE LLDYLARDSK TSAILLYLEQ LSDARRFVSA ARSASRNKPI LVIKSGRSPA AQRLLNTTAG MDPAWDAAIQ RAGLLRVQDT HELFSAVETL SHMRPLRGDR LMIISNGAAP AALALDALWS RNGKLATLSE ETCQKLRDAL PGHVAISNPL DLRDDASSEH YVKTLDILLH SQDFDALMVI HSPSAAAPAT EIAQVLIEAV KHHPRSKYVS LLTNWCGEHS SQEARRLFSE AGLPTYRTPE GTITAFMHMV EYRRNQKQLR ETPALPSNLT SNTAKAHLLL QQAIAEGATS LDTHEVQPIL QSYGMNTLPT WIASDSTEAV HIAEQIGYPV ALKLRSPDIP HKSEVQGVML YLRTANEVQQ AANAIFDRVK MTWPQARVHG LLVQSMANRA GAQELRVVVE HDPVFGPLIM LGEGGVEWRP EDQAVVALPP LNMNLARYLV IQGIKSKKIR ARSALRPLDV AGLSQLLVQV SNLIVDCPEI QRLDIHPLLA SGSEFTALDV TLDIAPFEGD NESRLAVRPY PHQLEEWVEL KNGERCLFRP ILPEDEPQLQ QFISRVTKED LYYRYFSEIN EFTHEDLANM TQIDYDREMA FVAVRRIDQT EEILGVTRAI SDPDNIDAEF AVLVRSDLKG LGLGRRLMEK LITYTRDHGL QRLNGITMPN NRGMVALARK LGFNVDIQLE EGIVGLTLNL AQREES
|
| |