Gene ECH74115_5335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5335 
Symbol 
ID6967045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4977132 
End bp4978121 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content55% 
IMG OID643388996 
Productacetyltransferase, GNAT family 
Protein accessionYP_002273405 
Protein GI209398236 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1246] N-acetylglutamate synthase and related acetyltransferases 
TIGRFAM ID[TIGR02447] thioesterase domain, putative 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.251571 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC TTCCAGGGTT GTCACGGGAA ACAAGAGAGA GTATCGCTAT GTATCACCTT 
CGGGTTCCAC AAACAGAAGA AGAATTAGAG CGTTACTATC AGTTTCGCTG GGAAATGTTG
CGTAAGCCCC TGCATCAACC AAAAGGTTCG GAACGCGACG CGTGGGATGC GATGGCGCAT
CACCAGATGG TCGTCGACGA GCAGGGTAAT CTGGTGGCGG TAGGCCGACT GTATATTAAT
GCCGACAATG AAGCGTCCAT TCGCTTTATG GCCGTTCATC CCGACGTGCA GGACAAAGGG
TTAGGCACGC TGATGGCGAT GACCCTGGAG TCGGTGGCGC GTCAGGAAGG CGTTAAGCGC
GTGACCTGTA GCGCCCGTGA AGACGCGGTG GAGTTTTTCG CCAAGCTGGG GTTTGTTAAT
CAGGGAGAAA TCACCACGCC AACCACCACG CCGATTCGCC ATTTTTTGAT GATTAAGCCC
GTCGCCACTC TGGATGATAT TTTGCATCGC GGCGACTGGT GCGCGCAGCT GCAACAGGCG
TGGTACGAAC ATATCCCGCT TAGTGAAAAA ATGGGCGTGC GCATTCAGCA ATATACCGGG
CAAAAATTTA TCACTACCAT GCCAGAAACC GGCAATCAGA ATCCGCACCA TACGCTGTTT
GCCGGGAGTT TATTCTCACT GGCGACGCTC ACCGGTTGGG GGCTTATCTG GCTGATGCTG
CGCGAACGCC ACCTCGGCGG AACGATTATT CTGGCGGATG CGCATATCCG CTACAGCAAA
CCGATTAGCG GTAAACCCCA TGCGGTAGCC GACCTTGGTG CCTTAAGCGG CGATCTCGAC
CGTCTGGCGC GCGGACGAAA AGCACGGGTG CAGATGCAGG TCGAAATCTT TGGCGACGAG
ACGCCGGGTG CAGTGTTTGA AGGCACGTAT ATCGTTCTGC CCGCGAAGCC ATTTGGCCCG
TATGAAGAGG GCGGGAACGA AGAAGAGTAG
 
Protein sequence
MSQLPGLSRE TRESIAMYHL RVPQTEEELE RYYQFRWEML RKPLHQPKGS ERDAWDAMAH 
HQMVVDEQGN LVAVGRLYIN ADNEASIRFM AVHPDVQDKG LGTLMAMTLE SVARQEGVKR
VTCSAREDAV EFFAKLGFVN QGEITTPTTT PIRHFLMIKP VATLDDILHR GDWCAQLQQA
WYEHIPLSEK MGVRIQQYTG QKFITTMPET GNQNPHHTLF AGSLFSLATL TGWGLIWLML
RERHLGGTII LADAHIRYSK PISGKPHAVA DLGALSGDLD RLARGRKARV QMQVEIFGDE
TPGAVFEGTY IVLPAKPFGP YEEGGNEEE