Gene ECH74115_0138 details


Gene Information

Locus tag: ECH74115_0138
Symbol:
ID: 6966770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 149434
End bp: 150663
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 49%
IMG OID: 643384215
Product: polysaccharide deacetylase domain protein
Protein accession: YP_002268738
Protein GI: 209396198
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
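
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record: Start bp and End bp.
START_BP = 149434
END_BP = 150663


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1230
print(protein_length(length_bp))  # 409
```

Both results agree with the Gene Length and Protein Length fields listed in the record.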


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACAAAC AAGCTGTTAT TCTCCTGCTG ATGCTGTTTA CCGCAAGTGT CAGTGCCGCG 
TTACCTGCCC GTTATATGCA AACCATCGAA AATGCTGCGG TCTGGGCGCA AATTGGTGAC
AAGATGGTGA CCGTGGGGAA TATTCGGGCC GGACAAATCA TTGCCGTGGA GCCCACTGCC
GCAAGTTATT ACGCATTTAA TTTTGGCTTT GATAAAGGGT TTATCGATAA AGGTCATCTT
GAGCCGGTTC AGGGGCGACA AAAAGTTGAA GACGGTTTGG GTGACCTCAA CAAGCCGCTG
AGTAATCAGA ACTTAGTTAC CTGGAAAGAT ACGCCGGTCT ATAACGCGCC GAGTGTGGGC
AGTGCGCCAT TTGGGGTACT GGCGGACAAT TTGCGCTACC CGATTTTGCA TAAACTGAAA
GACAGGTTAA ATCAAACCTG GTATCAGATC CGTATTGGCG ATCGACTGGC CTATATCAGC
GCGCTGGATG CCCAACCCGA TAATGGCCTT CCGGTGCTAA CCTATCACCA TATTCTGCGC
GACGAAGAAA ACACCCGTTT TCGCCATACT TCGACGACCA CCTCGGTACG CGCTTTCAAT
AACCAGATGG CCTGGCTGCG CGACAGGGGA TACGCGACAC TGAGCATGGT GCAGCTGGAA
GGTTACGTGA AGAATAAGAT CAATCTTCCT GCGCGAGCGG TGGTGATTAC CTTTGATGAT
GGCCTCAAGT CAGTGAGCCG CTATGCGTAT CCTGTGTTGA AACAATATGG CATGAAGGCG
ACGGCGTTTA TTGTTACCTC GCGCATCAAA CGTCACCCGC AGAAGTGGGA ACCAAAATCG
CTGCAATTTA TGAGCGTTTC TGAGCTTAAC GAAATTCGCG ATGTATTTGA TTTCCAGTCA
CATACCCATT TTTTGCATCG GGTAGATGGT TATCGCCGAC CCATATTACT GAGCCGTAGT
GAGCACAATA TTCTGTTTGA TTTTGCACGT TCACGTCGCG CTCTGGCGCA ATTTAATCCG
CATGTCTGGT ATCTTTCGTA TCCGTTTGGC GGCTTTAATG ACAAAGCCGT GAAGGCAGCA
AACGATGCCG GATTTCACCT GGCGGTGACA ACCATGAAAG GCAAAGTAAA ACCGGGGGAT
AATCCGTTGT TACTAAAACG ACTTTATATC TTAAGAACGG ATTCGCTGGA GACGATGTCG
CGGCTGGTGA GTAACCAGCC GCAGGGATAA
 
Protein sequence
MYKQAVILLL MLFTASVSAA LPARYMQTIE NAAVWAQIGD KMVTVGNIRA GQIIAVEPTA 
ASYYAFNFGF DKGFIDKGHL EPVQGRQKVE DGLGDLNKPL SNQNLVTWKD TPVYNAPSVG
SAPFGVLADN LRYPILHKLK DRLNQTWYQI RIGDRLAYIS ALDAQPDNGL PVLTYHHILR
DEENTRFRHT STTTSVRAFN NQMAWLRDRG YATLSMVQLE GYVKNKINLP ARAVVITFDD
GLKSVSRYAY PVLKQYGMKA TAFIVTSRIK RHPQKWEPKS LQFMSVSELN EIRDVFDFQS
HTHFLHRVDG YRRPILLSRS EHNILFDFAR SRRALAQFNP HVWYLSYPFG GFNDKAVKAA
NDAGFHLAVT TMKGKVKPGD NPLLLKRLYI LRTDSLETMS RLVSNQPQG
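
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted). The helper names `clean_sequence`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for unrecognized codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record.
first_line = clean_sequence(
    "ATGTACAAAC AAGCTGTTAT TCTCCTGCTG ATGCTGTTTA CCGCAAGTGT CAGTGCCGCG"
)
print(translate(first_line))  # MYKQAVILLLMLFTASVSAA
```

The printed residues match the first 20 residues of the protein sequence. Passing the complete gene sequence through `clean_sequence` and `gc_percent` gives a value to compare with the GC content field.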