Gene EcHS_A0134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0134 
Symbol 
ID5591459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp146565 
End bp147794 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content50% 
IMG OID640919321 
Productpolysaccharide deacetylase domain-containing protein 
Protein accessionYP_001456916 
Protein GI157159598 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAAC AAGCTGTTAT TCTCCTGCTG ATGCTGTTTA CCGCAAGTGT CAGTGCCGCG 
TTACCTGCCC GTTATATGCA AACCATCGAA AATGCTGCGG TCTGGGCGCA AATTGGTGAC
AAGATGGTGA CCGTGGGGAA TATTCGGGCC GGACAAATCA TTGCCGTGGA GCCCACTGCC
GCAAGTTATT ACGCATTTAA TTTTGGCTTT GGCAAAGGGT TTATCGATAA AGGTCATCTC
GAGCCGGTTC AGGGGCGACA AAAAGTTGAA GACGGTTTGG GTGACCTCAA CAAGCCGCTG
AGTCATCAGA ACTTAGTTAC CTGGAAAGAT ACGCCGGTTT ATAACGCGCC GAGTGCGGGC
AGTGCGCCAT TTGGGGTACT GGTGGACAAT TTGCGCTACC CGATTTTGCA TAAACTGAAA
GACAGGTTAA ATCAAACCTG GTATCAGATC CGTATTGGCG ATCGACTGGC CTATATCAGC
GCGCTGGATG CCCAACCCGA TAATGGCCTG CCGGTGCTAA CCTATCACCA TATTCTGCGC
GATGAAGAAA ACACCCGTTT TCGCCATACT TCGACGACCA CCTCGGTACG CGCTTTCAAT
AACCAGATGG CCTGGCTGCG CGACAGGGGA TACGCGACAC TGAGCATGGC GCAGCTGGAA
GGTTACGTGA AGAATAAGAT CAATCTCCCT GCGCGAGCGG TGGTGATTAC CTTTGATGAT
GGCCTCAAGT CGGTGAGCCG CTATGCGTAT CCTGTATTGA AACAATATGG CATGAAGGCG
ACGGCGTTTG TTGTTACCTC GCGCATCAAA CGTCACCCGC AGAAGTGGAA CCCAAAATCG
CTGCAATTTA TGAGCGTTTC TGAGCTTAAC GAAATTCGCG ATGTTTTTGA TTTTCAGTCA
CATACCCATT TTTTGCATCG GGTGGATGGC TATCGCCGAC CCATATTACT GAGCCGTAGT
GAGCACAATA TTCTGTTTGA TTTTGCACGT TCACGCCGCG CTCTGGCGCA ATTTAATCCG
CATGTCTGGT ATCTTTCGTA TCCGTTTGGC GGATTTAATG ACAAAGCCGT GAAGGCAGCA
AACGATGCCG GATTTCACCT GGCGGTGACA ACCATGAAAG GCAAAGTAAA ACCGGGGGAT
AATCCGTTGT TACTAAAACG ACTTTATATC TTAAGAACGG ATTCGCTGGA GACGATGTCG
CGGCTGGTGA GTAACCAGCC GCAGGGATAA
 
Protein sequence
MYKQAVILLL MLFTASVSAA LPARYMQTIE NAAVWAQIGD KMVTVGNIRA GQIIAVEPTA 
ASYYAFNFGF GKGFIDKGHL EPVQGRQKVE DGLGDLNKPL SHQNLVTWKD TPVYNAPSAG
SAPFGVLVDN LRYPILHKLK DRLNQTWYQI RIGDRLAYIS ALDAQPDNGL PVLTYHHILR
DEENTRFRHT STTTSVRAFN NQMAWLRDRG YATLSMAQLE GYVKNKINLP ARAVVITFDD
GLKSVSRYAY PVLKQYGMKA TAFVVTSRIK RHPQKWNPKS LQFMSVSELN EIRDVFDFQS
HTHFLHRVDG YRRPILLSRS EHNILFDFAR SRRALAQFNP HVWYLSYPFG GFNDKAVKAA
NDAGFHLAVT TMKGKVKPGD NPLLLKRLYI LRTDSLETMS RLVSNQPQG