Gene EcDH1_3472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3472 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3725067 
End bp3726296 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content50% 
IMG OID 
Productpolysaccharide deacetylase 
Protein accessionACX41087 
Protein GI260450665 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAAC AAGCTGTTAT TCTCCTGCTG ATGCTGTTTA CCGCAAGTGT CAGTGCCGCG 
TTACCTGCCC GTTATATGCA AACCATCGAA AATGCTGCGG TCTGGGCGCA AATTGGTGAC
AAGATGGTGA CCGTGGGGAA TATTCGGGCC GGACAAATCA TTGCCGTGGA GCCCACTGCC
GCAAGTTATT ACGCATTTAA TTTTGGCTTT GGCAAAGGTT TTATCGATAA AGGTCATCTC
GAGCCGGTTC AGGGGCGACA AAAAGTTGAA GACGGTTTGG GCGACCTCAA CAAGCCGCTG
AGTAATCAGA ACTTAGTTAC CTGGAAAGAT ACGCCGGTCT ATAACGCGCC GAGTGCGGGA
AGTGCGCCAT TTGGGGTACT GGCGGACAAT TTGCGCTACC CGATTTTGCA TAAACTGAAA
GACAGGTTAA ATCAAACCTG GTATCAGATC CGTATTGGCG ATCGACTGGC CTATATCAGC
GCACTGGATG CCCAACCCGA TAATGGCCTG TCGGTGCTAA CCTATCACCA TATTCTGCGC
GACGAAGAAA ACACCCGTTT TCGCCATACT TCGACGACCA CATCGGTACG CGCTTTCAAT
AACCAGATGG CCTGGCTGCG TGACAGGGGA TACGCGACAC TGAGCATGGT GCAGCTGGAA
GGCTACGTGA AGAATAAGAT CAATCTCCCT GCGCGAGCGG TGGTGATTAC CTTTGATGAT
GGCCTCAAGT CGGTGAGCCG CTATGCGTAT CCTGTGTTGA AACAATATGG CATGAAGGCG
ACGGCGTTTA TTGTTACCTC ACGCATCAAA CGTCACCCGC AGAAGTGGAA CCCAAAATCG
CTGCAATTTA TGAGCGTTTC TGAGCTTAAC GAAATTCGCG ATGTATTTGA TTTCCAGTCA
CATACCCATT TTTTGCATCG GGTAGATGGT TATCGCCGAC CCATATTACT GAGCCGTAGT
GAGCACAATA TTCTGTTTGA TTTTGCACGT TCACGCCGCG CTCTGGCGCA ATTTAATCCG
CATGTCTGGT ATCTTTCGTA TCCGTTTGGC GGATTTAATG ACAACGCCGT GAAGGCAGCA
AACGATGCCG GATTTCACCT GGCGGTGACA ACCATGAAAG GCAAAGTAAA ACCGGGGGAT
AATCCGTTGT TACTAAAACG ACTTTATATC TTAAGAACGG ATTCGCTGGA GACGATGTCG
CGGCTGGTGA GTAACCAGCC GCAGGGATAA
 
Protein sequence
MYKQAVILLL MLFTASVSAA LPARYMQTIE NAAVWAQIGD KMVTVGNIRA GQIIAVEPTA 
ASYYAFNFGF GKGFIDKGHL EPVQGRQKVE DGLGDLNKPL SNQNLVTWKD TPVYNAPSAG
SAPFGVLADN LRYPILHKLK DRLNQTWYQI RIGDRLAYIS ALDAQPDNGL SVLTYHHILR
DEENTRFRHT STTTSVRAFN NQMAWLRDRG YATLSMVQLE GYVKNKINLP ARAVVITFDD
GLKSVSRYAY PVLKQYGMKA TAFIVTSRIK RHPQKWNPKS LQFMSVSELN EIRDVFDFQS
HTHFLHRVDG YRRPILLSRS EHNILFDFAR SRRALAQFNP HVWYLSYPFG GFNDNAVKAA
NDAGFHLAVT TMKGKVKPGD NPLLLKRLYI LRTDSLETMS RLVSNQPQG