Gene EcDH1_0809 details

Gene Information

Locus tag: EcDH1_0809
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start (bp): 856678
End (bp): 857994
Gene length: 1317 bp
Protein length: 438 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: guanine deaminase
Protein accession: ACX38493
Protein GI: 260448071
COG category:
COG ID:
TIGRFAM ID:
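
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical, not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 856678
end_bp = 857994
gene_length_bp = 1317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # prints: 1317 439 438
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.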


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGGAG AACACACGTT AAAAGCGGTA CGAGGCAGTT TTATTGATGT CACCCGTACG 
ATCGATAACC CGGAAGAGAT TGCCTCTGCG CTGCGGTTTA TTGAGGATGG TTTATTACTC
ATTAAACAGG GAAAAGTGGA ATGGTTTGGC GAATGGGAAA ACGGAAAGCA TCAAATTCCT
GACACCATTC GCGTGCGCGA CTATCGCGGC AAACTGATAG TACCGGGCTT TGTCGATACA
CATATCCATT ATCCGCAAAG TGAAATGGTG GGGGCCTATG GTGAGCAATT GCTGGAGTGG
TTGAATAAAC ACACCTTCCC TACTGAACGT CGTTATGAGG ATTTAGAGTA CGCCCGCGAA
ATGTCGGCGT TCTTCATCAA GCAGCTTTTA CGTAACGGAA CCACCACGGC GCTGGTGTTT
GGCACTGTTC ATCCGCAATC TGTTGATGCG CTGTTTGAAG CCGCCAGTCA TATCAATATG
CGTATGATTG CCGGTAAGGT GATGATGGAC CGCAACGCAC CGGATTATCT GCTCGACACT
GCCGAAAGCA GCTATCACCA AAGCAAAGAA CTGATCGAAC GCTGGCACAA AAATGGTCGT
CTGCTATATG CGATTACGCC ACGCTTCGCC CCGACCTCAT CTCCTGAACA GATGGCGATG
GCGCAACGCC TGAAAGAAGA ATATCCGGAT ACGTGGGTAC ATACCCATCT CTGTGAAAAC
AAAGATGAAA TTGCCTGGGT GAAATCGCTT TATCCTGACC ATGATGGTTA TCTGGATGTT
TACCATCAGT ACGGCCTGAC CGGTAAAAAC TGTGTCTTTG CTCACTGCGT CCATCTCGAA
GAAAAAGAGT GGGATCGTCT CAGCGAAACC AAATCCAGCA TTGCTTTCTG TCCGACCTCC
AACCTTTACC TCGGCAGCGG CTTATTCAAC TTGAAAAAAG CATGGCAGAA GAAAGTTAAA
GTGGGCATGG GAACGGATAT CGGTGCCGGA ACCACTTTCA ACATGCTGCA AACGCTGAAC
GAAGCCTACA AAGTATTGCA ATTACAAGGC TATCGCCTCT CGGCATATGA AGCGTTTTAC
CTGGCCACGC TCGGCGGAGC GAAATCTCTG GGCCTTGACG ATTTGATTGG CAACTTTTTA
CCTGGCAAAG AGGCTGATTT CGTGGTGATG GAACCCACCG CCACTCCGCT ACAGCAGCTG
CGCTATGACA ACTCTGTTTC TTTAGTCGAC AAATTGTTCG TGATGATGAC GTTGGGCGAT
GACCGTTCGA TCTACCGCAC CTACGTTGAT GGTCGTCTGG TGTACGAACG CAACTAA
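
A GC content figure such as the one listed under Gene Information is typically the share of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is supplied as text in the block-of-ten layout shown above. The function name `gc_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage with the first row of the gene sequence above.
first_row = "ATGTCAGGAG AACACACGTT AAAAGCGGTA CGAGGCAGTT TTATTGATGT CACCCGTACG"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of a single row gives the value for the whole gene.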
 
Protein sequence
MSGEHTLKAV RGSFIDVTRT IDNPEEIASA LRFIEDGLLL IKQGKVEWFG EWENGKHQIP 
DTIRVRDYRG KLIVPGFVDT HIHYPQSEMV GAYGEQLLEW LNKHTFPTER RYEDLEYARE
MSAFFIKQLL RNGTTTALVF GTVHPQSVDA LFEAASHINM RMIAGKVMMD RNAPDYLLDT
AESSYHQSKE LIERWHKNGR LLYAITPRFA PTSSPEQMAM AQRLKEEYPD TWVHTHLCEN
KDEIAWVKSL YPDHDGYLDV YHQYGLTGKN CVFAHCVHLE EKEWDRLSET KSSIAFCPTS
NLYLGSGLFN LKKAWQKKVK VGMGTDIGAG TTFNMLQTLN EAYKVLQLQG YRLSAYEAFY
LATLGGAKSL GLDDLIGNFL PGKEADFVVM EPTATPLQQL RYDNSVSLVD KLFVMMTLGD
DRSIYRTYVD GRLVYERN
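
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons), so alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with their one-letter amino acids ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage with the first row of the gene sequence above.
first_row = "ATGTCAGGAG AACACACGTT AAAAGCGGTA CGAGGCAGTT TTATTGATGT CACCCGTACG"
print(translate(first_row))  # prints: MSGEHTLKAVRGSFIDVTRT
```

The output matches the first twenty residues of the protein sequence listed above.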