Gene Information |
Locus tag | ECH74115_4173 |
Symbol | guaD |
ID | 6971810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3868316 |
End bp | 3869632 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643387919 |
Product | guanine deaminase |
Protein accession | YP_002272358 |
Protein GI | 209398700 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02967] guanine deaminase |
|
|
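The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the original record. It assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes the stop codon. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in one stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the terminal stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(3868316, 3869632))
```

Under these assumptions the result agrees with the reported Gene Length (1317 bp) and Protein Length (438 aa).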
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGAG AACACACGTT AAAAGCGGTA CGAGGCAGTT TTATTGATGT CGCCCGTACG GTCGATAACC CGGAAGAGAT TGCCTCTGCG CTGCGGTTTA TTGAGGATGG TTTATTACTC ATTAAACAGG GAAAAGTGGA ATGGTTTGGC GAATGGGAAG ACGGAAAGCA TCAAATTCCT GACACTATTC GCGTGCGCGA CTATCGCGGC AAACTGATAG TACCGGGCTT TGTCGATACA CATATCCATT ATCCGCAAAG TGAAATGGTG GGGGCCTATG GGGAACAATT GCTGGAGTGG TTGAATAAAC ACACCTTCCC TACTGAACGT CGTTATGAGG ATTTAGAGTA CGCCCGCGAA ATGTCGGCGT TCTTCATCAA GCAGCTTTTA CGTAACGGAA CCACCACGGC GCTGGTGTTT GGCACTGTTC ATCCGCAATC CGTTGATGCG CTGTTTGAAG CCGCCAGTCA TATCAATATG CGTATGATTG CCGGTAAGGT GATGATGGAC CGCAACGCAC CGGATTATCT GCTAGACACT GCCGAAAGCA GCTATCACCA AAGCAAAGAA CTGATTGAAC GCTGGCACAA AAATGGTCGT CTGCTATATG CGATTACGCC ACGCTTCGCC CCTACTTCTT CTCCTGAACA GATGGCGATG GCGCAACGCC TGAAAGAGGA ATATCCGGAT ACGTGGGTAC ATACCCATCT CTGTGAAAAC AAAGATGAAA TTGCCTGGGT GAAATCGCTT TATCCTGACC ATGATGGTTA TCTGGATGTT TACCATCAGT ACGGCCTGAC CGGTAAAAAC TGTGTCTTTG CTCACTGCGT CCATCTCAAA GAAAAAGAGT GGGATCGTCT CAGCGAAACC AAATCCAGCA TTGCTTTCTG TCCGACCTCC AACCTTTACC TCGGCAGCGG GTTATTCAAC TTGAAAAAAG CATGGCAGAA GAAAGTTAAA GTGGGCATGG GAACGGATAT CGGTGCCGGA ACCACTTTCA ACATGCTGCA AACGCTGAAC GAAGCCTACA AAGTGTTGCA ATTACAAGGC TATCGCCTCT CGGCATATGA AGCGTTTTAC CTGGCCACGC TCGGCGGAGC GAAATCTCTG GGCCTTGACG ATTTGATTGG CAACTTTTTA CCTGGCAAAG AGGCTGATTT CGTGGTGATG GAACCCACCG CCACTCCGCT ACAGCAGCTG CGCTATGACA ACTCTGTTTC TTTAGTCGAC AAATTGTTCG TGATGATGAC GTTGGGCGAT GACCGTTCGA TCTACCGCAC CTACGTTGAT GGTCGTCTGG TGTACGAACG CAACTAA
|
Protein sequence | MSGEHTLKAV RGSFIDVART VDNPEEIASA LRFIEDGLLL IKQGKVEWFG EWEDGKHQIP DTIRVRDYRG KLIVPGFVDT HIHYPQSEMV GAYGEQLLEW LNKHTFPTER RYEDLEYARE MSAFFIKQLL RNGTTTALVF GTVHPQSVDA LFEAASHINM RMIAGKVMMD RNAPDYLLDT AESSYHQSKE LIERWHKNGR LLYAITPRFA PTSSPEQMAM AQRLKEEYPD TWVHTHLCEN KDEIAWVKSL YPDHDGYLDV YHQYGLTGKN CVFAHCVHLK EKEWDRLSET KSSIAFCPTS NLYLGSGLFN LKKAWQKKVK VGMGTDIGAG TTFNMLQTLN EAYKVLQLQG YRLSAYEAFY LATLGGAKSL GLDDLIGNFL PGKEADFVVM EPTATPLQQL RYDNSVSLVD KLFVMMTLGD DRSIYRTYVD GRLVYERN
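The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch (not part of the original record), the code below strips that spacing, computes a GC fraction, and translates a coding sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation codons; it does not model the alternative start codons that table 11 allows. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTCAGGAG AACACACGTT AAAAGCGGTA"))
```

Running `translate` on the first thirty bases gives MSGEHTLKAV, the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the reported GC content.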