Gene Information |
Locus tag | ECH74115_0138 |
Symbol | |
ID | 6966770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 149434 |
End bp | 150663 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643384215 |
Product | polysaccharide deacetylase domain protein |
Protein accession | YP_002268738 |
Protein GI | 209396198 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAAAC AAGCTGTTAT TCTCCTGCTG ATGCTGTTTA CCGCAAGTGT CAGTGCCGCG TTACCTGCCC GTTATATGCA AACCATCGAA AATGCTGCGG TCTGGGCGCA AATTGGTGAC AAGATGGTGA CCGTGGGGAA TATTCGGGCC GGACAAATCA TTGCCGTGGA GCCCACTGCC GCAAGTTATT ACGCATTTAA TTTTGGCTTT GATAAAGGGT TTATCGATAA AGGTCATCTT GAGCCGGTTC AGGGGCGACA AAAAGTTGAA GACGGTTTGG GTGACCTCAA CAAGCCGCTG AGTAATCAGA ACTTAGTTAC CTGGAAAGAT ACGCCGGTCT ATAACGCGCC GAGTGTGGGC AGTGCGCCAT TTGGGGTACT GGCGGACAAT TTGCGCTACC CGATTTTGCA TAAACTGAAA GACAGGTTAA ATCAAACCTG GTATCAGATC CGTATTGGCG ATCGACTGGC CTATATCAGC GCGCTGGATG CCCAACCCGA TAATGGCCTT CCGGTGCTAA CCTATCACCA TATTCTGCGC GACGAAGAAA ACACCCGTTT TCGCCATACT TCGACGACCA CCTCGGTACG CGCTTTCAAT AACCAGATGG CCTGGCTGCG CGACAGGGGA TACGCGACAC TGAGCATGGT GCAGCTGGAA GGTTACGTGA AGAATAAGAT CAATCTTCCT GCGCGAGCGG TGGTGATTAC CTTTGATGAT GGCCTCAAGT CAGTGAGCCG CTATGCGTAT CCTGTGTTGA AACAATATGG CATGAAGGCG ACGGCGTTTA TTGTTACCTC GCGCATCAAA CGTCACCCGC AGAAGTGGGA ACCAAAATCG CTGCAATTTA TGAGCGTTTC TGAGCTTAAC GAAATTCGCG ATGTATTTGA TTTCCAGTCA CATACCCATT TTTTGCATCG GGTAGATGGT TATCGCCGAC CCATATTACT GAGCCGTAGT GAGCACAATA TTCTGTTTGA TTTTGCACGT TCACGTCGCG CTCTGGCGCA ATTTAATCCG CATGTCTGGT ATCTTTCGTA TCCGTTTGGC GGCTTTAATG ACAAAGCCGT GAAGGCAGCA AACGATGCCG GATTTCACCT GGCGGTGACA ACCATGAAAG GCAAAGTAAA ACCGGGGGAT AATCCGTTGT TACTAAAACG ACTTTATATC TTAAGAACGG ATTCGCTGGA GACGATGTCG CGGCTGGTGA GTAACCAGCC GCAGGGATAA
|
Protein sequence | MYKQAVILLL MLFTASVSAA LPARYMQTIE NAAVWAQIGD KMVTVGNIRA GQIIAVEPTA ASYYAFNFGF DKGFIDKGHL EPVQGRQKVE DGLGDLNKPL SNQNLVTWKD TPVYNAPSVG SAPFGVLADN LRYPILHKLK DRLNQTWYQI RIGDRLAYIS ALDAQPDNGL PVLTYHHILR DEENTRFRHT STTTSVRAFN NQMAWLRDRG YATLSMVQLE GYVKNKINLP ARAVVITFDD GLKSVSRYAY PVLKQYGMKA TAFIVTSRIK RHPQKWEPKS LQFMSVSELN EIRDVFDFQS HTHFLHRVDG YRRPILLSRS EHNILFDFAR SRRALAQFNP HVWYLSYPFG GFNDKAVKAA NDAGFHLAVT TMKGKVKPGD NPLLLKRLYI LRTDSLETMS RLVSNQPQG
|
| |
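The coordinate and length fields in the record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the database: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. All function names are hypothetical helpers introduced here for illustration.

```python
# Values copied from the record above; helper names are illustrative only.
START_BP = 149434
END_BP = 150663
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def ungroup(sequence: str) -> str:
    """Remove the whitespace that splits the displayed sequence into blocks of 10."""
    return "".join(sequence.split())


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = ungroup(sequence).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Both comparisons hold for the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.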