Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0132 |
Symbol | |
ID | 5589341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 146499 |
End bp | 147728 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923861 |
Product | polysaccharide deacetylase domain-containing protein |
Protein accession | YP_001461298 |
Protein GI | 157156843 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAAAC AAGCTGTTAT TCTCCTGCTG ATGCTGTTTA CCGCAAGTGT CAGTGCCGCG TTACCTGCCC GTTATATGCA AACCATCGAA AATGCTGCGA TCTGGGCGCA AATTGGCGAC AAGATGGTGA CCGTGGGGAA TATTCGGGCC GGGCAAATCA TTGCCGTGGA GCCCACTGCC GCAAGTTATT ACGCATTTAA TTTTGGCTTT GGTAAAGGGT TTATCGATAA AGGGCATCTC GAGCCAGTTC AGGGGCGACA AAAAGTTGAA GACGGTTTGG GTGACCTCAA CAAGCCGCTG AGTAATCAGA ACTTAATTAC CTGGAAAGAT ACGCCGGTCT ATAACGCGCC GAGTGCGGGC AGTGCGCCAT TTGGGGTACT GGCGGACAAT TTGCGCTACC CGATTTTGCA TAAACTGAAA GACAGGTTAA ATCAAACCTG GTATCAGATC CGTATTGGCG ATCGACTCGC CTATATCAGC GCGCTGGATG CCCAACCCGA TAATGGCCTG CCGGTGCTAA CCTATCACCA TATTCTGCGC GATGAAGAAA ACACCCGTTT TCGCCATACT TCGACGACCA CCTCGGTACG CGCTTTCAAT AACCAGATGG CCTGGCTGCG CGACAGGGGA TACGCGACAC TGAGCATGGC GCAGCTGGAA GGTTACGTGA AGAATAAGAT CAATCTCCCT GCGCGAGCGG TGGTGATTAC CTTTGATGAT GGCCTCAAGT CGGTGAGCCG CTATGCGTAT CCTGTATTGA AACAATATGG CATGAAGGCG ACGGCGTTTG TTGTTACCTC GCGCATCAAA CGTCACCCGC AGAAGTGGAA CCCAAAATCG CTGCAATTTA TGAGCGTTTC TGAGCTTAAC GAAATTCGCG ATGTTCTTGA TTTTCAGTCA CATACCCATT TTTTGCATCG GGTGGATGGC TATCGCCGAC CCATATTACT GAGCCGTAGT GAGCACAATA TTCTGTTTGA TTTTGCACGT TCACGCCGCG CTCTGGCGCA ATTTAATCCG CATGTCTGGT ATCTTTCGTA TCCGTTTGGC GGATTTAATG ACAAAGCCGT GAAGGCAGCA AACGATGCCG GATTTCACCT GGCGGTGACA ACCATGAAAG GCAAAGTAAA ACCGGGGGAT AATCCGTTGT TACTAAAACG ACTTTATATC TTAAGAACGG ATTCGCTGGA GACGATGTCG CGGCTGGTGA GTAACCAGCC GCAGGGATAA
|
Protein sequence | MYKQAVILLL MLFTASVSAA LPARYMQTIE NAAIWAQIGD KMVTVGNIRA GQIIAVEPTA ASYYAFNFGF GKGFIDKGHL EPVQGRQKVE DGLGDLNKPL SNQNLITWKD TPVYNAPSAG SAPFGVLADN LRYPILHKLK DRLNQTWYQI RIGDRLAYIS ALDAQPDNGL PVLTYHHILR DEENTRFRHT STTTSVRAFN NQMAWLRDRG YATLSMAQLE GYVKNKINLP ARAVVITFDD GLKSVSRYAY PVLKQYGMKA TAFVVTSRIK RHPQKWNPKS LQFMSVSELN EIRDVLDFQS HTHFLHRVDG YRRPILLSRS EHNILFDFAR SRRALAQFNP HVWYLSYPFG GFNDKAVKAA NDAGFHLAVT TMKGKVKPGD NPLLLKRLYI LRTDSLETMS RLVSNQPQG
|
| |