Gene Information |
Locus tag | EcSMS35_0140 |
Symbol | |
ID | 6146996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 155134 |
End bp | 156363 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641615041 |
Product | polysaccharide deacetylase domain-containing protein |
Protein accession | YP_001742257 |
Protein GI | 170680801 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
|
|
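The length fields in the table above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 155134
END_BP = 156363


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed present)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1230, matching "Gene Length"
print(protein_length(length))  # 409, matching "Protein Length"
```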
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAAAC AAGCTGTTAT TCTCCTGCTG ATGCTGTTTA CCGCAAGCGT CAGTGCCGCA TTACCTGCCC GTTATATGCA AACCATCGAA AATGCCGCGG TCTGGGCGCA AATTGGCGAC AAAATGGTGA CGGTAGGGAA TATTCGGGCC GGGCAAATCA TTGCCGTGGA GCCCACTGCC GCAAGTTATT ACGCATTTAA TTTTGGCTTT GGTAAAGGGT TTATCGATAA AGGCCATCTC GAGCCGGTTC AGGGGCGACA AAAAGTTGAA GATGGTTTGG GTGACCTCAA CAAGCCGCTG AGTAATCAGA ACTTAATAAC CTGGAAAGAT ACACTGGTTT ATAACGCACC GAGTGTGGGC AGTGCGCCAT TTGGAGTACT GGCGGACAAT TTGCGCTACC CGATTTTGCA TAAACTGAAA GACAGGTTAA ATCAAACCTG GTATCAGATC CGTATTGGCG ATCGACTGGC CTATATCAGC GCGCTGGATG CCCAACCCGA TAATGGCCTG CCGGTGCTAA CCTATCACCA TATTTTGCGC GACGAAGAAA ACACCCGTTT TCGCCATACT TCGACGACCA CCTCGGTACG CGCTTTCAAT AACCAGATGG CCTGGCTGCG CGACAGGGGA TACGCGACAC TGAGCATGGC GCAGTTGGAA GGTTACGTGA AGAATAAGAT CAATCTCCCT GCGCGAGCGG TGGTAATTAC CTTTGATGAT GGCCTCAAGT CGGTGAGCCG CTATGCGTAT CCTGTGTTGA AACAATATGG TATGAAGGCG ACGGCGTTTA TTGTTACCTC GCGCATCAAA CGTCACCCGC AGAAATGGAA CCCAAAATCG CTGCAATTTA TGAGCGTTTC TGAACTTAAC GAAATTCGCG ATGTATTTGA TTTCCAGTCA CATACTCATT TTTTGCATCG GATAGATGGT TATCGGCGGC CTATATTGCT GAGCCGTAGT GAGCACAATA TTCTGTTTGA TTTTGCACGT TCACGCCGCG CTCTGGCGCA ATTTAATCCA CATGTTTTGT ATCTTTCTTA TCCATTTGGC GGATTTAATG ACAAAGCCGT GAAGGCAGCA AAAGAAGCCG GATTTCACCT GGCGGTGACG ACCATGAAAG GCAAAGTAAA ACCGGGGGAT AATCCGTTGT TACTAAAACG ACTTTATATC TTAAGAACGG ATTCGCTGGA GACGATGTCG CGGCTGGTGA GTAACCAGCC GCAGGGATAA
|
Protein sequence | MYKQAVILLL MLFTASVSAA LPARYMQTIE NAAVWAQIGD KMVTVGNIRA GQIIAVEPTA ASYYAFNFGF GKGFIDKGHL EPVQGRQKVE DGLGDLNKPL SNQNLITWKD TLVYNAPSVG SAPFGVLADN LRYPILHKLK DRLNQTWYQI RIGDRLAYIS ALDAQPDNGL PVLTYHHILR DEENTRFRHT STTTSVRAFN NQMAWLRDRG YATLSMAQLE GYVKNKINLP ARAVVITFDD GLKSVSRYAY PVLKQYGMKA TAFIVTSRIK RHPQKWNPKS LQFMSVSELN EIRDVFDFQS HTHFLHRIDG YRRPILLSRS EHNILFDFAR SRRALAQFNP HVLYLSYPFG GFNDKAVKAA KEAGFHLAVT TMKGKVKPGD NPLLLKRLYI LRTDSLETMS RLVSNQPQG
|
| |
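The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene for comparison with the listed protein. It builds the codon table by hand in the T, C, A, G ordering used by NCBI; the amino acid assignments of translation table 11 are the same as those of the standard code. The helper names are hypothetical, and only the first three blocks of the gene are used in the example.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
head = "ATGTACAAAC AAGCTGTTAT TCTCCTGCTG"
print(translate(head))  # MYKQAVILLL, the first block of the protein sequence
```

Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the GC content and protein sequence reported in the record.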