Gene EcSMS35_0140 details


Gene Information

Locus tag: EcSMS35_0140
Symbol:
ID: 6146996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 155134
End bp: 156363
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 48%
IMG OID: 641615041
Product: polysaccharide deacetylase domain-containing protein
Protein accession: YP_001742257
Protein GI: 170680801
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
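
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. Neither convention is stated in this record, but both are consistent with the reported values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 155134
end_bp = 156363
reported_gene_length = 1230     # bp
reported_protein_length = 409   # aa


def gene_length(start, end):
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons hold for this record.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```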


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACAAAC AAGCTGTTAT TCTCCTGCTG ATGCTGTTTA CCGCAAGCGT CAGTGCCGCA 
TTACCTGCCC GTTATATGCA AACCATCGAA AATGCCGCGG TCTGGGCGCA AATTGGCGAC
AAAATGGTGA CGGTAGGGAA TATTCGGGCC GGGCAAATCA TTGCCGTGGA GCCCACTGCC
GCAAGTTATT ACGCATTTAA TTTTGGCTTT GGTAAAGGGT TTATCGATAA AGGCCATCTC
GAGCCGGTTC AGGGGCGACA AAAAGTTGAA GATGGTTTGG GTGACCTCAA CAAGCCGCTG
AGTAATCAGA ACTTAATAAC CTGGAAAGAT ACACTGGTTT ATAACGCACC GAGTGTGGGC
AGTGCGCCAT TTGGAGTACT GGCGGACAAT TTGCGCTACC CGATTTTGCA TAAACTGAAA
GACAGGTTAA ATCAAACCTG GTATCAGATC CGTATTGGCG ATCGACTGGC CTATATCAGC
GCGCTGGATG CCCAACCCGA TAATGGCCTG CCGGTGCTAA CCTATCACCA TATTTTGCGC
GACGAAGAAA ACACCCGTTT TCGCCATACT TCGACGACCA CCTCGGTACG CGCTTTCAAT
AACCAGATGG CCTGGCTGCG CGACAGGGGA TACGCGACAC TGAGCATGGC GCAGTTGGAA
GGTTACGTGA AGAATAAGAT CAATCTCCCT GCGCGAGCGG TGGTAATTAC CTTTGATGAT
GGCCTCAAGT CGGTGAGCCG CTATGCGTAT CCTGTGTTGA AACAATATGG TATGAAGGCG
ACGGCGTTTA TTGTTACCTC GCGCATCAAA CGTCACCCGC AGAAATGGAA CCCAAAATCG
CTGCAATTTA TGAGCGTTTC TGAACTTAAC GAAATTCGCG ATGTATTTGA TTTCCAGTCA
CATACTCATT TTTTGCATCG GATAGATGGT TATCGGCGGC CTATATTGCT GAGCCGTAGT
GAGCACAATA TTCTGTTTGA TTTTGCACGT TCACGCCGCG CTCTGGCGCA ATTTAATCCA
CATGTTTTGT ATCTTTCTTA TCCATTTGGC GGATTTAATG ACAAAGCCGT GAAGGCAGCA
AAAGAAGCCG GATTTCACCT GGCGGTGACG ACCATGAAAG GCAAAGTAAA ACCGGGGGAT
AATCCGTTGT TACTAAAACG ACTTTATATC TTAAGAACGG ATTCGCTGGA GACGATGTCG
CGGCTGGTGA GTAACCAGCC GCAGGGATAA
 
Protein sequence
MYKQAVILLL MLFTASVSAA LPARYMQTIE NAAVWAQIGD KMVTVGNIRA GQIIAVEPTA 
ASYYAFNFGF GKGFIDKGHL EPVQGRQKVE DGLGDLNKPL SNQNLITWKD TLVYNAPSVG
SAPFGVLADN LRYPILHKLK DRLNQTWYQI RIGDRLAYIS ALDAQPDNGL PVLTYHHILR
DEENTRFRHT STTTSVRAFN NQMAWLRDRG YATLSMAQLE GYVKNKINLP ARAVVITFDD
GLKSVSRYAY PVLKQYGMKA TAFIVTSRIK RHPQKWNPKS LQFMSVSELN EIRDVFDFQS
HTHFLHRIDG YRRPILLSRS EHNILFDFAR SRRALAQFNP HVLYLSYPFG GFNDKAVKAA
KEAGFHLAVT TMKGKVKPGD NPLLLKRLYI LRTDSLETMS RLVSNQPQG
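
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count over the gene sequence. The following is a minimal sketch of both calculations, not the method the database itself uses. It assumes that internal codons in table 11 are read as in the standard genetic code (the two tables differ in their permitted start codons), that the sequence is in frame from its first base, and that the 10-base blocks are separated only by whitespace. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTACAAAC AAGCTGTTAT TCTCCTGCTG"
print(translate(first_blocks))  # prints MYKQAVILLL
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.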