Gene EcSMS35_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1946 
SymboltreA 
ID6142681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1966387 
End bp1968084 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content52% 
IMG OID641616822 
Producttrehalase 
Protein accessionYP_001743998 
Protein GI170681871 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0674464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCC CCACCCCTTC TCGCCCGCAA AAAATGGCGT TAATTCCAGC CTGCATCTTT 
TTGTGTTTTG CTGCGCTATC GGTGCAGGCA GAAGAAACAT CGGTAACGCC ACAGCCGCCT
GATATTTTAT TAGGGCCGCT GTTTAATGAT GTGCAAAACG CCAAACTTTT TCCGGACCAA
AAAACCTTTG CCGATGCCGT GCCGAACAGC GATCCGCTGA TGATCCTTGC TGATTATCGG
ATGCAGCAAA ACCAGAGCGG ATTTGATCTG CGCCATTTCG TTAACGTCAA TTTCACCCTG
CCAAAAGAGG GCGAGAAATA TGTTCCGCCA GAGGGGCAGT CACTGCGCGA ACATATTGAC
GGACTTTGGC CGGTATTAAC GCGTGCTACC GAAAATACCG AAAAATGGGA TTCTCTGTTA
CCGCTGCCGG AACCTTATGT CGTGCCGGGC GGACGTTTTC GCGAGGTATA TTACTGGGAC
AGTTACTTCA CCATGTTAGG ACTTGCCGAA AGCGGTCACT GGGATAAAGT CGCGGATATG
GTGGCCAATT TTGCTCATGA AATAGACACT TACGGTCATA TTCCCAACGG CAACCGCAGT
TACTATTTAA GCCGCTCGCA GCCGCCCTTC TTTGCCCTGA TGGTAGAGTT ACTGGCGCAG
CATGAAGGCG ATGCCGCGTT GAAACAATAC CTGCCGCAAA TGCAAAAAGA ATATGCTTAC
TGGATGGACG GTGTTGAAAA CCTGCAAGCC GGACAACAGG AAAAACGCGT TGTCAAACTT
CAGGATGGTA CCCTTCTCAA CCGCTACTGG GACGATCGCG ATACGCCACG ACCAGAGTCA
TGGGTGGAAG ATATTGCCAC CGCCAAAAGC AATCCGAATC GACCTGCCAC TGAAATTTAC
CGCGATCTGC GCTCTGCCGC TGCGTCTGGC TGGGATTTCA GCTCGCGCTG GATGGACAAT
CCGCAGCAGT TAAATACCTT ACGCACCACC AGCATCGTAC CGGTCGATCT GAACAGCCTG
ATGTTTAAAA TGGAAAAAAT CCTCGCCCGC GCCAGCAAAG CTGCCGGAGA TAACGCGATG
GCAAACCAGT ACGAAACGCT GGCGAATGCC CGTCAAAAAG GGATCGAAAA ATACCTGTGG
AACGATCAAC AAGGCTGGTA TGCCGATTAC GACCTGAAAA GTCATAAAGT GCGCAATCAG
TTAACCGCGG CCGCCCTGTT CCCGCTGTAC GTCAATGCGG CAGCGAAAGA TCGCGCCAGC
AAAATGGCGA CGGCGACGAA AACACATCTG CTGCAACCCG GCGGCCTGAA CACCACGTCG
GTGAAAAGTG GACAACAATG GGATGCGCCA AACGGCTGGG CACCGTTGCA GTGGGTCGCG
ACAGAAGGAT TACAAAACTA CGGGCAAAAA GAGGTGGCGA TGGACATTAG CTGGCACTTC
CTGACCAATG TTCAGAACAC CTATGACCGG GAGAAAAAGC TGGTGGAAAA ATATGATGTC
AGCGCCACCG GAACAGGGGG CGGCGGTGGC GAATATCCAT TACAGGATGG CTTTGGCTGG
ACCAATGGCG TGACGCTGAA AATGCTGGAT TTGATCTGCC CGAAAGAGCA ACCGTGTGAC
AATGTTCCGG CGACGCGTCC ACTCAGTGAA TCAACAACAC AGCCGCTAAA ACAAAAAGAG
GCGGAACCTA CGCCTTAA
 
Protein sequence
MKSPTPSRPQ KMALIPACIF LCFAALSVQA EETSVTPQPP DILLGPLFND VQNAKLFPDQ 
KTFADAVPNS DPLMILADYR MQQNQSGFDL RHFVNVNFTL PKEGEKYVPP EGQSLREHID
GLWPVLTRAT ENTEKWDSLL PLPEPYVVPG GRFREVYYWD SYFTMLGLAE SGHWDKVADM
VANFAHEIDT YGHIPNGNRS YYLSRSQPPF FALMVELLAQ HEGDAALKQY LPQMQKEYAY
WMDGVENLQA GQQEKRVVKL QDGTLLNRYW DDRDTPRPES WVEDIATAKS NPNRPATEIY
RDLRSAAASG WDFSSRWMDN PQQLNTLRTT SIVPVDLNSL MFKMEKILAR ASKAAGDNAM
ANQYETLANA RQKGIEKYLW NDQQGWYADY DLKSHKVRNQ LTAAALFPLY VNAAAKDRAS
KMATATKTHL LQPGGLNTTS VKSGQQWDAP NGWAPLQWVA TEGLQNYGQK EVAMDISWHF
LTNVQNTYDR EKKLVEKYDV SATGTGGGGG EYPLQDGFGW TNGVTLKMLD LICPKEQPCD
NVPATRPLSE STTQPLKQKE AEPTP