Gene Information |
Locus tag | EcSMS35_1946 |
Symbol | treA |
ID | 6142681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1966387 |
End bp | 1968084 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616822 |
Product | trehalase |
Protein accession | YP_001743998 |
Protein GI | 170681871 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
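The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
# Sketch: check that the reported lengths agree with the reported coordinates.
# Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1966387, 1968084)
print(length)                     # 1698
print(protein_length_aa(length))  # 565
```

Both values match the Gene Length and Protein Length rows of the record.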
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0674464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCCC CCACCCCTTC TCGCCCGCAA AAAATGGCGT TAATTCCAGC CTGCATCTTT TTGTGTTTTG CTGCGCTATC GGTGCAGGCA GAAGAAACAT CGGTAACGCC ACAGCCGCCT GATATTTTAT TAGGGCCGCT GTTTAATGAT GTGCAAAACG CCAAACTTTT TCCGGACCAA AAAACCTTTG CCGATGCCGT GCCGAACAGC GATCCGCTGA TGATCCTTGC TGATTATCGG ATGCAGCAAA ACCAGAGCGG ATTTGATCTG CGCCATTTCG TTAACGTCAA TTTCACCCTG CCAAAAGAGG GCGAGAAATA TGTTCCGCCA GAGGGGCAGT CACTGCGCGA ACATATTGAC GGACTTTGGC CGGTATTAAC GCGTGCTACC GAAAATACCG AAAAATGGGA TTCTCTGTTA CCGCTGCCGG AACCTTATGT CGTGCCGGGC GGACGTTTTC GCGAGGTATA TTACTGGGAC AGTTACTTCA CCATGTTAGG ACTTGCCGAA AGCGGTCACT GGGATAAAGT CGCGGATATG GTGGCCAATT TTGCTCATGA AATAGACACT TACGGTCATA TTCCCAACGG CAACCGCAGT TACTATTTAA GCCGCTCGCA GCCGCCCTTC TTTGCCCTGA TGGTAGAGTT ACTGGCGCAG CATGAAGGCG ATGCCGCGTT GAAACAATAC CTGCCGCAAA TGCAAAAAGA ATATGCTTAC TGGATGGACG GTGTTGAAAA CCTGCAAGCC GGACAACAGG AAAAACGCGT TGTCAAACTT CAGGATGGTA CCCTTCTCAA CCGCTACTGG GACGATCGCG ATACGCCACG ACCAGAGTCA TGGGTGGAAG ATATTGCCAC CGCCAAAAGC AATCCGAATC GACCTGCCAC TGAAATTTAC CGCGATCTGC GCTCTGCCGC TGCGTCTGGC TGGGATTTCA GCTCGCGCTG GATGGACAAT CCGCAGCAGT TAAATACCTT ACGCACCACC AGCATCGTAC CGGTCGATCT GAACAGCCTG ATGTTTAAAA TGGAAAAAAT CCTCGCCCGC GCCAGCAAAG CTGCCGGAGA TAACGCGATG GCAAACCAGT ACGAAACGCT GGCGAATGCC CGTCAAAAAG GGATCGAAAA ATACCTGTGG AACGATCAAC AAGGCTGGTA TGCCGATTAC GACCTGAAAA GTCATAAAGT GCGCAATCAG TTAACCGCGG CCGCCCTGTT CCCGCTGTAC GTCAATGCGG CAGCGAAAGA TCGCGCCAGC AAAATGGCGA CGGCGACGAA AACACATCTG CTGCAACCCG GCGGCCTGAA CACCACGTCG GTGAAAAGTG GACAACAATG GGATGCGCCA AACGGCTGGG CACCGTTGCA GTGGGTCGCG ACAGAAGGAT TACAAAACTA CGGGCAAAAA GAGGTGGCGA TGGACATTAG CTGGCACTTC CTGACCAATG TTCAGAACAC CTATGACCGG GAGAAAAAGC TGGTGGAAAA ATATGATGTC AGCGCCACCG GAACAGGGGG CGGCGGTGGC GAATATCCAT TACAGGATGG CTTTGGCTGG ACCAATGGCG TGACGCTGAA AATGCTGGAT TTGATCTGCC CGAAAGAGCA ACCGTGTGAC AATGTTCCGG CGACGCGTCC ACTCAGTGAA TCAACAACAC AGCCGCTAAA ACAAAAAGAG GCGGAACCTA CGCCTTAA
|
Protein sequence | MKSPTPSRPQ KMALIPACIF LCFAALSVQA EETSVTPQPP DILLGPLFND VQNAKLFPDQ KTFADAVPNS DPLMILADYR MQQNQSGFDL RHFVNVNFTL PKEGEKYVPP EGQSLREHID GLWPVLTRAT ENTEKWDSLL PLPEPYVVPG GRFREVYYWD SYFTMLGLAE SGHWDKVADM VANFAHEIDT YGHIPNGNRS YYLSRSQPPF FALMVELLAQ HEGDAALKQY LPQMQKEYAY WMDGVENLQA GQQEKRVVKL QDGTLLNRYW DDRDTPRPES WVEDIATAKS NPNRPATEIY RDLRSAAASG WDFSSRWMDN PQQLNTLRTT SIVPVDLNSL MFKMEKILAR ASKAAGDNAM ANQYETLANA RQKGIEKYLW NDQQGWYADY DLKSHKVRNQ LTAAALFPLY VNAAAKDRAS KMATATKTHL LQPGGLNTTS VKSGQQWDAP NGWAPLQWVA TEGLQNYGQK EVAMDISWHF LTNVQNTYDR EKKLVEKYDV SATGTGGGGG EYPLQDGFGW TNGVTLKMLD LICPKEQPCD NVPATRPLSE STTQPLKQKE AEPTP
|
| |
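The gene and protein sequences above are printed in blocks of ten characters, so whitespace has to be removed before any processing. The following is a minimal sketch of translating such a sequence and computing its GC fraction with the Python standard library only. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted); the function names are hypothetical, and any character other than A, C, G, or T raises a KeyError.

```python
# Sketch: translate a space-separated DNA sequence and compute its GC fraction.
# The codon table is built from the conventional TCAG ordering of the
# standard genetic code; table 11 is assumed to share these assignments.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace and convert to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
prefix = "ATGAAATCCC CCACCCCTTC TCGCCCGCAA"
print(translate(prefix))  # MKSPTPSRPQ
```

The translated prefix agrees with the first block of the protein sequence in the record. Passing the full gene sequence to gc_fraction would give the value to compare with the GC content row.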