Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1301 |
Symbol | treA |
ID | 5592548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1294118 |
End bp | 1295815 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640920457 |
Product | trehalase |
Protein accession | YP_001458018 |
Protein GI | 157160700 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCCC CCGCACCTTC TCGCCCGCAA AAAATGGCGT TAATTCCAGC CTGCATCTTT TTGTGTTTCG CTGCGCTATC GGTGCAGGCA GAAGAAACAC CGGTAACACC ACAGCCGCCT GATATTTTAT TAGGGCCGCT GTTTAATGAT GTGCAAAACG CCAAACTTTT TCCGGACCAA AAAACCTTTG CCGATGCCGT GCCGAACAGC GATCCGCTGA TGATCCTTGC TGATTATCGG ATGCAGCAAA ACCAGAGCGG ATTTGATCTG CGCCATTTCG TTAACGTCAA TTTCACCCTG CCGAAAGAGG GCGAGAAATA TGTTCCGCCA GAGGGACAGT CACTGCGCGA ACATATTGAC GGACTTTGGC CGGTATTAAC GCGTTCTACC GAAAACACCG AAAAATGGGA TTCTCTGTTA CCGCTGCCGG AACCTTATGT CGTGCCGGGC GGACGTTTTC GCGAGGTATA TTACTGGGAC AGTTACTTCA CCATGTTAGG ACTTGCCGAA AGCGGTCACT GGGATAAAGT CGCGGATATG GTGGCCAATT TTGCTCATGA AATAGACACT TACGGTCATA TTCCCAACGG CAACCGCAGT TACTATTTAA GCCGCTCGCA ACCGCCCTTC TTTGCCCTGA TGGTAGAGTT ACTGGCGCAG CATGAAGGCG ATGCCGCGTT GAAGCAATAC CTGCCGCAAA TGCAAAAAGA ATATGCTTAC TGGATGGACG GTGTTGAAAA CCTGCAAGCC GGACAACAGG AAAAACGCGT TGTAAAACTT CAGGATGGTA CCCTTCTCAA CCGCTACTGG GACGATCGCG ATACGCCACG ACCAGAGTCA TGGGTGGAAG ATATTGCCAC CGCCAAAAGC AATCCGAATC GACCTGCCAC TGAAATTTAC CGCGACCTGC GCTCTGCCGC TGCGTCTGGC TGGGATTTCA GCTCGCGCTG GATGGACAAC CCGCAGCAGT TAAATACCTT ACGCACCACC AGCATCGTAC CGGTCGATCT GAACAGCCTG ATGTTTAAAA TGGAAAAAAT CCTCGCCCGC GCCAGCAAAG CTGCCGGAGA TAACGCGATG GCAAACCAGT ACGAAACGCT GGCAAATGCC CGTCAAAAAG GGATCGAAAA ATACCTGTGG AACGATCAAC AAGGCTGGTA TGCCGATTAC GACCTGAAAA GTCATAAAGT GCGCAATCAG TTAACCGCGG CCGCCCTGTT CCCGCTGTAC GTCAATGCGG CAGCGAAAGA TCGCGCCAAC AAAATGGCGA CGGCGACGAA AACACATCTG CTGCAACCCG GCGGCCTGAA CACCACGTCG GTGAAAAGTG GGCAACAATG GGATGCGCCA AATGGCTGGG CACCGTTACA GTGGGTCGCG ACAGAAGGAT TACAAAACTA CGGGCAAAAA GAGGTGGCGA TGGACATTAG CTGGCACTTC CTGACCAATG TTCAGCACAC CTATGACCGG GAGAAAAAGC TGGTGGAAAA ATATGATGTC AGCACCACCG GAACGGGGGG CGGCGGTGGC GAATATCCAT TACAGGATGG CTTTGGCTGG ACCAATGGCG TGACGCTGAA AATGCTGGAT TTGATCTGCC CGAAAGAGCA ACCGTGTGAC AATGTTCCGG CGACGCGTCC GACCGTTAAG TCAGCAACGA CGCAACCCTC AACCAAAGAG GCACAACCCA CACCTTAA
|
Protein sequence | MKSPAPSRPQ KMALIPACIF LCFAALSVQA EETPVTPQPP DILLGPLFND VQNAKLFPDQ KTFADAVPNS DPLMILADYR MQQNQSGFDL RHFVNVNFTL PKEGEKYVPP EGQSLREHID GLWPVLTRST ENTEKWDSLL PLPEPYVVPG GRFREVYYWD SYFTMLGLAE SGHWDKVADM VANFAHEIDT YGHIPNGNRS YYLSRSQPPF FALMVELLAQ HEGDAALKQY LPQMQKEYAY WMDGVENLQA GQQEKRVVKL QDGTLLNRYW DDRDTPRPES WVEDIATAKS NPNRPATEIY RDLRSAAASG WDFSSRWMDN PQQLNTLRTT SIVPVDLNSL MFKMEKILAR ASKAAGDNAM ANQYETLANA RQKGIEKYLW NDQQGWYADY DLKSHKVRNQ LTAAALFPLY VNAAAKDRAN KMATATKTHL LQPGGLNTTS VKSGQQWDAP NGWAPLQWVA TEGLQNYGQK EVAMDISWHF LTNVQHTYDR EKKLVEKYDV STTGTGGGGG EYPLQDGFGW TNGVTLKMLD LICPKEQPCD NVPATRPTVK SATTQPSTKE AQPTP
|
| |