Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4720 |
Symbol | treC |
ID | 6147116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4818274 |
End bp | 4819929 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619536 |
Product | trehalose-6-phosphate hydrolase |
Protein accession | YP_001746644 |
Protein GI | 170682699 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02403] alpha,alpha-phosphotrehalase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.782883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCATC TTCCCCACTG GTGGCAAAAC GGCGTTATCT ACCAGATTTA TCCAAAGAGT TTTCAGGACA CCACGGGTAG CGGCACCGGC GATTTACGTG GCGTTATCCA GCGCCTGGAC TATCTGCATA AACTGGGCGT TGATGCCATC TGGCTAACCC CCTTTTATGT CTCTCCCCAG GTCGATAACG GTTACGACGT AGCAAATTAT ACGGCGATTG ATCCCACCTA TGGCACGCTG GACGATTTTG ACGAACTGGT GACACAGGCA AAATCGCGTG GGATTCGTAT CATTCTCGAT ATGGTGTTTA ACCATACCTC TACCCAACAT GCCTGGTTTC GCGAGGCGCT GAACAAAGAA AGCCCTTACC GCCAGTTTTA TATCTGGCGC GATGGAGAAC CAGAAACGCC ACCGAACAAC TGGCGTTCAA AATTCGGTGG TAGTGCGTGG CGCTGGCATG CGGAAAGCGA ACAGTACTAT TTGCATCTCT TTGCACCAGA ACAGGCGGAT CTCAACTGGG AGAATCCAGC GGTACGCGCA GAGCTGAAAA AAGTCTGTGA GTTCTGGGCC AATCGTGGGG TCGACGGGTT GCGCCTGGAT GTGGTGAATC TGATCTCCAA AGATCCGCGT TACCCTGAAG ACCTGGATGG CGACGGGCGT CGCTTCTACA CCGACGGGCC ACGAGCACAC GAGTTTTTGC ACGAGATGAA CCGCGATGTG TTTACGCCAC ACGGGTTAAT GACCGTAGGT GAAATGTCCT CCACCAGCCT TGAGCATTGC CAGCGATACG CGGCACTGAC AGGCAGTGAA TTGTCGATGA CCTTTAATTT TCATCACCTG AAGGTCGATT ATCCCGGTGG TGAGAAATGG ACGCTGGCTA AACCTGACTT TGTGGCGTTG AAAACATTGT TCCGCCACTG GCAACAAGGA ATGCACAACG TGGCATGGAA TGCCTTGTTC TGGTGTAACC ACGATCAGCC GCGCATTGTT TCTCGCTTTG GTGATGAAGG TGAATACCGC GTGCCTGCGG CAAAAATGCT GGCGATGGTG CTGCATGGCA TGCAGGGAAC GCCGTATATC TACCAGGGCG AAGAGATTGG CATGACCAAC CCACATTTCA CGCGCATTAC TGACTATCGC GACGTGGAGA GCCTCAATAT GTTTGCCGAG CTGCGCAACG ATGGTCGTGA TGCCGACGAG TTATTGGCAA TCCTCGCCAG TAAATCCCGT GACAACAGTC GCACACCCAT GCAATGGACC AACGGCGATA ATGCCGGGTT TACGGCTGGC GAACCGTGGA TTGGCCTGGG CGATAACTAT CAGGAAATCA ACGTAGAAGC AGCGCTGGCC GATGAGTCCT CGGTGTTTTA CACCTACCAA AAGTTAATCG CACTGCGTAA GCAAGAAGCC GTCCTGACAT GGGGCGATTA CCAGGATCTG CTGCCAAACA GCCCTGTATT GTGGTGCTAT CGCCGTGAGT GGAAGGGGCA AACCTTGCTG GTCATTGCCA ACCTTAGCCG TGAGACCCAA CCCTGGCAGC CAGGGAAAAT GCTCGGCAAC TGGCAGCTTG TGATGCATAA CTACGAAGAA GCCTCACCAC AACCCTGTGC CATGACTTTA CGGCCTTTTG AGGCTGTCTG GTGGTTACAG AAGTAA
|
Protein sequence | MTHLPHWWQN GVIYQIYPKS FQDTTGSGTG DLRGVIQRLD YLHKLGVDAI WLTPFYVSPQ VDNGYDVANY TAIDPTYGTL DDFDELVTQA KSRGIRIILD MVFNHTSTQH AWFREALNKE SPYRQFYIWR DGEPETPPNN WRSKFGGSAW RWHAESEQYY LHLFAPEQAD LNWENPAVRA ELKKVCEFWA NRGVDGLRLD VVNLISKDPR YPEDLDGDGR RFYTDGPRAH EFLHEMNRDV FTPHGLMTVG EMSSTSLEHC QRYAALTGSE LSMTFNFHHL KVDYPGGEKW TLAKPDFVAL KTLFRHWQQG MHNVAWNALF WCNHDQPRIV SRFGDEGEYR VPAAKMLAMV LHGMQGTPYI YQGEEIGMTN PHFTRITDYR DVESLNMFAE LRNDGRDADE LLAILASKSR DNSRTPMQWT NGDNAGFTAG EPWIGLGDNY QEINVEAALA DESSVFYTYQ KLIALRKQEA VLTWGDYQDL LPNSPVLWCY RREWKGQTLL VIANLSRETQ PWQPGKMLGN WQLVMHNYEE ASPQPCAMTL RPFEAVWWLQ K
|
| |