Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4844 |
Symbol | treC |
ID | 6269416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4514975 |
End bp | 4516630 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641728582 |
Product | trehalose-6-phosphate hydrolase |
Protein accession | YP_001882976 |
Protein GI | 187733088 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02403] alpha,alpha-phosphotrehalase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.257097 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAATC CTCCCCACTG GTGGCAAAAC GGCGTTATCT ACCAGATTTA TCCAAAGAGT TTTCAGGACA CCACGGGTAG CGGTACCGGC GATTTACGTG GCGTTATCCA ACGCCTGGAC TATCTGCATA AACTGGGCGT TGATGCCATC TGGCTAACCC CCTTTTATGT CTCTCCCCAG GTCGATAACG GTTACGACGT AGCGAACTAT ACGGCGATTG ATCCCACCTA CGGCACGCTG GACGATTTTG ACGAACTGGT GACGCAGGCA AAATCGCGCG GGATTCGTAT CATTCTCGAT ATGGTGTTTA ACCATACCTC TACCCAACAT GCCTGGTTTC GCGAGGCGCT GAACAAAGAA AGCCCTTACC GCCAGTTTTA TATCTGGCGC GATGGAGAAC CAGAAACGCC ACCGAACAAC TGGCGTTCAA AATTTGGCGG TAGTGCGTGG CGCTGGCATG CGGAAAGCGA ACAGTACTAT TTGCATCTCT TTGCACCAGA ACAGGCGGAT CTCAACTGGG AGAATCCAGC GGTACGCGCA GTGCTGAAAA AAGTCTGTGA GTTCTGGGCC GATCGTGGGG TCGACGGGTT GCGCCTGGAT GTGGTGAATC TGATCTCCAA AGACCCGCGT TTCCCTGATG ACCTGGATGG CGACGGGCGT CGCTTCTACA CCGACGGGCC ACGAGCACAC GAGTTTTTGC ACGAGATGAA CCGCGATGTG TTTACGCCAC GCGGGTTAAT GACCGTAGGT GAAATGTCCT CCACCAGCCT TGAGCATTGC CAGCGATACG CGGCACTGAC AGGCAGTGAA TTGTCGATGA CCTTTAATTT TCATCACCTG AAGGTCGATT ATCCCGGTGG TGAAAAATGG ACGCTGGCTA AACCTGACTT TGTGGCGTTG AAAACATTGT TCCGCCACTG GCAACAAGGA ATGCACAACG TAGCATGGAA TGCCTTGTTC TGGTGTAACC ACGATCAGCC GCGCATTGTT TCTCGCTTTG GTGATGAAGG TGAATACCGC GTGCCTGCGG CAAAAATGCT GGCGATGGTG CTGCATGGCA TGCAGGGAAC GCCGTATATC TACCAGGGCG AAGAGATTGG CATGACCAAC CCGCATTTCA CGCGCATTAC TGACTATCGC GACGTAGAGA GCCTCAATAT GTTTGCCGAG CTGCGCAACG ATGGTCGTGA TGCCGACGAG TTATTGGCAA TCCTTGCCAG TAAATCCCGT GACAACAGCC GCACGCCCAT GCAATGGAGC AACGGCGATA ATGCCGGATT TACGGCTGGC GAACCGTGGA TTGGCCTAGG TGATAACTAT CAACAAATCA ACGTAGAAGC CGCGCTGGCC GATGATTCCT CGGTGTTTTA CACCTACCAA AAGTTAATCG CACTGCGTAA GCAGGAAGCC ATCCTGACAT GGGGCAATTA CCAGGATCTG CTGCCAAACA GCCCTGTATT GTGGTGCTAT CGCCGTGAAT GGAAGGGGCA AACCTTGCTG GTCATTGCCA ACCTTAGCCG TGAGATCCAA CCCTGGCAGC CAGGGCAAAT GCGCGGCAAC TGGCAGCTTG TGATGCATAA CTACGAAGAA GCCTCACCAC AACCCTGTGC CATGAATTTA CGGCCTTTTG AGGCTGTCTG GTGGTTACAG AAGTAA
|
Protein sequence | MTNPPHWWQN GVIYQIYPKS FQDTTGSGTG DLRGVIQRLD YLHKLGVDAI WLTPFYVSPQ VDNGYDVANY TAIDPTYGTL DDFDELVTQA KSRGIRIILD MVFNHTSTQH AWFREALNKE SPYRQFYIWR DGEPETPPNN WRSKFGGSAW RWHAESEQYY LHLFAPEQAD LNWENPAVRA VLKKVCEFWA DRGVDGLRLD VVNLISKDPR FPDDLDGDGR RFYTDGPRAH EFLHEMNRDV FTPRGLMTVG EMSSTSLEHC QRYAALTGSE LSMTFNFHHL KVDYPGGEKW TLAKPDFVAL KTLFRHWQQG MHNVAWNALF WCNHDQPRIV SRFGDEGEYR VPAAKMLAMV LHGMQGTPYI YQGEEIGMTN PHFTRITDYR DVESLNMFAE LRNDGRDADE LLAILASKSR DNSRTPMQWS NGDNAGFTAG EPWIGLGDNY QQINVEAALA DDSSVFYTYQ KLIALRKQEA ILTWGNYQDL LPNSPVLWCY RREWKGQTLL VIANLSREIQ PWQPGQMRGN WQLVMHNYEE ASPQPCAMNL RPFEAVWWLQ K
|
| |