Gene SbBS512_E4844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4844 
SymboltreC 
ID6269416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4514975 
End bp4516630 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content53% 
IMG OID641728582 
Producttrehalose-6-phosphate hydrolase 
Protein accessionYP_001882976 
Protein GI187733088 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02403] alpha,alpha-phosphotrehalase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.257097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAATC CTCCCCACTG GTGGCAAAAC GGCGTTATCT ACCAGATTTA TCCAAAGAGT 
TTTCAGGACA CCACGGGTAG CGGTACCGGC GATTTACGTG GCGTTATCCA ACGCCTGGAC
TATCTGCATA AACTGGGCGT TGATGCCATC TGGCTAACCC CCTTTTATGT CTCTCCCCAG
GTCGATAACG GTTACGACGT AGCGAACTAT ACGGCGATTG ATCCCACCTA CGGCACGCTG
GACGATTTTG ACGAACTGGT GACGCAGGCA AAATCGCGCG GGATTCGTAT CATTCTCGAT
ATGGTGTTTA ACCATACCTC TACCCAACAT GCCTGGTTTC GCGAGGCGCT GAACAAAGAA
AGCCCTTACC GCCAGTTTTA TATCTGGCGC GATGGAGAAC CAGAAACGCC ACCGAACAAC
TGGCGTTCAA AATTTGGCGG TAGTGCGTGG CGCTGGCATG CGGAAAGCGA ACAGTACTAT
TTGCATCTCT TTGCACCAGA ACAGGCGGAT CTCAACTGGG AGAATCCAGC GGTACGCGCA
GTGCTGAAAA AAGTCTGTGA GTTCTGGGCC GATCGTGGGG TCGACGGGTT GCGCCTGGAT
GTGGTGAATC TGATCTCCAA AGACCCGCGT TTCCCTGATG ACCTGGATGG CGACGGGCGT
CGCTTCTACA CCGACGGGCC ACGAGCACAC GAGTTTTTGC ACGAGATGAA CCGCGATGTG
TTTACGCCAC GCGGGTTAAT GACCGTAGGT GAAATGTCCT CCACCAGCCT TGAGCATTGC
CAGCGATACG CGGCACTGAC AGGCAGTGAA TTGTCGATGA CCTTTAATTT TCATCACCTG
AAGGTCGATT ATCCCGGTGG TGAAAAATGG ACGCTGGCTA AACCTGACTT TGTGGCGTTG
AAAACATTGT TCCGCCACTG GCAACAAGGA ATGCACAACG TAGCATGGAA TGCCTTGTTC
TGGTGTAACC ACGATCAGCC GCGCATTGTT TCTCGCTTTG GTGATGAAGG TGAATACCGC
GTGCCTGCGG CAAAAATGCT GGCGATGGTG CTGCATGGCA TGCAGGGAAC GCCGTATATC
TACCAGGGCG AAGAGATTGG CATGACCAAC CCGCATTTCA CGCGCATTAC TGACTATCGC
GACGTAGAGA GCCTCAATAT GTTTGCCGAG CTGCGCAACG ATGGTCGTGA TGCCGACGAG
TTATTGGCAA TCCTTGCCAG TAAATCCCGT GACAACAGCC GCACGCCCAT GCAATGGAGC
AACGGCGATA ATGCCGGATT TACGGCTGGC GAACCGTGGA TTGGCCTAGG TGATAACTAT
CAACAAATCA ACGTAGAAGC CGCGCTGGCC GATGATTCCT CGGTGTTTTA CACCTACCAA
AAGTTAATCG CACTGCGTAA GCAGGAAGCC ATCCTGACAT GGGGCAATTA CCAGGATCTG
CTGCCAAACA GCCCTGTATT GTGGTGCTAT CGCCGTGAAT GGAAGGGGCA AACCTTGCTG
GTCATTGCCA ACCTTAGCCG TGAGATCCAA CCCTGGCAGC CAGGGCAAAT GCGCGGCAAC
TGGCAGCTTG TGATGCATAA CTACGAAGAA GCCTCACCAC AACCCTGTGC CATGAATTTA
CGGCCTTTTG AGGCTGTCTG GTGGTTACAG AAGTAA
 
Protein sequence
MTNPPHWWQN GVIYQIYPKS FQDTTGSGTG DLRGVIQRLD YLHKLGVDAI WLTPFYVSPQ 
VDNGYDVANY TAIDPTYGTL DDFDELVTQA KSRGIRIILD MVFNHTSTQH AWFREALNKE
SPYRQFYIWR DGEPETPPNN WRSKFGGSAW RWHAESEQYY LHLFAPEQAD LNWENPAVRA
VLKKVCEFWA DRGVDGLRLD VVNLISKDPR FPDDLDGDGR RFYTDGPRAH EFLHEMNRDV
FTPRGLMTVG EMSSTSLEHC QRYAALTGSE LSMTFNFHHL KVDYPGGEKW TLAKPDFVAL
KTLFRHWQQG MHNVAWNALF WCNHDQPRIV SRFGDEGEYR VPAAKMLAMV LHGMQGTPYI
YQGEEIGMTN PHFTRITDYR DVESLNMFAE LRNDGRDADE LLAILASKSR DNSRTPMQWS
NGDNAGFTAG EPWIGLGDNY QQINVEAALA DDSSVFYTYQ KLIALRKQEA ILTWGNYQDL
LPNSPVLWCY RREWKGQTLL VIANLSREIQ PWQPGQMRGN WQLVMHNYEE ASPQPCAMNL
RPFEAVWWLQ K