Gene SbBS512_E4842 details


Gene Information

Locus tag: SbBS512_E4842
Symbol: treR
ID: 6272435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 4512438
End bp: 4513403
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 54%
IMG OID: 641728580
Product: trehalose repressor
Protein accession: YP_001882974
Protein GI: 187730489
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID: [TIGR02405] trehalose operon repressor, proteobacterial


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAATC GGCTGACCAT CAAAGACATC GCGCGCTTAA GCGGCGTGGG GAAATCTACA 
GTTTCCCGGG TGCTGAATAA CGAAAGCGGC GTGAGCCAGC GCACCCGCGA GCGTGTTGAA
GCAGTGATGA ATCAGCATGG ATTTTCCCCT TCCCGCTCTG CGCGCGCTAT GCGTGGGCAA
AGCGATAAAG TGGTCGCCAT CATTGTTACC CGCCTGGATT CGTTGTCAGA AAATCTCGCC
GTTCAAACCA TGCTGCCAGA GTTCTATGAA CAAGGTTACG ACCCAATCAT GATGGAAAGT
CAGTTTTCCC CGCAATTAGT TGCCGAACAT TTGGGGGTGC TGAAACGGCG TAATATCGAC
GGCGTAGTGC TGTTCGGTTT TACCGGCATA ACAGAAGAAA TGTTAGCCCA CTGGCAGTCA
TCGCTGGTTC TGCTGGCGCG TGACGCAAAA GGCTTTGCTT CGGTCTGTTA TGACGACGAA
GGGGCAATCA AGATCCTGAT GCAACGGCTG TATGACCAGG GGCATCGTAA TATCAGTTAT
CTCGGCGTGC CGCACAGTGA CGTGACGACC GGTAAGCGAC GTCACGAAGC CTACCTGGCG
TTCTGCAAAG CGCATAAACT GCATCCCGTT GCCGCTCTGC CAGGCCTTGC TATGAAGCAA
GGCTATGAGA ACGTAGCAAA AGTGATTACG CCTGAAACTA CCGCCTTACT GTGCGCAACC
GACACGCTGG CACTTGGCGC AAGTAAATAC CTGCAAGAGC AACGCATCGA CACCTTGCAA
CTGGCGAGCG TCGGTAATAC GCCGTTAATA AAATTCCTCC ATCCGGAGAT CGTCACCGTA
GATCCCGGTT ACGCCGAAGC TGGACGCCAG GCGGCCTGCC AGTTGATCGC ACAGGTAACC
GGGCGCAGCG AACCGCAACA AATCATCATC CCCGCCACCC TGTCCAGATC GTTTCCTGAA
CGATAA
 
Protein sequence
MQNRLTIKDI ARLSGVGKST VSRVLNNESG VSQRTRERVE AVMNQHGFSP SRSARAMRGQ 
SDKVVAIIVT RLDSLSENLA VQTMLPEFYE QGYDPIMMES QFSPQLVAEH LGVLKRRNID
GVVLFGFTGI TEEMLAHWQS SLVLLARDAK GFASVCYDDE GAIKILMQRL YDQGHRNISY
LGVPHSDVTT GKRRHEAYLA FCKAHKLHPV AALPGLAMKQ GYENVAKVIT PETTALLCAT
DTLALGASKY LQEQRIDTLQ LASVGNTPLI KFLHPEIVTV DPGYAEAGRQ AACQLIAQVT
GRSEPQQIII PATLSRSFPE R
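
The coordinates, lengths, and sequences in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and is not part of the source database: the helper names (`clean`, `span_length`, `gc_fraction`, `translate`) are hypothetical, and it assumes that translation table 11 assigns codons to amino acids the same way as the standard genetic code, differing only in which start codons are permitted.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11 shares
# the standard codon-to-amino-acid assignments; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def span_length(start_bp, end_bp):
    """Length of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCAAAATC GGCTGACCAT CAAAGACATC GCGCGCTTAA GCGGCGTGGG GAAATCTACA"
print(span_length(4512438, 4513403))  # gene length in bp
print(translate(first_line))          # first 20 residues of the protein
```

Applied to this record, the start and end coordinates span 966 bp, which is 322 codons; dropping the terminal stop codon (TAA) leaves the 321 residues reported as the protein length. Passing the full gene sequence to `gc_fraction` gives a value to compare against the rounded GC content listed above.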