Gene SbBS512_E2809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2809 
Symbol 
ID6271869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2606322 
End bp2607374 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content53% 
IMG OID641726762 
Producttranscriptional regulator EutR 
Protein accessionYP_001881235 
Protein GI187731112 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA CCCGTACAGC CAATTTGCAC CATCTTTATC ATGAACCCTT ACCCGAAAAC 
CTGAAGCTCA CGCCGAAGGT CGAAGTGGAT AATGTTCATC AACGACAGAC AACGGATGTC
TATGAACATG CTTTAACGAT TACCGCCTGG CAGCAGATTT ACGATCAGCT GCATCCGGGC
AAGTTTCATG GTGAATTTAC GGAAATTCTA CTCGATGATA TTCAGGTTTT TCGTGAATAC
ACCGGTCTGG CGCTGCGTCA GTCGTGCCTG GTCTGGCCGA ACTCGTTCTG GTTTGGCATT
CCGGCGACGC GCGGTGAGCA GGGATTTATC GGTTCGCAAT GTCTGGGAAG CGCGGAAATC
GCCACCCGCC CTGGTGGCAC TGAATTTGAA CTGAGCACGC CGGATGATTA CACGATCCTG
GGCGTGGTGC TTTCTGAAGA TGTCATCACC CGGCAGGCTA ACTTTTTGCA TAACCCGGAT
CGGGTATTAC ATATGCTGCG TAGCCAGTCG GCGCTGGAAG TGAAAGAGCA GCATAAAGCC
GCGCTGTGGG GCTTTGTCCA ACAGGCGCTG GCGACGTTTT GCGAGAACCC GGAAAATCTC
CATCAGCCTG CAGTGCGAAA AGTGCTGGGG GATAATTTGC TAATGGCGAT GGGGGCGATG
CTGGAAGACG CGCAACCAAT GGTGACGGCG GAAAGCATCA GTCATCAGAG TTACCGTAGA
TTACTTTCCC GCGCCCGTGA ATATGTGCTG GAAAATATGT CTGAGCCGGT GACGGTGCTG
GACTTGTGTA ATCAACTGCA TGTCAGTCGC CGCACGCTAC AAAACGCGTT TCACGCTATT
TTAGGCATTG GCCCAAACGC GTGGCTGAAA CGCATTCGCC TGAACGCCGT ACGCCGCGAA
CTGATAAGCC CGTGGTCGCA AAGCACAACG GTAAAAGACG CCGCCATGCA GTGGGGATTC
TGGCATCTGG GGCAATTTGT CACGGATTAC CAGCAGCTGT TTGCCGAGAA GCCGTCGTTG
ACGTTGCATC AGCGGATGCG GGAATGGGGG TGA
 
Protein sequence
MKKTRTANLH HLYHEPLPEN LKLTPKVEVD NVHQRQTTDV YEHALTITAW QQIYDQLHPG 
KFHGEFTEIL LDDIQVFREY TGLALRQSCL VWPNSFWFGI PATRGEQGFI GSQCLGSAEI
ATRPGGTEFE LSTPDDYTIL GVVLSEDVIT RQANFLHNPD RVLHMLRSQS ALEVKEQHKA
ALWGFVQQAL ATFCENPENL HQPAVRKVLG DNLLMAMGAM LEDAQPMVTA ESISHQSYRR
LLSRAREYVL ENMSEPVTVL DLCNQLHVSR RTLQNAFHAI LGIGPNAWLK RIRLNAVRRE
LISPWSQSTT VKDAAMQWGF WHLGQFVTDY QQLFAEKPSL TLHQRMREWG