Gene SbBS512_E3437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3437 
Symbol 
ID6272416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3196819 
End bp3197775 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content50% 
IMG OID641727323 
Producttranscriptional regulator, AraC family 
Protein accessionYP_001881772 
Protein GI187731739 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.833762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACAAA ATTGCGCACA ATCAAATTGC CGCATTATTC CTAAGAAATT ACGCGATATG 
AAACGTGAAG AGATTTGCCG CTTGCTGGCG GATAAAGTTA ATAAACTGAA AAATAAAGAA
AATAGTTTGT CAGAACTGTT GCCCGATGTG CGTTTGTTGT ATGGCGAGAC GCCTTTCGCA
CGTACACCGG TGATGTACGA GCCTGGCATC ATAATTCTCT TTTCCGGGCA TAAAATCGGT
TATATCAATG AACGCGTGTT TCGTTATGAT GCCAATGAAT ACCTGCTGCT GACGGTGCCG
TTGCCGTTTG AGTGCGAAAC CTATGCCACG TCAGAGGTGC CGCTGGCAGG GTTGCGTCTC
AATGTCGATA TTTTGCAGTT ACAGGAACTG TTGATGGACA TTGGCGAAGA TGAGCATTTC
CAGCCGTCGA TGGCAGCCAG CGGGATTAAC TCCGCCACGT TATCAGAAGA GATTTTATGC
GCGGCGGAGC GGTTACTCGA CGTGATGGAG CGACCACTGG ATGCGCGTAT TCTCGGCAAA
CAGATCATCC GCGAAATTCT GTACTACGTG CTGACCGGAC CTTGCGGCGG CGCGTTACTG
GCGCTGGTCA GTCGCCAGAC TCACTTCAGT CTGATTAGCC GCGTGCTGAA ACGGATTGAG
AATAAATACA CCGAAAACCT GAGCGTCGAG CAACTGGCGG CAGAAGCCAA CATGAGCGTA
TCGGCGTTCC ACCATAATTT TAAGTCTGTC ACCAGCACCT CGCCGTTGCA GTATTTGAAG
AATTACCGTC TGCATAAGGC GCGGATGATG ATCATCCATG ACGGCATGAA GGCCAGCGCA
GCAGCGATGC GCGTCGGCTA TGAAAGCGCA TCGCAATTTA GCCGTGAGTT TAAACGTTAC
TTCGGTGTGA CGCCGGGGGA AGATGCGGCA AGAATGCGGG CGATGCAGGG GAATTAA
 
Protein sequence
MLQNCAQSNC RIIPKKLRDM KREEICRLLA DKVNKLKNKE NSLSELLPDV RLLYGETPFA 
RTPVMYEPGI IILFSGHKIG YINERVFRYD ANEYLLLTVP LPFECETYAT SEVPLAGLRL
NVDILQLQEL LMDIGEDEHF QPSMAASGIN SATLSEEILC AAERLLDVME RPLDARILGK
QIIREILYYV LTGPCGGALL ALVSRQTHFS LISRVLKRIE NKYTENLSVE QLAAEANMSV
SAFHHNFKSV TSTSPLQYLK NYRLHKARMM IIHDGMKASA AAMRVGYESA SQFSREFKRY
FGVTPGEDAA RMRAMQGN