Gene SbBS512_E1346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1346 
Symbol 
ID6269975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1223297 
End bp1224829 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content50% 
IMG OID641725459 
ProductSpoVR family protein 
Protein accessionYP_001879969 
Protein GI187732298 
COG category[S] Function unknown 
COG ID[COG2719] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00649322 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGA TCGATTCTAT GAATAAGGAC ACCACACGGT TGAGCGATGG ACCCGACTGG 
ACGTTCGACC TGCTGGATGT TTATCTGGCA GAGATAGACC GGGTGGCGAA ACTCTACCGG
CTGGATACCT ACCCGCACCA GATTGAAGTG ATAACCTCAG AACAGATGAT GGATGCCTAC
TCCAGCGTCG GCATGCCAAT TAACTATCCG CACTGGTCAT TCGGTAAAAA GTTTATCGAG
ACTGAACGGC TGTATAAGCA CGGTCAGCAA GGACTGGCCT ATGAAATCGT CATTAACTCT
AACCCGTGTA TCGCTTACCT GATGGAAGAG AACACCATTA CCATGCAAGC GCTGGTGATG
GCACACGCCT GCTATGGGCA TAACTCTTTC TTTAAAAACA ATTACTTATT CCGCAGTTGG
ACCGACGCCA GTTCGATTGT CGATTATCTG ATTTTTGCCC GTAAATATAT TACCGAGTGC
GAAGAACGTT ATGGCGTGGA TGAAGTAGAA CGGCTTCTGG ACTCGTGCCA CGCGCTGATG
AACTACGGCG TGGACCGGTA CAAACGCCCG CAAAAAATCT CGCTGCAAGA AGAGAAAGCC
CGGCAGAAAA GTCGCGAAGA GTATCTACAA AGTCAGGTCA ATATGCTCTG GCGTACCCTG
CCGAAGCGCG AGGAAGAGAA AACGGTTGCT GAAGCGCGCC GCTATCCGTC CGAACTACAA
GAAAACCTGC TCTATTTTAT GGAGAAAAAT GCGCCACTGC TGGAATCATG GCAGCGTGAA
ATCCTGCGTA TTGTGCGTAA GGTGAGCCAG TATTTTTATC CGCAAAAACA GACTCAGGTG
ATGAACGAAG GCTGGGCCAC CTTCTGGCAC TACACCATCC TTAACCATCT GTATGATGAA
GGGAAAGTAA CGGAACGTTT TATGCTGGAG TTTTTGCACA GCCACACCAA TGTGGTCTTC
CAGCCCCCCT ATAACAGCCC GTGGTACAGC GGCATCAACC CGTATGCCCT CGGGTTCGCC
ATGTTCCAGG ATATTAAACG GATTTGTCAG TCGCCAACGG AAGAAGACAA ATACTGGTTC
CCGGATATCG CCGGTTCCGA CTGGCTGGAA ACGCTGCATT TCGCGATGCG TGATTTCAAA
GATGAGAGTT TTATCAGCCA GTTTCTGTCA CCGAAAGTGA TGCGTGATTT CCGCTTCTTC
ACCGTGCTGG ATGACGATCG GCATAATTAT CTGGAGATTT CCGCTATTCA TAATGAAGAA
GGTTATCGGG AGATCCGTAA CCGGTTATCG TCGCAATATA ACTTAAGTAA TCTGGAGCCG
AATATTCAGA TCTGGAACGT GGATTTGCGC GGTGACCGTT CGCTGACGCT GCGTTACATT
CCACATAATC GCGCACCGCT GGATCGGGGG CGCAAAGAAG TGCTGAAGCA TGTGCATCGC
CTGTGGGGAT TTGATGTGAT GCTGGAACAG CAAAACGAAG ATGGCAGCGT CGAGTTGCTG
GAACGTTGCC CGCCAAGAAT GGGAAATCTG TAA
 
Protein sequence
MATIDSMNKD TTRLSDGPDW TFDLLDVYLA EIDRVAKLYR LDTYPHQIEV ITSEQMMDAY 
SSVGMPINYP HWSFGKKFIE TERLYKHGQQ GLAYEIVINS NPCIAYLMEE NTITMQALVM
AHACYGHNSF FKNNYLFRSW TDASSIVDYL IFARKYITEC EERYGVDEVE RLLDSCHALM
NYGVDRYKRP QKISLQEEKA RQKSREEYLQ SQVNMLWRTL PKREEEKTVA EARRYPSELQ
ENLLYFMEKN APLLESWQRE ILRIVRKVSQ YFYPQKQTQV MNEGWATFWH YTILNHLYDE
GKVTERFMLE FLHSHTNVVF QPPYNSPWYS GINPYALGFA MFQDIKRICQ SPTEEDKYWF
PDIAGSDWLE TLHFAMRDFK DESFISQFLS PKVMRDFRFF TVLDDDRHNY LEISAIHNEE
GYREIRNRLS SQYNLSNLEP NIQIWNVDLR GDRSLTLRYI PHNRAPLDRG RKEVLKHVHR
LWGFDVMLEQ QNEDGSVELL ERCPPRMGNL