Gene SbBS512_E0591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0591 
Symbol 
ID6271890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp564596 
End bp566473 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content50% 
IMG OID641724795 
Productpeptidase, S54 (rhomboid) family 
Protein accessionYP_001879335 
Protein GI187731357 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCAT CTTCGGTTAA GCCGTTAAAT GTTCAATTAC CCGCAATAAC CCTTATCCTT 
TTTGCACTCT GTATTGGGAT ATTTTGTTAT CTCGCACAAT GGATGAGTTA TGAAGAGGTC
GATCAATCCG CACTTATCCA TCTCGGCGCA AACGTTGCGC CTCTTACTTT GTCGGGTGAA
CCCTGGCGCT TATTGAGCAG TATCTTTCTG CACAGTAGTG TTTCTCATCT GCTGATGAAT
ATGTTTGCAT TCCTGGTCGT GGGAGGCGTG GCGGAACAGA TTCTGGGGAA ATGGCGACTC
CTGATTACCT GGTTATTCTC CGGCGTCTTT GGTGGGCTTA TCAGCGCCTG TTATGCGTTA
CGCGATAGTG ATCAGATAGT CATCAGCGTT GGGGCATTCG GGGCAATTAT GGGAATAGCT
GGCGCTGCGA TAGCAACACA GCTTGCTTCA GGTGCGGGCA CATACCATAA AAACCAGCGG
CGAGTATTTT CTCTGTTGGG TATGGTGGCG CTGACACTGT TGTATGGTGC CCGGCAAACA
GGAATAGATA ATGCTTGTCA CATTGGCGGC CTGATTGCGG GTGGCGCGTT GGGTTGGCTG
AGCGCGCGTT TATCTGGGCA AAACCGACTC GTTACGGAAG GCGGGATTAT TGTTGCGGGC
AGTCTTCTTC TGACCGGGGC TATCTGGCTT GCGCAGCAGC AGATGGATGA GTCAGTTTTA
CAAGTCAGGC AAAGTCTGCG TGAAGCGTTT TATCCACAGG AGATTGAACA AGAGCGACGG
CAAAAAAAGC AGCAGTTAGC GGAGGAACGC AACGCCCTCA AGGAAACATT ATCCGCTCCG
GTAAGTCGTG AACAGGCCAG TGGTGATTTG CTCGCTGAGA TTGCCGATAT CCATGATATG
GCGATCAGTC GGGATGGTAA TATGTTGTAT GCCGCAATTG AAAACACGAA CAGCATTGTT
GTTTTCGACC TCGGACAAAA GAAAATCCTG CATACCTTTA CAGCGCCCAT AGCGAAAGAA
AAGTCAGTCA AACATTGTGG TGGCTGTAAA GATCAGGGCG TCAGATCGCT GGCATTAAGC
CCGGATGAAA AGTTGATTTA TGCGACTTCA TTTGAAGCGA ATGCGTTATC GGTCATTAAC
GTGGCGACAG GGGAGATTAT TCAGTCGATT ACCACCGGTG CACATCCTGA CAGCCTTATC
CTCTCGCGTG ATGGCACAAA AGCCTGGGTG ATGAATCGCA CCAGTAATAG TGTGTCAGCG
ATTGATCTGG TGACTTATCA GCATGTGGCG GATATCCCGC TGGAGAAATA CGACGGGGCG
GGGACGAGCG GTAAACCAGG CGCCTGGGTT ATGGCACTTT CCCCGGATGA AAGAACATTA
CTGGTTCCGG GAGCAGGCAG AGGTAACATC GTGCGGATCA ATACCATCAC GCATCAGAAA
GAAGACTTTC CCGCAGGTGA TGCGCGTGGA ACGATATCGG CGATGCGTTT TCGACCTGAA
AACGGCGAGG TTATTTTTGC AGATAGTCAG GGGATTTCAC GTATAAGCGT AGGGGCTCAA
CAAGCCAGCA TTATGACGCA ATGGTGTAGC AGGAGCGTTT ATTCCGTTGA GGGTATTAGC
CCGGACGGTC AATATTTAGC GTTGGTGTCA TATGGCCTGC AAGGTTATGT CATCCTGCTC
AATATTAATG CCGGGCAGAT TATTGGCGTT TATCCTGCCA GCTACGTTAA TCACCTTCGT
TTTTCAGCGG ATGGTAGAAA AATATTTGTC ATGGCGAAGA ACGGGTTGAT CCAAATGGAC
AGGACGCGCT CGCTTGATCC GCAGGCAATT ATTCGTCATC CCCAATATGG CAATGTGGCT
TGTATCCCTG AACCGTAA
 
Protein sequence
MSASSVKPLN VQLPAITLIL FALCIGIFCY LAQWMSYEEV DQSALIHLGA NVAPLTLSGE 
PWRLLSSIFL HSSVSHLLMN MFAFLVVGGV AEQILGKWRL LITWLFSGVF GGLISACYAL
RDSDQIVISV GAFGAIMGIA GAAIATQLAS GAGTYHKNQR RVFSLLGMVA LTLLYGARQT
GIDNACHIGG LIAGGALGWL SARLSGQNRL VTEGGIIVAG SLLLTGAIWL AQQQMDESVL
QVRQSLREAF YPQEIEQERR QKKQQLAEER NALKETLSAP VSREQASGDL LAEIADIHDM
AISRDGNMLY AAIENTNSIV VFDLGQKKIL HTFTAPIAKE KSVKHCGGCK DQGVRSLALS
PDEKLIYATS FEANALSVIN VATGEIIQSI TTGAHPDSLI LSRDGTKAWV MNRTSNSVSA
IDLVTYQHVA DIPLEKYDGA GTSGKPGAWV MALSPDERTL LVPGAGRGNI VRINTITHQK
EDFPAGDARG TISAMRFRPE NGEVIFADSQ GISRISVGAQ QASIMTQWCS RSVYSVEGIS
PDGQYLALVS YGLQGYVILL NINAGQIIGV YPASYVNHLR FSADGRKIFV MAKNGLIQMD
RTRSLDPQAI IRHPQYGNVA CIPEP