Gene SbBS512_E1452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1452 
Symbol 
ID6270669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1325232 
End bp1326299 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content41% 
IMG OID641725553 
Producthypothetical protein 
Protein accessionYP_001880059 
Protein GI187732391 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAAC ATCAATATTA TCCACAGCTG AAATGGAAGC CTGCTGAATA TGAATCTCTG 
ATGCTTTTAG ATCAAACTAC GCTCTCTGGT TTTACTCCGA TCATTACCAT TCCAGACATA
GACTGGGATT ATGAAAACGA ATGCTACAAG AAGAATTTGA GTTCTTACTT ATCTGACTTC
GGTATTAACC TTGCGGCATC CTGGAAAGCC AATCGTCCTG TTTTGCTGGA TGTTAAATAT
TTAGATAAAC ATGGTTCGAG CCGCCATCAT CCTCTAGATA TGTGTATCCA AGATGCTAGA
GTAAATGGTA AGGAAATTAT CCCTGTTGTT TCTCCCGCAT ATTCAACAAA CTATATACAT
GCTGTTCAAC GCAACTTAAT CAATGGGCTC GCTATATCTA TCACCCCCCA GACATGGCAC
CAATTCACAA GTCTGGTTAA CCACTTAAAT ATTCATCCTA GTTTAATTGA TGTAATCATT
GATTTTGGAG ATATTCAAAA CGCAACTGAT AGTTTAAAAC AACAAGCATT AAGCATGGTC
AACACATTAT CAGGCCAAGC TCCGTGGAGA AACTTGATTT TATCTTCAAC CGCATACCCG
GCATCACAGG CAGGGATACC GCAACATCAA GTTCATCATA TTCCGCGCCA TGAATACGAT
CTTTGGATGT ACGTAGTACA GAATTTTAGC AATGGAAGAA CGCCAAGTTT TAGTGATTAT
CCCACCGCTA GCTCTACCAT TACGAGCGTA GACCCACGCT TCATGTCTCA GTATGTCTCA
GTGAGATATT CGAACGATAC CTCATGGATC TTTGTAAAAG GTACCGCAGT TAAAGGAAAT
GGATGGGGCC AAACTAAAAA CTTATGTACT ACCCTTGTTA GTTCGCCAGA GTATCAAGTC
TTTGGCTCCA AATTTAGTTG GGGGGATGAT TACATTTACC AAAGATCATT AGGCGCTAAC
AAATCTGGCG GCTCTAAAGA ATGGCGTAAA GTTGCACATA CGCACCATAT TACGTTAGTC
GTGAGACAGC TTTATTGGTT GGCGCAGACT CAGCCTGCCA AGCCTTAA
 
Protein sequence
MSQHQYYPQL KWKPAEYESL MLLDQTTLSG FTPIITIPDI DWDYENECYK KNLSSYLSDF 
GINLAASWKA NRPVLLDVKY LDKHGSSRHH PLDMCIQDAR VNGKEIIPVV SPAYSTNYIH
AVQRNLINGL AISITPQTWH QFTSLVNHLN IHPSLIDVII DFGDIQNATD SLKQQALSMV
NTLSGQAPWR NLILSSTAYP ASQAGIPQHQ VHHIPRHEYD LWMYVVQNFS NGRTPSFSDY
PTASSTITSV DPRFMSQYVS VRYSNDTSWI FVKGTAVKGN GWGQTKNLCT TLVSSPEYQV
FGSKFSWGDD YIYQRSLGAN KSGGSKEWRK VAHTHHITLV VRQLYWLAQT QPAKP