Gene SbBS512_A0132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_A0132 
Symbol 
ID6273548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010660 
Strand
Start bp84669 
End bp86393 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content47% 
IMG OID641728775 
Productinvasion plasmid antigen 
Protein accessionYP_001883166 
Protein GI187734297 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones97 
Plasmid unclonability p-value0.102369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCGA TCAACAATCA TTCTTTTTTT CGTTCCCTTT GTGGCTTATC ATGTATATCT 
CGTTTATCGG TAGAAGAACA GTGTACCAGA GATTACCACC GCATCTGGGA TGACTGGGCT
AGGGAAGGAA CAACAACAGA AAATCGCATC CAGGCGGTTC GATTATTGAA AATATGTCTG
GATACCCGGG AGCCTGTTCT CAATTTAAGC TTACTGAAAC TACGTTCTTT ACCACCACTC
CCTTTGCATA TACGTGAACT TAATATTTCC AACAATGAGT TAATCTCCCT ACCTGAAAAT
TCTCCGCTTT TGACAGAACT TCATGTAAAT GGTAACAACT TGAATATACT CCCGACACTT
CCATCTCAAC TGATTAAGCT TAATATTTCA TTCAATCGAA ATTTGTCATG TCTGCCATCA
TTACCACCAT ATTTACAATC ACTCTCGGCA CGTTTTAATA GTCTGGAGAC GTTACCAGAG
CTTCCATCAA CGCTAACAAT ATTACGTATT GAAGGTAATC GCCTTACTGT CTTGCCTGAA
TTGCCTCATA GACTACAAGA ACTCTTTGTT TCCGGCAACA GACTACAGGA ACTACCAGAA
TTTCCTCAGC GCTTAAAATA TTTGAAGGTA GGTGAAAATC AACTACGCAG ATTATCCAGA
TTACCGCAAG AACTATTGAC ACTGGATGTT TCCAATAACC TACTAACTTC ATTACCCGAA
AATATAATCA CATTGCCCAT TTGTACGAAT GTTAACATTT CAGGGAATCC ATTGTCGACT
CGCGTTCTGC AATCCCTGCA AAGATTAACC TCTTCGCCGG ACTACCACGG CCCGCAGATT
TACTTCTCCA TGAGTGACGG ACAACAGAAT ACACTCCATC GCCCCCTGGC TGATGCCGTG
ACAGCATGGT TCCCGGAAAA CAAACAATCT GATGTATCAC AGATATGGCA TGCTTTTGAA
CATGAAGAGC ACGCCAACAC CTTTTCCGCG TTCCTTGACC GCCTTTCCGA TACCGTCTCT
GCACGCAATA CCTCCGGATT CCGTGAACAG GTCGCTGCAT GGCTGGAAAA ACTCAGTGCC
TCTGCGGAGC TTCGACAGCA GTCTTTCGCT GTTGCTGCTG ATGCCACTGA GAGCTGTGAG
GACCGTGTCG CGCTCACATG GAACAATCTC CGGAAAACCC TCCTGGTCCA TCAGGCATCA
GAAGGCCTTT TCGATAATGA TACCGGCGCT CTGCTCTCCC TGGGCAGGGA AATGTTCCGC
CTCGAAATTC TGGAGGACAT TGCCCGGGAT AAAGTCAGAA CTCTCCATTT TGTGGATGAG
ATAGAAGTCT ACCTGGCCTT CCAGACCATG CTCGCAGAGA AACTTCAGCT CTCCACTGCC
GTGAAGGAAA TGCGTTTCTA TGGCGTGTCG GGAGTGACAG CAAATGACCT CCGCACTGCC
GAAGCCATGG TCAGAAGCCG TGAAGAGAAT GAATTTAAGG ACTGGTTCTC CCTCTGGGGA
CCATGGCATG CTGTACTGAA GCGTACGGAA GCTGACCGCT GGGCGCAGGC AGAAGAGCAG
AAGTATGAGA TGCTGGAGAA TGAGTACTCT CAGAGGGTGG CTGACCGGCT GAAAGCATCA
GGTCTGAGCG GTGATACGGA TGCGGAGAGG GAAGCCGGTG CACAGGTGAT GCGTGAGACT
GAACAGCAGA TTTACCGTCA GTTGACTGAC GAGGTACTGG CCTGA
 
Protein sequence
MKPINNHSFF RSLCGLSCIS RLSVEEQCTR DYHRIWDDWA REGTTTENRI QAVRLLKICL 
DTREPVLNLS LLKLRSLPPL PLHIRELNIS NNELISLPEN SPLLTELHVN GNNLNILPTL
PSQLIKLNIS FNRNLSCLPS LPPYLQSLSA RFNSLETLPE LPSTLTILRI EGNRLTVLPE
LPHRLQELFV SGNRLQELPE FPQRLKYLKV GENQLRRLSR LPQELLTLDV SNNLLTSLPE
NIITLPICTN VNISGNPLST RVLQSLQRLT SSPDYHGPQI YFSMSDGQQN TLHRPLADAV
TAWFPENKQS DVSQIWHAFE HEEHANTFSA FLDRLSDTVS ARNTSGFREQ VAAWLEKLSA
SAELRQQSFA VAADATESCE DRVALTWNNL RKTLLVHQAS EGLFDNDTGA LLSLGREMFR
LEILEDIARD KVRTLHFVDE IEVYLAFQTM LAEKLQLSTA VKEMRFYGVS GVTANDLRTA
EAMVRSREEN EFKDWFSLWG PWHAVLKRTE ADRWAQAEEQ KYEMLENEYS QRVADRLKAS
GLSGDTDAER EAGAQVMRET EQQIYRQLTD EVLA