Gene SbBS512_A0253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_A0253 
Symbol 
ID6273599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010660 
Strand
Start bp170105 
End bp171742 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content48% 
IMG OID641728875 
Productinvasion plasmid antigen 
Protein accessionYP_001883266 
Protein GI187734472 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones93 
Plasmid unclonability p-value0.0402071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACCGA TAAATAATAA CTTTTCATTG CCCCAAAATT CTTTTTATAA CACTATTTCC 
GGTACATATG CTGATTACTT TTCAGCATGG GATAAATGGG AAAAACAAGC GCTCCCCGGT
GAAGAGCGTG ATGAGGCTGT CTCCCGACTT AAAGAATGTC TTATCAATAA TTCCGATGAA
CTTCGACTGG ACCGTTTAAA TCTGTCCTCG CTACCTGACA ACTTACCAGC TCAGATAACG
CTGCTCAATG TATCATATAA TCAATTAACT AACCTACCTG AACTGCCTGT TACGCTAAAA
AAATTATATT CCGCCAGCAA TAAATTATCA GAATTGCCCG TGCTACCTCC TGCGCTGGAG
TCACTTCAGG TACAACACAA TGAGCTGGAA AACCTGCCAG CTTTACCCGA TTCGTTATTG
ACTATGAATA TCAGCTATAA CGAAATAGTC TCCTTACCAT CGCTCCCACA GGCTCTTAAA
AATCTCAGAG CGACCCGTAA TTTCCTCACT GAGCTACCAG CATTTTCTGA GGGAAATAAT
CCCGTTGTCA GAGAGTATTT TTTTGATAGA AATCAGATAA GTCATATCCC GGAAAGCATT
CTTAATCTGA GGAATGAATG TTCAATACAT ATTAGTGATA ACCCATTATC ATCCCATGCT
CTGCAAGCCC TGCAAAGATT AACCTCTTCG CCGGACTACC ACGGCCCACG GATTTACTTC
TCCATGAGTG ACGGACAACA GAATACACTC CATCGCCCCC TGGCTGATGC CGTGACAGCA
TGGTTCCCGG AAAACAAACA ATCTGATGTA TCACAGATAT GGCATGCTTT TGAACATGAA
GAGCATGCCA ACACCTTTTC CGCGTTCCTT GACCGCCTTT CCGATACCGT CTCTGCACGC
AATACCTCCG GATTCCGTGA ACAGGTCGCT GCATGGCTGG AAAAACTCAG TGCCTCTGCG
GAGCTTCGAC AGCAGTCTTT CGCTGTTGCT GCTGATGCCA CTGAGAGCTG TGAGGACCGT
GTCGCGCTCA CATGGAACAA TCTCCGGAAA ACCCTCCTGG TCCATCAGGC ATCAGAAGGC
CTTTTCGATA ATGATACCGG CGCTCTGCTC TCCCTGGGCA GGGAAATGTT CCGCCTCGAA
ATTCTGGAGG ACATTGCCCG GGATAAAGTC AGAACTCTCC ATTTTGTGGA TGAGATAGAA
GTCTACCTGG CCTTCCAGAC CATGCTCGCA GAGAAACTTC AGCTCTCCAC TGCCGTGAAG
GAAATGCGTT TCTATGGCGT GTCGGGAGTG ACAGCAAATG ACCTCCGCAC TGCCGAAGCC
ATGGTCAGAA GCCGTGAAGA GAATGAATTT AAGGACTGGT TCTCCCTCTG GGGACCATGG
CATGCTGTAC TGAAGCGTAC GGAAGCTGAC CGCTGGGCGC AGGCAGAAGA GCAGAAATAT
GAGATGCTGG AGAATGAGTA CCCTCAGAGG GTGGCTGACC GGCTGAAAGC ATCAGGTCTG
AGCGGTGATG CGGATGCGGA GAGGGAAGCC GGTGCACAGG TGATGCGTGA GACTGAACAG
CTGATTTACC GTCAGCTGAC TGACGAGGTA CTGGCCCTGC GATTGTCTGA AAACGGCTCA
CAACTGCACC ATTCATAA
 
Protein sequence
MLPINNNFSL PQNSFYNTIS GTYADYFSAW DKWEKQALPG EERDEAVSRL KECLINNSDE 
LRLDRLNLSS LPDNLPAQIT LLNVSYNQLT NLPELPVTLK KLYSASNKLS ELPVLPPALE
SLQVQHNELE NLPALPDSLL TMNISYNEIV SLPSLPQALK NLRATRNFLT ELPAFSEGNN
PVVREYFFDR NQISHIPESI LNLRNECSIH ISDNPLSSHA LQALQRLTSS PDYHGPRIYF
SMSDGQQNTL HRPLADAVTA WFPENKQSDV SQIWHAFEHE EHANTFSAFL DRLSDTVSAR
NTSGFREQVA AWLEKLSASA ELRQQSFAVA ADATESCEDR VALTWNNLRK TLLVHQASEG
LFDNDTGALL SLGREMFRLE ILEDIARDKV RTLHFVDEIE VYLAFQTMLA EKLQLSTAVK
EMRFYGVSGV TANDLRTAEA MVRSREENEF KDWFSLWGPW HAVLKRTEAD RWAQAEEQKY
EMLENEYPQR VADRLKASGL SGDADAEREA GAQVMRETEQ LIYRQLTDEV LALRLSENGS
QLHHS