Gene SbBS512_E2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2110 
Symbol 
ID6272320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1918645 
End bp1920351 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content49% 
IMG OID641726145 
Productinvasion plasmid antigen 
Protein accessionYP_001880639 
Protein GI187731730 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACCGA TAAATAATAA CTTTTCATTG TCCCAAAATT CTTTTTATAA CACTATTTCC 
GGTACATATG CTGATTACTT TTCAGCATGG GATAAATGGG AAAAACAAGC GCTCCCCGGT
GAAAATCGGA ATGAAGCGGT CTCCCTACTT AAAGAATGTC TCATCAATCA GTTCAGTGAG
CTTCAACTGA ATCGTTTAAA TCTGTCCTCG CTACCTGACA ACTTACCACC TCAAATCACT
GTTCTGGAAA TTACTCAGAA TGCCCTAATA TCATTACCAG AATTGCCAGC ATCGCTGGAA
TACCTTGACG CCTGTGACAA TCACCTGTCA ACACTTCCTG AATTACCCGC ATCTCTGAAA
CATCTTGATG TAGATAACAA CCAACTAACC ATGCTTCCTG AATTGCCTGC ATTGCTGGAA
TATATTAATG CAGATAACAA TCAGCTAACC ATGCTTCCTG AATTACCTAC ATCGCTGGAA
GTGCTCTCAG TAAGAAATAA CCAGCTGACA TTTCTTCCTG AGTTACCTGA ATCACTGGAA
GCGCTCGATG TAAGTACTAA TCTTCTGGAA AGCCTACCAG CCGTACCTGT AAGAAATCAT
CACTCAGAGG AAACCGAGAT ATTTTTCCGG TGCCGCGAGA ATCGCATCAC ACACATTCCG
GAAAATATAC TTAGCCTTGA TCCGACCTGC ACTATCATCC TCGAAGACAA TCCTCTGTCC
TCACGGATCA GGGAGTCTCT GTCGCAACAA ACCGCCCAAC CGGACTACCA CGGCCCACGG
ATTTACTTCT CCATGAGTGA CGGACAACAG AATACACTCC ATCGCCCCCT GGCTGATGCC
GTGACAGCAT GGTTCCCGGA AAACAAACAA TCTGATGTAT CACAGATATG GCATGCTTTT
GAACATGAAG AGCACGCCAA CACCTTTTCC GCGTTCCTTG ACCGCCTTTC CGATACCGTC
TCTGCACGCA ATACCTCCGG ATTCCGTGAA CAGGTCGCTG CATGGCTGGA AAAACTCAGT
GCCTCTGCGG AGCTTCGACA GCAGTCTTTC GCTGTTGCTG CTGATGCCAC TGAGAGCTGT
GAGGACCGTG TCGCGCTCAC ATGGAACAAT CTCCGGAAAA CCCTCCTGGT CCATCAGGCA
TCAGAAGGCC TTTTCGATAA TGATACCGGC GCTCTGCTCT CCCTGGGCAG GGAAATGTTC
CGCCTCGAAA TTCTGGAGGA CATTGCCCGG GATAAAGTCA GAACTCTCCA TTTTGTGGAT
GAGATAGAAG TCTACCTGGC CTTCCAGACC ATGCTCGCAG AGAAACTTCA GCTCTCCACT
GCCGTGAAGG AAATGCGTTT CTATGGCGTG TCGGGAGTGA CAGCAAATGA CCTCCGCACT
GCCGAAGCCA TGGTCAGAAG CCGTGAAGAG AATGAATTTA CGGACTGGTT CTCCCTCTGG
GGACCATGGC ATGCTGTACT GAAGCGTACG GAAGCTGACC GCTGGGCGCT GGCAGAAGAG
CAGAAATATG AGATGCTGGA GAATGAGTAC CCTCAGAGGG TGGCTGACCG GCTGAAAGCA
TCAGGTCTGA GCGGTGATGC GGATGCGGAG AGGGAAGCCG GTGCACAGGT GATGCGTGAG
ACTGAACAGC AGATTTACCG TCAGCTGACT GACGAGGTAC TGGCCCTGCG ATTGTCTGAA
AACGGCTCAC AACTGCACCA TTCATAA
 
Protein sequence
MLPINNNFSL SQNSFYNTIS GTYADYFSAW DKWEKQALPG ENRNEAVSLL KECLINQFSE 
LQLNRLNLSS LPDNLPPQIT VLEITQNALI SLPELPASLE YLDACDNHLS TLPELPASLK
HLDVDNNQLT MLPELPALLE YINADNNQLT MLPELPTSLE VLSVRNNQLT FLPELPESLE
ALDVSTNLLE SLPAVPVRNH HSEETEIFFR CRENRITHIP ENILSLDPTC TIILEDNPLS
SRIRESLSQQ TAQPDYHGPR IYFSMSDGQQ NTLHRPLADA VTAWFPENKQ SDVSQIWHAF
EHEEHANTFS AFLDRLSDTV SARNTSGFRE QVAAWLEKLS ASAELRQQSF AVAADATESC
EDRVALTWNN LRKTLLVHQA SEGLFDNDTG ALLSLGREMF RLEILEDIAR DKVRTLHFVD
EIEVYLAFQT MLAEKLQLST AVKEMRFYGV SGVTANDLRT AEAMVRSREE NEFTDWFSLW
GPWHAVLKRT EADRWALAEE QKYEMLENEY PQRVADRLKA SGLSGDADAE REAGAQVMRE
TEQQIYRQLT DEVLALRLSE NGSQLHHS