Gene SbBS512_E1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1016 
Symbol 
ID6269433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp938734 
End bp940377 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content48% 
IMG OID641725160 
Productinvasion plasmid antigen 
Protein accessionYP_001879682 
Protein GI187734205 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCCTG TAAATAATCC CCCCCTATCC ACTGGAAACG TCTCTTTTTA CAGAACTACA 
TCAATCGACA ATGTTCACAA TAATTATCTC TCCGAATGGG TTGAATGGAC TAAAAACAGC
ATTTCCGGAG AAAACAGGGA AACTGCTTTT ACCCGGCTCC AATTATGTCT GGAGAACAGT
GAAACATCGT TGGACTTATC TTGTTTAGGT CTCAGATCTC TACCACGATT GCCTGACAAT
CTTGATGAAA TTAATGTAAG CAATAACCAA CTATCAATGC TCCCCGAGCT ACCAAGGGCA
TTGAAAGAGC TGAATGCAAG CAGTAATCAA TTATCTGCAC TTCCTGAATT ACCAGTGTCG
CTGGAATATA TAAATGTGAG TGATAACCAT TTGTTCGCAC TTCCTGAATT ACCTGCGTCA
CTAGAATATA TTAATGTAAG TGACAATCAC CTGTCTGTAC TTCCGAGGTT ACCAATGTCA
TTGGAATTAC TTGATGCAGC CAGAAATGCT TTGGAAGTAA TACCAGATTT TCCAGAAAGA
GATGATCATA TTATAAGAAT ATTCTGGCTT AATCAGAACC GGATCACGGC AATTCCGGAA
AGCATACTTG GCCTCAGTTC TGATAGCGTT GTCAATCTTA GAGAAAATCA ACTATCTCCC
AGAATAATGC AAACTTTGTT ACAACAAACC GCCCAACCGG ACTACCACGG CCCACGGATT
TACTTCTCCA TGAGTGACGG ACAACAGAAT ACACTCCATC GCCCCCTGGC TGATGCCGTG
ACAGCATGGT TCCCGGAAAA CAAACAATCT GATGTATCAC AGATATGGCA TGCTTTTGAA
CATGAAGAGC ACGCCAACAC CTTTTCCGCG TTCCTTGACC GCCTTTCCGA TACCGTCTCT
GCACGCAATA CCTCCGGATT CCGTGAACAG GTCGCTGCAT GGCTGGAAAA ACTCAGTGCC
TCTGCGGAGC TTCGACAGCA GTCTTTCGCT GTTGCTGCTG ATGCCACTGA GAGCTGTGAG
GACCGTGTCG CGCTCACATG GAACAATCTC CGGAAAACCC TCCTGGTCCA TCAGGCATCA
GAAGGCCTTT TCGATAATGA TACCGGCGCT CTGCTCTCCC TGGGCAGGGA AATGTTCCGC
CTCGAAATTC TGGAGGACAT TGCCCGGGAT AAAGTCAGAA CTCTCCATTT TGTGGACGAG
ATAGAAGTCT ACCTGGCCTT CCAGACCATG CTCGCAGAGA AACTTCAGCT CTCCACTGCC
GTGAAGGAAA TGCGTTTCTA TGGCGTGTCG GGAGTGACAG CAAATGACCT CCGCACTGCC
GAAGCCATGG TCAGAAGCCG TGAAGAGAAT GAATTTACGG ACTGGTTCTC CCTCTGGGGA
CCATGGCATG CTGTACTGAA GCGTACGGAA GCTGACCGCT GGGCGCTGGC AGAAGAGCAG
AAATATGAGA TGCTGGAGAA TGAGTACCCT CAGAGGGTGG CTGACCGGCT GAAAGCATCA
GGTCTGAGCG GTGATGCGGA TGCGGAGAGG GAAGCCGGTG CACAGGTGAT GCGTGAGACT
GAACAGCAGA TTTACCGTCA GCTGACTGAC GAGGTACTGG CCCTGCGATT GTCTGAAAAC
GGCTCACAAC TGCACCATTC ATAA
 
Protein sequence
MLPVNNPPLS TGNVSFYRTT SIDNVHNNYL SEWVEWTKNS ISGENRETAF TRLQLCLENS 
ETSLDLSCLG LRSLPRLPDN LDEINVSNNQ LSMLPELPRA LKELNASSNQ LSALPELPVS
LEYINVSDNH LFALPELPAS LEYINVSDNH LSVLPRLPMS LELLDAARNA LEVIPDFPER
DDHIIRIFWL NQNRITAIPE SILGLSSDSV VNLRENQLSP RIMQTLLQQT AQPDYHGPRI
YFSMSDGQQN TLHRPLADAV TAWFPENKQS DVSQIWHAFE HEEHANTFSA FLDRLSDTVS
ARNTSGFREQ VAAWLEKLSA SAELRQQSFA VAADATESCE DRVALTWNNL RKTLLVHQAS
EGLFDNDTGA LLSLGREMFR LEILEDIARD KVRTLHFVDE IEVYLAFQTM LAEKLQLSTA
VKEMRFYGVS GVTANDLRTA EAMVRSREEN EFTDWFSLWG PWHAVLKRTE ADRWALAEEQ
KYEMLENEYP QRVADRLKAS GLSGDADAER EAGAQVMRET EQQIYRQLTD EVLALRLSEN
GSQLHHS