Gene SbBS512_E1692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1692 
Symbol 
ID6268639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1540884 
End bp1542635 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content47% 
IMG OID641725773 
Productinvasion plasmid antigen 
Protein accessionYP_001880271 
Protein GI187732810 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.804686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCCGA CAAATAACAA TCACAGATTA ATTTCAAATT CGTTCTCCAC TTATTCAATC 
GACACTAGCC GCGCATATGA AAGTTATCTA ACCCATTGGA CTGAATGGAA AAATAACCGC
ATACAAGAAG AACAACGAGA CATCGCTTTT CAGCGACTAG TATCATGTCT ACAAAACCAA
GAGACGAACC TAGACTTGTC TGAATTAGGC CTGACAACAT TACCTGAAAT CCCCCCGGGA
ATTAAATCAA TTAATATAAG TAAAAATAAT TTAAGCTTAA TCTCCCCATT GCCTGCGTCC
CTTACACAGC TTAATGTCAG CTATAACAGA CTTATTGAAC TGCCTGCTTT GCCTCAAGGA
CTTAAATTAT TGAATGCGTC CCACAATCAA CTAATCACAC TACCCACACT CCCCATATCT
TTGAAGGAGC TTCATGTCTC AAATAATCAA TTATGTTCTC TTCCTGTTTT ACCAGAACTA
CTGGAAACAT TAGATGTATC ATGTAATGGG CTGGCAGTTT TACCACCTTT ACCATTTTCT
TTACAAGAGA TTAGCGCAAT AGGGAATCTT CTTAGTGAAC TCCCCCCTCT ACCTCACAAC
ATTCACTCCA TATGGGCAAT CGACAATATG TTAACCGATA TTCCATACCT GCCGGAAAAT
TTAAGGAACG GTTATTTTGA CATAAATCAG ATAAGTCATA TCCCGGAAAG CATTCTTAAT
CTGAGGAATG AATGTTCAAT AGATATTAGT GATAACCCAT TGTCATCCCA TGCTCTGCAA
TCCCTGCAAA GATTAACCTC TTCGCCGGAC TACCACGGCC CGCAGATTTA CTTCTCCATG
AGTGACGGAC AACAGAATAC ACTCCATCGC CCCCTGGCTG ATGCCGTGAC AGCATGGTTC
CCGGAAAACA AACAATCTGA TGTATCACAG ATATGGCATG CTTTTGAACA TGAAGAGCAC
GCCAACACCT TTTCCGCGTT CCTTGACCGC CTTTCCGATA CCGTCTCTGC ACGCAATACC
TCCGGATTCC GTGAACAGGT CGCTGCATGG CTGGAAAAAC TCAGTGCCTC TGCGGAGCTT
CGACAGCAGT CTTTCGCTGT TGCTGCTGAT GCCACTGAGA GCTGTGAGGA CCGTGTCGCG
CTCACATGGA ACAATCTCCG GAAAACCCTC CTGGTCCATC AGGCATCAGA AGGCCTTTTC
GATAATGATA CCGGCGCTCT GCTCTCCCTG GGCAGGGAAA TGTTCCGCCT CGAAATTCTG
GAGGACATTG CCCGGGATAA AGTCAGAACT CTCCATTTTG TGGATGAGAT AGAAGTCTAC
CTGGCCTTCC AGACCATGCT CGCAGAGAAA CTTCAGCTCT CCACTGCCGT GAAGGAAATG
CGTTTCTATG GCGTGTCGGG AGTGACAGCA AATGACCTCC GCACTGCCGA AGCCATGGTC
AGAAGCCGTG AAGAGAATGA ATTTACGGAC TGGTTCTCCC TCTGGGGACC ATGGCATGCT
GTACTGAAGC GTACGGAAGC TGACCGCTGG GCGCTGGCAG AAGAGCAGAA ATATGAGATG
CTGGAGAATG AGTACCCTCA GAGGGTGGCT GACCGGCTGA AAGCATCAGG TCTGAGCGGT
GATGCGGATG CGGAGAGGGA AGCCGGTGCA CAGGTGATGC GTGAGACTGA ACAGCAGATT
TACCGTCAGC TGACTGACGA GGTACTGGCC CTGCGATTGT CTGAAAACGG CTCACAACTG
CACCATTCAT AA
 
Protein sequence
MLPTNNNHRL ISNSFSTYSI DTSRAYESYL THWTEWKNNR IQEEQRDIAF QRLVSCLQNQ 
ETNLDLSELG LTTLPEIPPG IKSINISKNN LSLISPLPAS LTQLNVSYNR LIELPALPQG
LKLLNASHNQ LITLPTLPIS LKELHVSNNQ LCSLPVLPEL LETLDVSCNG LAVLPPLPFS
LQEISAIGNL LSELPPLPHN IHSIWAIDNM LTDIPYLPEN LRNGYFDINQ ISHIPESILN
LRNECSIDIS DNPLSSHALQ SLQRLTSSPD YHGPQIYFSM SDGQQNTLHR PLADAVTAWF
PENKQSDVSQ IWHAFEHEEH ANTFSAFLDR LSDTVSARNT SGFREQVAAW LEKLSASAEL
RQQSFAVAAD ATESCEDRVA LTWNNLRKTL LVHQASEGLF DNDTGALLSL GREMFRLEIL
EDIARDKVRT LHFVDEIEVY LAFQTMLAEK LQLSTAVKEM RFYGVSGVTA NDLRTAEAMV
RSREENEFTD WFSLWGPWHA VLKRTEADRW ALAEEQKYEM LENEYPQRVA DRLKASGLSG
DADAEREAGA QVMRETEQQI YRQLTDEVLA LRLSENGSQL HHS