Gene SbBS512_E0713 details

Gene Information

Locus tag: SbBS512_E0713
Symbol:
ID: 6270253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 669100
End bp: 670431
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 51%
IMG OID: 641724902
Product: invasion plasmid antigen
Protein accession: YP_001879431
Protein GI: 187734068
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
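
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 669100
END_BP = 670431
GENE_LENGTH_BP = 1332
PROTEIN_LENGTH_AA = 443


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the listed gene and protein lengths.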


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.29021
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCAGTCAC TCTCAGCCCT TCTCAATAGC CTGGAGACGC TACCTGATCT TCCCCCGGCT 
CTACAAAAAC TTTCTGTTGG CAACAACCAG CTTACTGCCT TACCAGAATT ACCATGTGAA
CTACAGGAAC TAAGTGCTTT TGATAACAGA TTACAAGAGC TACCGCCCCT TCCTCAAAAT
CTGAGGCTTT TAAACGTTGG GGAAAACCAA CTACACAGAC TGCCCGAACT TCCACAACGT
CTGCAATCAC TATATATCCC TAACAATCAG CTGAACACAT TGCCAGACAG TATCATGAAT
CTGCACATTT ATGCAGATGT TAATATTTAT AACAATCCAT TGTCGACTCG CACTCTGCAA
GCCCTGCAAA GATTAACCTC TTCGCCGGAC TACCACGGCC CACGGATTTA CTTCTCCATG
AGTGACGGAC AACAGAATAC ACTCCATCGC CCCCTGGCTG ATGCCGTGAC AGCATGGTTC
CCGGAAAACA AACAATCTGA TGTATCACAG ATATGGCATG CTTTTGAACA TGAAGAGCAC
GCCAACACCT TTTCCGCGTT CCTTGACCGC CTTTCCGATA CCGTCTCTGC ACGCAATACC
TCCGGATTCC GTGAACAGGT CGCTGCATGG CTGGAAAAAC TCAGTGCCTC TGCGGAGCTT
CGACAGCAGT CTTTCGCTGT TGCTGCTGAT GCCACTGAGA GCTGTGAGGA CCGTGTCGCG
CTCACATGGA ACAATCTCCG GAAAACCCTC CTGGTCCATC AGGCATCAGA AGGCCTTTTC
GATAATGATA CCGGCGCTCT GCTCTCCCTG GGCAGGGAAA TGTTCCGCCT CGAAATTCTG
GAGGACATTG CCCGGGATAA AGTCAGAACT CTCCATTTTG TGGATGAGAT AGAAGTCTAC
CTGGCCTTCC AGACCATGCT CGCAGAGAAA CTTCAGCTCT CCACTGCCGT GAAGGAAATG
CGTTTCTATG GCGTGTCGGG AGTGACAGCA AATGACCTCC GCACTGCCGA AGCCATGGTC
AGAAGCCGTG AAGAGAATGA ATTTACGGAC TGGTTCTCCC TCTGGGGACC ATGGCATGCT
GTACTGAAGC GTACGGAAGC TGACCGCTGG GCGCTGGCAG AAGAGCAGAA ATATGAGATG
CTGGAGAATG AGTACCCTCA GAGGGTGGCT GACCGGCTGA AAGCATCAGG TCTGAGCGGT
GATGCGGATG CGGAGAGGGA AGCCGGTGCA CAGGTGATGC GTGAGACTGA ACAGCAGATT
TACCGTCAGC TGACTGACGA GGTACTGGCC CTGCGATTGT CTGAAAACGG CTCACAACTG
CACCATTCAT AA
 
Protein sequence
MQSLSALLNS LETLPDLPPA LQKLSVGNNQ LTALPELPCE LQELSAFDNR LQELPPLPQN 
LRLLNVGENQ LHRLPELPQR LQSLYIPNNQ LNTLPDSIMN LHIYADVNIY NNPLSTRTLQ
ALQRLTSSPD YHGPRIYFSM SDGQQNTLHR PLADAVTAWF PENKQSDVSQ IWHAFEHEEH
ANTFSAFLDR LSDTVSARNT SGFREQVAAW LEKLSASAEL RQQSFAVAAD ATESCEDRVA
LTWNNLRKTL LVHQASEGLF DNDTGALLSL GREMFRLEIL EDIARDKVRT LHFVDEIEVY
LAFQTMLAEK LQLSTAVKEM RFYGVSGVTA NDLRTAEAMV RSREENEFTD WFSLWGPWHA
VLKRTEADRW ALAEEQKYEM LENEYPQRVA DRLKASGLSG DADAEREAGA QVMRETEQQI
YRQLTDEVLA LRLSENGSQL HHS
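
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG is an alternative initiation codon read as methionine at the start of a CDS. The sketch below is a minimal illustration that translates only the first and last lines of the gene sequence. The `translate` helper and its `is_cds_start` parameter are hypothetical names, not part of any database tool. The codon table is built from the standard base ordering, which table 11 shares for amino acid assignments.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace is ignored. If is_cds_start is true, a table 11 initiation
    codon in the first position is read as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
FIRST_LINE = "TTGCAGTCAC TCTCAGCCCT TCTCAATAGC CTGGAGACGC TACCTGATCT TCCCCCGGCT"
LAST_LINE = "CACCATTCAT AA"

print(translate(FIRST_LINE))                     # MQSLSALLNSLETLPDLPPA
print(translate(LAST_LINE, is_cds_start=False))  # HHS
```

The two outputs match the first 20 and last 3 residues of the protein sequence listed above. The last line can be translated on its own because the preceding 22 lines hold 60 bases each, so it begins in frame.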