Gene SbBS512_E1289 details

Gene Information

Locus tag: SbBS512_E1289
Symbol:
ID: 6270518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1178897
End bp: 1180402
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 60%
IMG OID: 641725410
Product: head-tail preconnector protein GP5
Protein accession: YP_001879921
Protein GI: 187733565
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
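
The start, end, gene length, and protein length values in this record agree with one another under the usual inclusive-coordinate convention. The sketch below reproduces that arithmetic. The function names are hypothetical and not part of any database API. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    length = gene_length(1178897, 1180402)
    print(length, "bp")                   # gene length from the coordinates
    print(protein_length(length), "aa")   # protein length from the gene length
```

With the values from this record, the coordinates give 1506 bp and 501 aa, matching the Gene Length and Protein Length fields.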


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGACGTA ATCTTTCACA CATTATTGCC GCAGCATTCA ATGAACCGCT GCTTCTGGAG 
CCCGCCTATG CGCGGGTTTT CTTTTGCGCG CTCGGGCGCG AGATGGGCGC AGCAAGTCTT
TCGGTACCAC AACAGCAGGT ACAGCTTGAT GCTCCCGGAA TGCTGGCTGA AACGGACGAG
TACATGGCCG GAGGTAAACG ACCGGCCCGT GTTTACCGGG TGGTGAACGG TATTGCGGTA
CTGCCGGTGA CCGGCACGCT GGTGCACCGG CTGGGGGGGA TGCGGCCATT TTCCGGAATG
ACTGGCTATG ACGGCATTGT CGCCTGTCTT CAGCAGGCAA TGGCAGATAG CCAGGTGCGG
GGCATACTGC TGGACATTGA CAGTCCGGGC GGGCAGGCCG CCGGCGCGTT TGACTGCGCT
GACATGATTT ACCGCCTCCG GCAGCAGAAG CCGGTCTGGG CACTGTGCAA TGACACGGCC
TGTTCTGCAG CCATGCTGCT GGCGTCGGCC TGCTCCCGAC GGCTGGTTAC CCAGACATCC
CGTATCGGCT CCATTGGCGT GATGATGAGC CATGTCAGCT ATGCCGGTCA TCTGGCGCAG
GCCGGTGTGG ATATCACGCT GATTTATGCC GGGGCGCACA AGGTGGATGG CAATCAGTTT
GAAGCGTTGC CGGCAGAGGT TCGCCAGGAT ATGCAGCAGC GGATTGATGC GGCGCACCGG
ATGTTTGCCG AAAAAGTGGC GATGTATACC GGGTTGTCTG TGGATGCGGT CACGGGAACA
GAGGCCGCCG TTTTTGAAGG TCAGTCCGGC ATTGAGGCCG GGCTGGCGGA TGAATTAATC
AATGCGTCGG ATGCCATCAG TGTGATGGCC ACGGCGCTGA ACAGTAATGT CAGAGGAGGC
ACTATGCCGC AATTAACTGC AACGGAAGCC GCCGTGCAGG AGAACCAGCG AGTGATGGGG
ATCCTGACAT GCCAGGAAGC GAAAGGACGT GAACAGCTTG CCACGATGCT GGCAGGGCAA
CAGGGCATGA GCGTTGAACA GGCCCGGGCG ATTCTGGCCG CGGCGGCACC GCAGCAGCCG
GTGGCATCCG CGCAGAGTGA AGCCGATCGC ATTATGGCGT GTGAAGAAGC GAACGGTCGT
GAACAACTGG CAGCAACGCT GGCGGCGATG CCGGAGATGA CGGTGGAAAA AGCCCGCCCG
ATCCTGGCGG CTGCACCACT GGCGAATGCC GGACCATCAC TCCGTGATCA GATCATGGCA
CTGGATGAGG CAAAAGGGGC TGAGGCGCAG GCTGAACAGC TGGCTGCCTG CCCGGGAATG
ACTGTGGAGA GCGCCCGGGC TGTGCTGGCT GCGGGATCAG GTAAGGCAGA ACCGGTCTCT
GCATCCACAA CCGCCCTGTT TGAACATTTC ATGGCGAACC ATTCACCGGC TGCGGTCCAG
GGGGGCGTGT CACAGGCATC AGAAGACGGT GATGCGGACG TGAAAATGCT CATGGCCATG
CCATGA
 
Protein sequence
MRRNLSHIIA AAFNEPLLLE PAYARVFFCA LGREMGAASL SVPQQQVQLD APGMLAETDE 
YMAGGKRPAR VYRVVNGIAV LPVTGTLVHR LGGMRPFSGM TGYDGIVACL QQAMADSQVR
GILLDIDSPG GQAAGAFDCA DMIYRLRQQK PVWALCNDTA CSAAMLLASA CSRRLVTQTS
RIGSIGVMMS HVSYAGHLAQ AGVDITLIYA GAHKVDGNQF EALPAEVRQD MQQRIDAAHR
MFAEKVAMYT GLSVDAVTGT EAAVFEGQSG IEAGLADELI NASDAISVMA TALNSNVRGG
TMPQLTATEA AVQENQRVMG ILTCQEAKGR EQLATMLAGQ QGMSVEQARA ILAAAAPQQP
VASAQSEADR IMACEEANGR EQLAATLAAM PEMTVEKARP ILAAAPLANA GPSLRDQIMA
LDEAKGAEAQ AEQLAACPGM TVESARAVLA AGSGKAEPVS ASTTALFEHF MANHSPAAVQ
GGVSQASEDG DADVKMLMAM P
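
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below shows one way to translate the gene sequence to the protein sequence under that table. The names `translate_table11`, `CODON_TABLE`, and `TABLE11_STARTS` are hypothetical, chosen for illustration only. It assumes the input is a complete CDS in the coding orientation.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a complete CDS with table 11; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in TABLE11_STARTS:
            residues.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 60 nt of the gene sequence above, in the same 10-base grouping.
    first_line = ("GTGAGACGTA ATCTTTCACA CATTATTGCC "
                  "GCAGCATTCA ATGAACCGCT GCTTCTGGAG")
    print(translate_table11(first_line))
```

Applied to the first line of the gene sequence, this yields the first 20 residues of the protein sequence (MRRNLSHIIA AAFNEPLLLE). Passing the full 1506 nt sequence should reproduce all 501 residues, with the final TGA codon ending translation.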