Gene SbBS512_E0894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0894 
Symbol 
ID6270917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp833877 
End bp835382 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content60% 
IMG OID641725056 
Producthead-tail preconnector protein GP5 
Protein accessionYP_001879583 
Protein GI187733678 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGACGTA ATCTTTCACA CATTATTGCA GCAGCATTCA ATGAACCGCT GCTTCTGGAG 
CCCGCCTATG CGCGGGTTTT CTTTTGCGCG CTGGGGCGCG AGATGGGGGC AGCAAGTCTT
TCGGTACCGC AACAGCAGGT ACAGCTTGAT GCTCCCGGGA TGCTGGCTGA AACGGACGAG
TACATGGCCG GAGGTAAACG ACCGGCCCGT GTTTACCGGG TGGTGAACGG TATTGCGGTA
CTGCCGGTGA CCGGCACGCT GGTGCACCGG CTGGGGGGGA TGCGGCCATT TTCCGGAATG
ACTGGCTATG ACGGCATTGT CACCTGTCTT CAGCAGGCAA TGGCAGATAG CCAGGTGCGG
GGCATACTGC TGGACATTGA CAGTCCGGGC GGGCAGGCCG CCGGCGTGTT TGACTGCGCT
GACATGATTT ACCGCCTCCG GCAGCAGAAG CCGGTCTGGG CACTGTGTAA TGACACGGCC
TGTTCTGCGG CCATGCTGCT GGCGTCGGCC TGCTCCCGAC GGCTGGTTAC CCAGACATCC
CTTATCGGTT CCATTGGCGT GATGATGAGC CATGTCAGCT ATGCCGGTCA TCTGGCGCAG
GCCGGTGTGG ATATCACGCT GATTTACTCA GGGGCGCACA AGGTGGATGG CAATCAGTTT
GAAGCGTTGC CGGCAGAGGT TCGCCAGGAC ATGCAGCAGC GGATTGATGC GGCGCGCCGG
ATGTTTGCCG AAAAAGTGGC GATGTTTACC GGTCTGTCTG TTGATGCAGT CACGGGAACA
GAGGCCGCTG TTTTTGAAGG TCAGTCCGGC ATTGAGGCCG GGCTGGCGGA TGAATTAATC
AATGCGTCGG ATGCCATCAG TGTGATGGCC ACGGCGCTGA ACAGTAATGT CAGAGGAGGC
ACTATGCCGC AATTAACTGC AACGGAAGCC GCCGTGCAGG AGAACCAGCG AGTGATGGGG
ATCCTGACAT GCCAGGAAGC GAAAGGACGT GAACAGCTTG CCACGATGCT GGCAGGGCAA
CAGGGCATGA GCGTTGAACA GGCCCGGGCG ATTCTGGCCG CGGCGGCACC GCAGCAGCCG
GTGGCATCCG CGCAGAGTGA AGCCGATCGC ATTATGGCGT GTGAAGAAGC GAACGGTCGT
GAACAACTGG CGGCAACGCT GGCGGCGATG CCGGAGATGA CGGTGGAAAA AGCCCGCCCG
ATCCTGGCGG CTGCACCACT GGCGGATGCC GGGCCCTCAC TTCGTGATCA GATCATGGCC
CTGGATGAGG CAAAAGGGGC AGAAGCGCAG GCTGAAAAAC TGGCGACCTG CCCGGGAATG
ACCGTGGAGA ACGCCCGGGC TGTGCTGGCT GCGGGATCAG GTAAGGCCGA ACCGGTCTCT
GCATCCACAA CCGCCCTGTT TGAACATTTC ATGGCGAATC ATTCACCGGC AGCGGTGCGG
GGTGGCGTGT CACAGACGTC AGCAGACGGT GATGCGGACG TGAAAATGCT CATGGCCATG
CCATGA
 
Protein sequence
MRRNLSHIIA AAFNEPLLLE PAYARVFFCA LGREMGAASL SVPQQQVQLD APGMLAETDE 
YMAGGKRPAR VYRVVNGIAV LPVTGTLVHR LGGMRPFSGM TGYDGIVTCL QQAMADSQVR
GILLDIDSPG GQAAGVFDCA DMIYRLRQQK PVWALCNDTA CSAAMLLASA CSRRLVTQTS
LIGSIGVMMS HVSYAGHLAQ AGVDITLIYS GAHKVDGNQF EALPAEVRQD MQQRIDAARR
MFAEKVAMFT GLSVDAVTGT EAAVFEGQSG IEAGLADELI NASDAISVMA TALNSNVRGG
TMPQLTATEA AVQENQRVMG ILTCQEAKGR EQLATMLAGQ QGMSVEQARA ILAAAAPQQP
VASAQSEADR IMACEEANGR EQLAATLAAM PEMTVEKARP ILAAAPLADA GPSLRDQIMA
LDEAKGAEAQ AEKLATCPGM TVENARAVLA AGSGKAEPVS ASTTALFEHF MANHSPAAVR
GGVSQTSADG DADVKMLMAM P