Gene SbBS512_E2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2021 
SymbolsppA 
ID6273053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1840081 
End bp1841937 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content52% 
IMG OID641726069 
Productprotease 4 
Protein accessionYP_001880563 
Protein GI187733275 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00705] signal peptide peptidase SppA, 67K type
[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000124959 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAACCC TTTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG TCTGCTGAAT 
TTCGTCCGTG AAATGGTACT TAACCTGTTC TTTATTTTCC TCGTACTGGT TGGTGTGGGG
ATCTGGATGC AGGTCAGTGG TGGTGATTCG AAAGAAACGG CCAGTCGTGG CGCACTGCTG
CTGGACATTT CCGGTGTGAT CGTCGATAAA CCCGACAGTT CTCAGCGGTT TAGTAAATTA
AGCCGCCAGC TGCTTGGTGC CAGTTCCGAT CGTCTGCAGG AAAACTCACT GTTTGATATC
GTCAACACCA TTCGCCAGGC GAAGGACGAC CGCAATATCA CCGGTATCGT GATGGATCTG
AAAAACTTCG CAGGCGGCGA CCAGCCGTCT ATGCAGTACA TCGGCAAAGC CCTGAAAGAG
TTTCGTGACA GCGGGAAACC GGTTTATGCC GTTGGCGAGA ACTACAGCCA GGGGCAATAT
TATCTCGCCA GTTTCGCCAA TAAAATTTGG CTGTCACCGC AAGGCGTGGT TGATCTGCAC
GGTTTTGCTA CCAACGGTCT GTACTACAAA TCGTTGCTGG ATAAGCTGAA AGTTTCCACC
CATGTGTTCC GCGTGGGTAC GTATAAATCT GCCGTTGAAC CGTTTATTCG TGATGATATG
TCACCGGCAG CCCGCGAAGC TGACAGCCGC TGGATTGGTG AGCTGTGGCA AAACTATCTG
AATACTGTTG CCGCTAACCG GCAGATCCCT GCTCAGCAGG TATTCCCTGG CGCGCAAGGG
TTGCTTGAGG GTTTAACCAA AACCGGTGGC GATACCGCGA AATATGCACT GGAAAACAAG
CTGGTCGATG CACTGGCATC GAGTGCCGAA ATCGAAAAAG CTCTGACCAA AGAGTTCGGC
TGGAGTAAGA CTGATAAAAA TTATCGCGCC ATCAGTTATT ACGATTACGC ATTGAAAACG
CCGGCAGATA CCGGTGACAG CATCGGTGTC GTCTTCGCTA ATGGCGCAAT TATGGATGGC
GAGGAAACTC AGGGGAATGT TGGCGGTGAT ACCACTGCGG CACAAATCCG CGACGCTCGC
CTTGACCCGA AAGTGAAAGC GATTGTCCTG CGTGTTAATA GCCCAGGCGG CAGCGTTACC
GCGTCTGAAG TGATTCGCGC TGAACTGGCA GCAGCCCGGG CAGCGGGTAA GCCTGTGGTT
GTATCGATGG GCGGCATGGC GGCATCTGGT GGTTACTGGA TTTCCACGCC AGCTAATTAC
ATTGTGGCTA ACCCCAGCAC CCTGACCGGT TCTATCGGTA TCTTCGGCGT GATCACCACC
GTAGAAAATA GTCTGGATTC GATTGGTGTT CATACTGATG GTGTCTCAAC TTCACCGCTG
GCGGATGTTT CTATCACCAG GGCACTGCCG CCGGAAGCGC AGCAGATGAT GCAATTAAGC
ATTGAGAATG GCTATAAACG CTTTATCACG CTGGTTGCTG ATGCGCGTCA TTCGACGCCG
GAGCAAATTG ATAAAATCGC CCAGGGCCAC GTCTGGACCG GTCAGGATGC AAAAGCTAAC
GGGCTGGTCG ATAGTCTCGG GGATTTCGAT GATGCGGTTG CCAAAGCAGC AGAGCTCGCA
AAAGTGAAAC AGTGGCATCT GGAATACTAC GTTGATGAAC CGACCTTCTT CGACAAAGTG
ATGGACAACA TGTCTGGTTC TGTCCGGGCA ATGTTGCCAG ATGCGTTCCA GGCCATGTTA
CCTGCACCGC TTGCCTCGGT AGCCTCTACC GTTAAAAGTG AAAGCGACAA GCTGGCCGCG
TTTAACGACC CACAAAACCG TTATGCGTTT TGCCTGACCT GCGCCAACGT GCGTTAA
 
Protein sequence
MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL 
LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL
KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH
GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL
NTVAANRQIP AQQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG
WSKTDKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR
LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY
IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQQMMQLS
IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA
KVKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA
FNDPQNRYAF CLTCANVR