Gene SbBS512_E0362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0362 
Symbollon 
ID6270728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp352938 
End bp355292 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content52% 
IMG OID641724600 
ProductDNA-binding ATP-dependent protease La 
Protein accessionYP_001879150 
Protein GI187730322 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000116404 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCTG AGCGTTCTGA ACGCATTGAA ATCCCCGTAT TGCCGCTGCG CGATGTGGTG 
GTTTATCCGC ACATGGTCAT CCCCTTATTT GTCGGGCGGG AAAAATCTAT CCGTTGTCTG
GAAGCGGCGA TGGACCATGA TAAAAAAATT ATGCTGGTCG CGCAGAAAGA AGCTTCAACG
GATGAGCCGG GTGTAAACGA TCTTTTCACC GTCGGGACCG TGGCCTCTAT ATTGCAGATG
CTGAAACTGC CTGACGGCAC CGTCAAAGTG CTGGTCGAGG GGTTACAGCG CGCGCGTATT
TCTGCGCTCT CTGACAATGG CGAACACTTT TCTGCGAAGG CGGAGTATCT GGAGTCGCCG
ACCATTGATG AGCGGGAACA GGAAGTGCTG GTGCGTACTG CAATCAGCCA GTTCGAAGGC
TACATCAAGC TGAACAAAAA AATCCCACCA GAAGTGCTGA CGTCGCTGAA TAGCATCGAC
GATCCGGCGC GTCTGGCGGA TACCATTGCT GCACATATGC CACTGAAACT GGCTGACAAA
CAGTCCGTTC TGGAGATGTC CGACGTTAAC GAACGTCTGG AATATCTGAT GGCAATGATG
GAATCGGAAA TCGATCTGCT GCAGGTTGAG AAACGCATTC GCAACCGCGT TAAAAAGCAG
ATGGAGAAAT CCCAGCGTGA GTACTATCTG AACGAGCAAA TGAAAGCTAT TCAGAAAGAA
CTCGGTGAAA TGGACGACGC GCCGGACGAA AACGAAGCCC TGAAGCGCAA AATCGACGCG
GCGAAGATGC CGAAAGAGGC AAAAGAGAAA GCGGAAGCAG AGTTGCAGAA GCTGAAAATG
ATGTCTCCGA TGTCGGCAGA AGCGACCGTA GTGCGTGGTT ATATCGACTG GATGGTACAG
GTACCGTGGA ATGCGCGCAG CAAGGTCAAA AAAGACCTGC GTCAGGCGCA GGAAATCCTT
GATACCGACC ATTATGGTCT GGAGCGCGTG AAAGATCGCA TCCTTGAGTA CCTTGCGGTT
CAAAGCCGTG TCAACAAAAT CAAGGGACCG ATCCTCTGCC TGGTAGGGCC GCCGGGGGTA
GGTAAAACCT CTCTTGGTCA GTCCATTGCC AAAGCCACCG GGCGTAAATA TGTCCGTATG
GCGCTGGGCG GCGTGCGTGA TGAAGCGGAA ATCCGTGGTC ACCGCCGTAC TTACATCGGT
TCTATGCCGG GTAAACTGAT CCAGAAAATG GCGAAAGTGG GCGTGAAAAA CCCGCTGTTC
CTGCTCGATG AGATCGACAA AATGTCTTCT GACATGCGAG GCGATCCGGC CTCTGCACTG
CTTGAAGTGC TGGATCCAGA GCAGAACGTA GCGTTCAGCG ACCACTACCT GGAAGTGGAT
TACGATCTCA GCGACGTGAT GTTTGTCGCG ACGTCGAACT CCATGAACAT TCCGGCACCG
CTGCTGGATC GTATGGAAGT GATTCGCCTC TCCGGTTATA CCGAAGATGA AAAACTGAAC
ATCGCCAAAC GTCACCTGCT GCCGAAGCAG ATTGAACGTA ATGCACTGAA AAAAGGTGAG
CTGACCGTCG ACGATAGCGC CATTATCGGC ATTATTCGTT ACTACACCCG TGAGGCGGGC
GTGCGTGGTC TGGAGCGTGA AATCTCCAAA CTGTGCCGCA AAGCGGTTAA GCAGTTACTG
CTCGATAAGT CATTAAAACA TATCGAAATT AACGGCGATA ACCTGCATGA CTACCTCGGT
GTTCAGCGTT TCGACTATGG TCGCGCTGAT AACGAAAACT GTGTCGGTCA GGTAACCGGT
CTGGCGTGGA CGGAAGTGGG CGGTGACTTG CTGACCATTG AAACCGCATG TGTTCCGGGT
AAAGGCAAAC TGACCTATAC CGGTTCGCTC GGCGAAGTGA TGCAGGAGTC TATTCAGGCG
GCGTTAACGG TGGTTCGTGC GCGTGCGGAA AAACTGGGGA TCAACCCTGA TTTTTACGAA
AAACGTGACA TCCACGTCCA CGTACCGGAA GGTGCGACGC CGAAAGATGG TCCGAGTGCC
GGTATTGCTA TGTGCACCGC GCTGGTTTCT TGCCTGACCG GTAACCCGGT TCGTGCCGAT
GTGGCAATGA CCGGTGAGAT CACTCTGCGT GGTCAGGTAC TGCCGATCGG TGGTTTGAAA
GAAAAACTCC TGGCAGCGCA TCGCGGCGGG ATTAAAACAG TGCTAATTCC GTTCGAAAAT
AAACGCGATC TGGAAGAGAT TCCTGACAAC GTAATTGCCG ATCTGGACAT TCATCCTGTG
AAGCGCATTG AGGAAGTTCT GACTCTGGCG CTGCAAAATG AACCGTCTGG CATGCAGGTT
GTGACTGCAA AATAG
 
Protein sequence
MNPERSERIE IPVLPLRDVV VYPHMVIPLF VGREKSIRCL EAAMDHDKKI MLVAQKEAST 
DEPGVNDLFT VGTVASILQM LKLPDGTVKV LVEGLQRARI SALSDNGEHF SAKAEYLESP
TIDEREQEVL VRTAISQFEG YIKLNKKIPP EVLTSLNSID DPARLADTIA AHMPLKLADK
QSVLEMSDVN ERLEYLMAMM ESEIDLLQVE KRIRNRVKKQ MEKSQREYYL NEQMKAIQKE
LGEMDDAPDE NEALKRKIDA AKMPKEAKEK AEAELQKLKM MSPMSAEATV VRGYIDWMVQ
VPWNARSKVK KDLRQAQEIL DTDHYGLERV KDRILEYLAV QSRVNKIKGP ILCLVGPPGV
GKTSLGQSIA KATGRKYVRM ALGGVRDEAE IRGHRRTYIG SMPGKLIQKM AKVGVKNPLF
LLDEIDKMSS DMRGDPASAL LEVLDPEQNV AFSDHYLEVD YDLSDVMFVA TSNSMNIPAP
LLDRMEVIRL SGYTEDEKLN IAKRHLLPKQ IERNALKKGE LTVDDSAIIG IIRYYTREAG
VRGLEREISK LCRKAVKQLL LDKSLKHIEI NGDNLHDYLG VQRFDYGRAD NENCVGQVTG
LAWTEVGGDL LTIETACVPG KGKLTYTGSL GEVMQESIQA ALTVVRARAE KLGINPDFYE
KRDIHVHVPE GATPKDGPSA GIAMCTALVS CLTGNPVRAD VAMTGEITLR GQVLPIGGLK
EKLLAAHRGG IKTVLIPFEN KRDLEEIPDN VIADLDIHPV KRIEEVLTLA LQNEPSGMQV
VTAK