Gene SbBS512_E3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3041 
SymbolptrA 
ID6271154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2839555 
End bp2842443 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content51% 
IMG OID641726974 
Productprotease III 
Protein accessionYP_001881438 
Protein GI187733842 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGCA GCACTTGGTT CAAAGCATTA TTGTTGTTAG TTGCCCTTTG GGCACCCTTA 
AGTCAGGCAG AAACGGGATG GCAGCCGATT CAGGAAACCA TCCGTAAAAG TGATAAAGAT
AACCGCCAGT ATCAGGCTAT ACGTCTGGAT AACGGTATGG TGGTCTTGCT GGTTTCTGAT
CCGCAGGCAG TTAAATCGCT CTCGGCGCTG GTGGTGCCCG TTGGGTCGCT GGAAGATCCC
GAGGCGTACC AGGGGCTGGC ACATTACCTT GAACATATGA GTCTGATGGG GTCGAAAAAG
TACCCGCAGG CTGACAGTCT GGCCGAATAT CTCAAAATGC ACGGCGGTAG TCACAATGCC
AGCACTGCGC CGTATCGCAC GGCTTTCTAT CTGGAAGTTG AGAACGACGC CTTGCCTGGT
GCGGTAGACC GCCTGGCCGA TGCTATTGCT GAACCTTTGC TCGACAAGAA ATATGCCGAA
CGTGAGCGTA ATGCGGTGAA CGCTGAATTA ACCATGGCGC GTACGCGTGA CGGGATGCGC
ATGGCACAGG TCAGCGCAGA AACCATTAAC CCGGCACACC CCGGTTCAAA GTTTTCTGGT
GGTAACCTCG AAACTTTAAG CGATAAACCT GGTAATCCGG TGCAGCAGGC GCTGAAAGAT
TTCCACGAGA AGTACTATTC CGCCAATCTG ATGAAGGCGG TTATTTACAG CAATAAACCG
CTGCCGGAGT TGGCAAAAAT GGCGGCGGAC ACCTTTGGTC GCGTGCCGAA CAAAGAGAGC
AAAAAACCGG AAATCACCGT GCCGGTAGTC ACCGACGCGC AAAAGGGCAT TATCATTCAT
TACGTCCCGG CACTGCCGCG TAAAGTTCTG CGCGTTGAGT TTCGCATCGA TAACAACTCA
GCGAAGTTCC GTAGTAAAAC GGATGAATTG ATTACCTATC TGATTGGTAA TCGCAGCCCA
GGTACACTTT CCGACTGGCT GCAAAAGCAG GGATTAGTTG AGGGCATTAG CGCCAACTCC
GATCCTATCG TCAACGGCAA CAGCGGCGTA TTAGCGATCT CTGCGTCTTT AACCGATAAA
GGCCTGGCTA ATCGCGATCA GGTTGTGGCG GCAATTTTTA GCTATCTCAA TCTGTTACGT
GAAAAAGGCA TTGATAAACA ATACTTCGAT GAACTGGCGA ATGTGCTGGA TATCGACTTC
CGTTATCCGT CGATCACCCG TGATATGGAT TACGTCGAAT GGCTGGCAGA TACCATGATT
CGCGTTCCGG TTGAGCATAC GCTGGATGCA GTCAATATTG CCGATCGGTA CGATGCTAAA
GCAGTAAAAG AACGTCTGGC GATGATGACG CCGCAGAATG CGCGTATCTG GTATATCAGC
CCGAAAGAGC CGCACAATAA AACGGCTTAT TTTGTCGATG CGCCGTATCA GGTCGATAAA
ATCAGCGAAC AAACTTTCGC TGACTGGCAG CAAAAAGCTG CCAATATTGC GCTCTCCTTA
CCGGAGCTTA ACCCCTATAT TCCTGACGAT TTCTCGCTGA TTAAGTCAGA GAAGAAATAT
GACCATCCAG AGCTGATTGT TGATGAGTCG AATCTGCGCG TGGTGTATGC GCCAAGCCGT
TATTTTTCCA GCGAACCCAA AGCTGATGTC AGCCTGATTT TGCGTAATCC GAAAGCCATG
GACAGCGCCC GCAATCAGGT GATGTTTGCG CTCAATGATT ATCTCGCAGG GCTGGCGCTT
GATCAGTTAA GCAACCAGGC GTCGGTTGGT GGCATAAGTT TTTCCACCAA CGCTAACAAC
GGCCTTATGG TTAATGCTAA TGGTTACACC CAGCGTCTGC CGCAGCTGTT CCAGGCATTG
CTCGAGGGGT ACTTTAGCTA TACCGCTACG GAAGATCAGC TTGAGCAGGC GAAGTCTTGG
TATAACCAGA TGATGGATTC CGCAGAAAAG GGTAAAGCGT TTGAGCAGGC GATTATGCCC
GCGCAGATGC TCTCGCAAGT GCCGTACTTC TCGCGAGATG AACGGCGTAA AATTTTGCCC
TCCATTACGT TGAAAGAGGT GCTGGCCTAT CGCGACGCCT TAAAATCAGG GGCTCGACCA
GAGTTTATGG TTATCGGCAA CATGACCGAG GCCCAGGCAA CAACGCAGGC ACGCGATGTG
CAAAAACAGT TGGGCGCTGA TGGTTCAGAG TGGTGTCGAA ACAAAGATGT AGTGGTCGAT
AAAAAACAAT CCGTCATCTT TGAAAAAGCC GGTAACAGCA CCGACTCCGC ACTGGCAGCG
GTATTTGTAC CGACTGGCTA CGATGAATAC ACCAGCTCAG CCTATAGCTC TCTGTTGGGG
CAGATCGTAC AGCCGTGGTT CTACAATCAG TTGCGTACCG AAGAACAATT GGGCTATGCC
GTGTTTGCGT TTCCAATGAG CGTGGGGCGT CAGTGGGGCA TGGGCTTCCT TTTGCAAAGC
AATGATAAAC AGCCTTCATT CTTGTGGGAG CGTTACAAGG CGTTTTTCCC AACCGCAGAG
GCAAAATTGC GAGCGATGAA GCCAGATGAG TTTGCGCAAA TCCAGCAGGC GGTAATTACC
CAGATGCTGC AGGCACCGCA AACGCTCGGC GAAGAAGCAT CGAAGTTAAG TAAAGATTTC
GATCGCGGCA ATATGCGCTT CGATTCGCGT GATAAAATCG TGGCCCAGAT AAAACTGCTG
ACGCCGCAAA AACTTGCTGA TTTCTTCCAT CAGGCGGTGG TCGAGCCGCA AGGCATGGCT
ATTCTGTCGC AGATTTCCGG CAGCCAGAAC GGGAAAGCCG AATATGTACA CCCTGAAGGC
TGGAAAGTGT GGGAGAACGT CAGCGCGTTG CAGCAAACAA TGCCCCTGAT GAGTGAAAAG
AATGAGTGA
 
Protein sequence
MPRSTWFKAL LLLVALWAPL SQAETGWQPI QETIRKSDKD NRQYQAIRLD NGMVVLLVSD 
PQAVKSLSAL VVPVGSLEDP EAYQGLAHYL EHMSLMGSKK YPQADSLAEY LKMHGGSHNA
STAPYRTAFY LEVENDALPG AVDRLADAIA EPLLDKKYAE RERNAVNAEL TMARTRDGMR
MAQVSAETIN PAHPGSKFSG GNLETLSDKP GNPVQQALKD FHEKYYSANL MKAVIYSNKP
LPELAKMAAD TFGRVPNKES KKPEITVPVV TDAQKGIIIH YVPALPRKVL RVEFRIDNNS
AKFRSKTDEL ITYLIGNRSP GTLSDWLQKQ GLVEGISANS DPIVNGNSGV LAISASLTDK
GLANRDQVVA AIFSYLNLLR EKGIDKQYFD ELANVLDIDF RYPSITRDMD YVEWLADTMI
RVPVEHTLDA VNIADRYDAK AVKERLAMMT PQNARIWYIS PKEPHNKTAY FVDAPYQVDK
ISEQTFADWQ QKAANIALSL PELNPYIPDD FSLIKSEKKY DHPELIVDES NLRVVYAPSR
YFSSEPKADV SLILRNPKAM DSARNQVMFA LNDYLAGLAL DQLSNQASVG GISFSTNANN
GLMVNANGYT QRLPQLFQAL LEGYFSYTAT EDQLEQAKSW YNQMMDSAEK GKAFEQAIMP
AQMLSQVPYF SRDERRKILP SITLKEVLAY RDALKSGARP EFMVIGNMTE AQATTQARDV
QKQLGADGSE WCRNKDVVVD KKQSVIFEKA GNSTDSALAA VFVPTGYDEY TSSAYSSLLG
QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSFLWE RYKAFFPTAE
AKLRAMKPDE FAQIQQAVIT QMLQAPQTLG EEASKLSKDF DRGNMRFDSR DKIVAQIKLL
TPQKLADFFH QAVVEPQGMA ILSQISGSQN GKAEYVHPEG WKVWENVSAL QQTMPLMSEK
NE