Gene EcSMS35_2968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2968 
SymbolptrA 
ID6143373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3044627 
End bp3047515 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content51% 
IMG OID641617837 
Productprotease III 
Protein accessionYP_001744989 
Protein GI170682458 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.436713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCA GCATCTGGTT CAAAGCATTA TTGTTGTTTG TTGCCCTTTG GGCACCCTTA 
AGTCAGGCAG AAACGGGATG GCAGCCGATT CAGGAAACCA TCCGTAAAAG TGATAAAGAT
AACCGCCAGT ATCAGGCTAT ACGTCTGGAT AACGGTATGG TGGTCTTGCT GGTTTCTGAT
CCGCAGGCGG TTAAATCGCT CTCGGCGCTG GTGGTGCCCG TTGGGTCGCT GGAAGATCCC
GAGGCGTATC AGGGGCTGGC GCATTACCTT GAACACATGA GTCTGATGGG GTCGAAAAAG
TACCCGCAGG CTGACAGTCT GGCCGAATAT CTCAAAATGC ACGGCGGCAG TCACAATGCC
AGCACGGCAC CGTATCGCAC GGCTTTCTAT CTGGAAGTTG AGAACGACGC CTTGCCCGGT
GCGGTAGACC GCCTGGCGGA CGCTATTGCA GAACCCTTGC TCGACAAGAA ATACGCCGAA
CGTGAACGTA ATGCAGTGAA TGCCGAATTA ACCATGGCGC GTACGCGTGA CGGGATGCGC
ATGGCACAGG TCAGCGCAGA AACCATTAAC CCGGCACACC CCGGTTCAAA GTTTTCTGGT
GGTAACCTCG AAACCTTAAG CGACAAACCA GGTAATCCGG TACAGCAGGC GCTGAAAGAT
TTCCACGAGA AGTACTATTC CGCCAATCTG ATGAAGGCGG TTATTTACAG TAATAAACCG
TTGCCGGAGT TGGCAAAAAT GGCGGCGGAC ACCTTTGGTC GCGTGCCGAA CAAAGAGAGC
AAAAAACCGG AAATCACCGT GCCGGTAGTC ACCGACGCGC AAAAGGGCAT TATCATTCAT
TACGTCCCGG CGTTGCCGCG TAAAGTTCTG CGCGTTGAGT TTCGCATCGA TAACAATTCA
GCGAAGTTCC GTAGTAAAAC GGATGAATTG ATTACCTATC TGATTGGTAA TCGCAGCCCA
GGTACACTTT CTGACTGGCT GCAAAAGCAG GGATTAGTTG AGGGTATTAG CGCCAATTCC
GATCCTATCG TCAACGGCAA CAGCGGCGTA TTAGCGATCT CTGCGTCTTT AACCGATAAA
GGTCTGGCGA ATCGCGATCA GGTTGTGGCG GCCATTTTTA GCTACCTCAA TCTGTTACGC
GAAAAAGGGA TTGATAAACA ATACTTCGAT GAACTGGCGA ATGTGCTGGA TATCGACTTC
CGTTATCCGT CAATCACCCG TGATATGGAT TACGTCGAAT GGCTGGCAGA TACCATGATT
CGCGTTCCTG TTGAGCATAC GCTGGATGCA GTCAATATTG CCGATCGGTA CGACGCTAAA
GCAGTAAAAG AACGCCTGGC GATGATGACG CCGCAGAATG CGCGTATCTG GTATATCAGC
CCGAAAGAGC CGCACAACAA AACGGCTTAC TTTGTCGATG CGCCGTATCA GGTCGATAAA
ATTAGCGCAC AAACTTTCGC TGACTGGCAG AAAAAAGCCG CCGACATTGC GCTCTCTTTG
CCAGAGCTTA ACCCCTATAT TCCTGACGAT TTCTCGCTGA TTAAGTCAGA GAAGAAATAT
GACCATCCAG AGCTGATTGT TGATGAGTCG AATCTGCGTG TGGTGTATGC GCCAAGCCGT
TATTTTTCCA GCGAACCCAA AGCTGATGTC AGCCTGATTT TGCGTAATCC AAAAGCCATG
GATAGCGCCC GCAATCAGGT GATGTTTGCG CTCAATGATT ATCTCGCAGG GCTGGCGCTT
GATCAGTTAA GCAACCAGGC GTCGGTTGGT GGCATAAGTT TTTCCACCAA TGCTAACAAC
GGCCTTATGG TTAATGCCAA TGGTTACACC CAGCGCCTGC CGCAGCTGTT CCAGGCATTG
CTCGAGGGGT ACTTTAGCTA TACCGCTACG GAAGATCAGC TTGAGCAGGC GAAGTCCTGG
TATAACCAGA TGATGGATTC TGCAGAAAAG GGCAAAGCGT TCGAGCAGGC GATTATGCCC
GCGCAGATGC TCTCGCAAGT ACCATACTTC TCGCGAGATG AACGGCGCAA AATTTTGCCC
TCCATTACTT TGAAAGAGGT GATGGCCTAT CGCGACGCCT TAAAATCAGG GGCTCGACCA
GAGTTTATGG TTATCGGCAA CATGACCGAG GCCCAGGCAA CAACGCTGGC ACGCGATGTG
CAAAAACAGT TGGGCGCTGA TGGTTCGGAG TGGTGTCGTA ACAAAGATGT CGTGGTCGAT
AAAAAACAAT CCGTCATCTT TGAAAAAGCC GGTAACAGCA CCGACTCCGC ACTGGCAGCG
GTATTTGTAC CGACTGGCTA CGATGAATAC ACCAGTTCAG CGTATAGTTC TCTGTTGGGG
CAGATCGTAC AGCCGTGGTT CTACAATCAG TTGCGTACCG AAGAACAGTT GGGCTATGCC
GTGTTTGCGT TTCCAATGAG CGTGGGGCGT CAGTGGGGCA TGGGCTTCCT GTTGCAAAGC
AATGATAAAC AGCCTTCATT CTTGTGGGAG CGTTACAAGG CGTTTTTCCC GACCGCAGAA
GCGAAACTGC GGGCGATGAA GCCAGAAGAG TTTGCGCAAA TCCAGCAGGC GGTAATTACC
CAGATGCTGC AGGCACCGCA AACGCTCGGC GAAGAAGCAT CGAAGTTAAG TAAAGATTTC
GATCGCGGCA ATATGCGCTT CGATTCGCGT GATAAAATCG TGGCCCAGAT AAAACTGCTG
ACGCCGCAAA AACTTGCTGA TTTCTTCCAT CAGGCGGTGG TCGAGCCGCA AGGTATGGCT
ATTCTGTCGC AGATTTCCGG CAGCCAGAAC GGGAAAGCCG AATATGTGCA TCCTGAAGGC
TGGAAAGTGT GGGAGAACGT CAGCGCGTTG CAGCAAACAA TGCCCCTGAT GAGTGAAAAG
AATGAGTGA
 
Protein sequence
MPRSIWFKAL LLFVALWAPL SQAETGWQPI QETIRKSDKD NRQYQAIRLD NGMVVLLVSD 
PQAVKSLSAL VVPVGSLEDP EAYQGLAHYL EHMSLMGSKK YPQADSLAEY LKMHGGSHNA
STAPYRTAFY LEVENDALPG AVDRLADAIA EPLLDKKYAE RERNAVNAEL TMARTRDGMR
MAQVSAETIN PAHPGSKFSG GNLETLSDKP GNPVQQALKD FHEKYYSANL MKAVIYSNKP
LPELAKMAAD TFGRVPNKES KKPEITVPVV TDAQKGIIIH YVPALPRKVL RVEFRIDNNS
AKFRSKTDEL ITYLIGNRSP GTLSDWLQKQ GLVEGISANS DPIVNGNSGV LAISASLTDK
GLANRDQVVA AIFSYLNLLR EKGIDKQYFD ELANVLDIDF RYPSITRDMD YVEWLADTMI
RVPVEHTLDA VNIADRYDAK AVKERLAMMT PQNARIWYIS PKEPHNKTAY FVDAPYQVDK
ISAQTFADWQ KKAADIALSL PELNPYIPDD FSLIKSEKKY DHPELIVDES NLRVVYAPSR
YFSSEPKADV SLILRNPKAM DSARNQVMFA LNDYLAGLAL DQLSNQASVG GISFSTNANN
GLMVNANGYT QRLPQLFQAL LEGYFSYTAT EDQLEQAKSW YNQMMDSAEK GKAFEQAIMP
AQMLSQVPYF SRDERRKILP SITLKEVMAY RDALKSGARP EFMVIGNMTE AQATTLARDV
QKQLGADGSE WCRNKDVVVD KKQSVIFEKA GNSTDSALAA VFVPTGYDEY TSSAYSSLLG
QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSFLWE RYKAFFPTAE
AKLRAMKPEE FAQIQQAVIT QMLQAPQTLG EEASKLSKDF DRGNMRFDSR DKIVAQIKLL
TPQKLADFFH QAVVEPQGMA ILSQISGSQN GKAEYVHPEG WKVWENVSAL QQTMPLMSEK
NE