Gene EcE24377A_3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3141 
SymbolptrA 
ID5588723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3152687 
End bp3155575 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content51% 
IMG OID640926783 
Productprotease III 
Protein accessionYP_001464156 
Protein GI157156589 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGCA GCACCTGGTT CAAAGCATTA TTGTTGTTAG TTGCCCTTTG GGCACCCTTA 
AGTCAGGCAG AAACGGGATG GCAGCCGATT CAGGAAACCA TCCGTAAAAG TGATAAAGAT
AACCGCCAGT ATCAGGCTAT ACGTCTGGAT AACGGTATGG TGGTCTTGCT GGTTTCTGAT
CCGCAGGCAG TTAAATCGCT CTCGGCGCTG GTGGTGCCCG TTGGGTCGCT GGAAGATCCC
GAGGCGTACC AGGGGCTGGC ACATTACCTT GAACATATGA GTCTGATGGG GTCGAAAAAG
TACCCGCAGG CTGACAGTCT GGCCGAATAT CTCAAAATGC ACGGCGGTAG TCACAATGCC
AGCACTGCGC CGTATCGCAC GGCTTTCTAT CTGGAAGTTG AGAACGACGC CTTGCCTGGT
GCGGTAGACC GCCTGGCTGA TGCTATTGCA GAACCCTTGC TCGACAAGAA ATACGCCGAA
CGTGAGCGTA ATGCAGTGAA CGCTGAATTA ACCATGGCGC GTACGCGTGA CGGGATGCGC
ATGGCACAGG TCAGCGCAGA AACCATTAAC CCGGCACACC CCGGTTCAAA GTTTTCTGGT
GGTAACCTCG AAACTTTAAG CGACAAACCT GGTAATCCGG TGCAGCAGGC GCTGAAAGAT
TTCCACGAGA AGTACTATTC CGCCAATCTG ATGAAGGCGG TTATTTACAG CAATAAACCG
CTGCCGGAGT TGGCAAAAAT GGCGGCGGAC ACCTTTGGTC GCGTGCCGAA CAAAGAGAGC
AAAAAACCGG AAATCACCGT GCCGGTAGTC ACCGACGCGC AAAAGGGCAT TATCATTCAT
TACGTCCCGG CGCTGCCGCG TAAAGTGTTG CGCGTAGAGT TTCGTATTGA TAACAACTCG
GCGAAATTCC GTAGCAAAAC CGATGAATTG ATTACCTATC TGATTGGTAA TCGCAGCCCA
GGTACACTTT CCGACTGGCT GCAAAAGCAG GGATTAGTTG AGGGCATTAG CGCCAACTCC
GATCCTATCG TCAACGGCAA CAGCGGCGTA TTAGCGATCT CTGCGTCTTT AACCGATAAA
GGTCTGGCGA ATCGCGATCA GGTTGTGGCG GCTATTTTTA GCTACCTCAA TTTGTTACGT
GAAAAAGGGA TCGATAAACA ATACTTCGAT GAACTGGCGA ATGTGCTGGA TATCGACTTC
CGTTATCCGT CAATCACCCG TGATATGGAT TACGTCGAAT GGCTGGCAGA TACCATGATT
CGCGTTCCTG TTGAGCATAC GCTGGATGCA GTCAATATTG CCGATCGGTA CGATGCTAAA
GCAGTAAAAG AACGTCTGGC GATGATGACG CCGCAGAATG CGCGTATCTG GTATATCAGC
CCGAAAGAGC CGCACAATAA AACGGCTTAT TTTGTCGATG CGCCGTATCA GGTCGATAAA
ATCAGCGCAC AAACTTTCGC CGACTGGCAG AAAAAAGCCG CCGACATTGC GCTCTCTTTG
CCAGAGCTTA ACCCCTATAT TCCTGACGAT TTCTCGCTGA TTAAGTCAGA GAAGAAATAT
GACCATCCAG AGCTGATTGT TGATGAGTCG AATCTGCGTG TGGTGTATGC GCCAAGCCGT
TATTTTTCCA GCGAACCCAA AGCTGATGTC AGCCTGATTT TGCGTAATCC GAAAGCCATG
GACAGCGCCC GCAATCAGGT GATGTTTGCG CTCAATGATT ATCTCGCAGG GCTGGCGCTT
GATCAGTTAA GCAACCAGGC GTCGGTTGGT GGCATAAGTT TTTCCACCAA CGCTAACAAC
GGCCTTATGG TTAATGCTAA TGGTTACACC CAGCGTCTGC CGCAGCTGTT CCAGGCATTG
CTCGAGGGGT ACTTTAGCTA TACCGCCACG GAAGATCAGC TTGAGCAGGC AAAATCCTGG
TATAACCAGA TGATGGATTC CGCAGAAAAG GGCAAAGCGT TTGAGCAGGC GATTATGCCC
GCGCAGATGC TCTCGCAAGT GCCGTACTTC TCGCGAGATG AACGGCGCAA AATTTTGCCC
TCCATTACGT TGAAAGAGGT GCTGGCCTAT CGCGACGCCT TAAAATCAGG GGCTCGACCA
GAGTTTATGG TTATCGGCAA CATGACCGAG GCCCAGGCAA CAACGCTGGC ACGCGATGTG
CAAAAACAGT TGGGCGCTGA TGGTTCAGAA TGGTGTCGTA ACAAAGATGT CGTGGTCGAT
AAAAAACAAT CCGTCATCTT TGAAAAAGCC GGTAACAGCA CCGACTCCGC ACTGGCAGCG
GTATTTGTAC CGACTGGCTA CGATGAATAC ACCAGCTCAG CCTATAGCTC TCTGTTGGGG
CAGATCGTAC AGCCGTGGTT CTACAATCAG TTGCGTACCG AAGAACAGTT GGGCTATGCC
GTGTTTGCAT TCCCAATGAG CGTGGGGCGT CAGTGGGGCA TGGGCTTCCT TTTGCAAAGC
AATGATAAAC AGCCTTCATT CTTGTGGGAG CGTTACAAGG CGTTTTTCCC AACCGCAGAG
GCAAAATTGC GGGCGATGAA GCCAGAAGAG TTTGCGCAAA TCCAGCAGGC GGTAATTACC
CAGATGCTGC AGGCACCGCA AACGCTCGGC GAAGAAGCAT CGAAGTTAAG TAAAGATTTC
GATCGCGGCA ATATGCGCTT CGATTCGCGT GATAAAATCG TGGCCCAGAT AAAACTGCTG
ACGCCGCAAA AACTTGCTGA TTTCTTCCAT CAGGCGGTGG TCGAGCCGCA AGGCATGGCT
ATTCTGTCGC AGATTTCCGG CAGCCAGAAC GGGAAAGCCG AATATGTGCA TCCTGAAGGC
TGGAAAGTGT GGGAGAACGT CAGCGCGTTG CAGCAAACAA TGCCCCTGAT GAGTGAAAAG
AATGAGTGA
 
Protein sequence
MPRSTWFKAL LLLVALWAPL SQAETGWQPI QETIRKSDKD NRQYQAIRLD NGMVVLLVSD 
PQAVKSLSAL VVPVGSLEDP EAYQGLAHYL EHMSLMGSKK YPQADSLAEY LKMHGGSHNA
STAPYRTAFY LEVENDALPG AVDRLADAIA EPLLDKKYAE RERNAVNAEL TMARTRDGMR
MAQVSAETIN PAHPGSKFSG GNLETLSDKP GNPVQQALKD FHEKYYSANL MKAVIYSNKP
LPELAKMAAD TFGRVPNKES KKPEITVPVV TDAQKGIIIH YVPALPRKVL RVEFRIDNNS
AKFRSKTDEL ITYLIGNRSP GTLSDWLQKQ GLVEGISANS DPIVNGNSGV LAISASLTDK
GLANRDQVVA AIFSYLNLLR EKGIDKQYFD ELANVLDIDF RYPSITRDMD YVEWLADTMI
RVPVEHTLDA VNIADRYDAK AVKERLAMMT PQNARIWYIS PKEPHNKTAY FVDAPYQVDK
ISAQTFADWQ KKAADIALSL PELNPYIPDD FSLIKSEKKY DHPELIVDES NLRVVYAPSR
YFSSEPKADV SLILRNPKAM DSARNQVMFA LNDYLAGLAL DQLSNQASVG GISFSTNANN
GLMVNANGYT QRLPQLFQAL LEGYFSYTAT EDQLEQAKSW YNQMMDSAEK GKAFEQAIMP
AQMLSQVPYF SRDERRKILP SITLKEVLAY RDALKSGARP EFMVIGNMTE AQATTLARDV
QKQLGADGSE WCRNKDVVVD KKQSVIFEKA GNSTDSALAA VFVPTGYDEY TSSAYSSLLG
QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSFLWE RYKAFFPTAE
AKLRAMKPEE FAQIQQAVIT QMLQAPQTLG EEASKLSKDF DRGNMRFDSR DKIVAQIKLL
TPQKLADFFH QAVVEPQGMA ILSQISGSQN GKAEYVHPEG WKVWENVSAL QQTMPLMSEK
NE