Gene EcE24377A_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2059 
Symbolprc 
ID5589847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2040365 
End bp2042413 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content51% 
IMG OID640925729 
Productcarboxy-terminal protease 
Protein accessionYP_001463132 
Protein GI157155122 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.8938e-09 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATGT TTTTTAGGCT TACCGCGTTA GCTGGCCTGC TTGCAATAGC AGGCCAGACC 
TTCGCTGTAG AAGATATCAC GCGTGCTGAT CAAATTCCGG TATTAAAGGA AGAGACGCAG
CATGCGACGG TAAGTGAGCG CGTAACGTCG CGCTTCACCC GTTCTCATTA TCGCCAGTTC
GACCTCGATC AGGCATTTTC GGCCAAAATC TTTGACCGCT ACCTGAATCT GCTCGATTAC
AGCCACAACG TGCTGCTGGC AAGCGATGTT GAACAGTTCG CGAAAAAGAA AACCGAGTTA
GGCGATGAAC TGCGTTCAGG CAAACTCGAC GTTTTCTACG ATCTCTACAA TCTGGCGCAA
AAGCGCCGTT TTGAGCGTTA CCAGTACGCT TTGTCGGTAC TGGAAAAGCC GATGGATTTC
ACCGGCAACG ACACTTATAA CCTTGACCGC AGCAAAGCGC CCTGGCCGAA AAACGAGGCT
GAGTTGAACG CGCTGTGGGA CAGTAAAGTC AAATTCGACG AGTTAAGCCT GAAGCTGGCA
GGAAAAACGG ATAAAGAAAT TCGTGAAACC CTGACACGCC GCTACAAATT TGCCATTCGT
CGTCTGGCGC AAACCAACAG CGAAGATGTT TTCTCGCTGG CAATGACGGC GTTTGCGCGT
GAAATCGACC CGCATACCAA CTATCTTTCC CCGCGTAATA CCGAACAGTT CAACACTGAA
ATGAGTTTGT CGCTGGAAGG TATTGGCGCA GTGCTGCAAA TGGATGATGA CTACACCGTT
ATCAATTCGA TGGTGGCAGG TGGTCCGGCA GCGAAGAGTA AAGCTATCAG CGTTGGTGAC
AAAATTGTCG GTGTTGGTCA AACAGGCAAG CCGATGGTTG ACGTGATTGG CTGGCGTCTT
GATGATGTGG TTGCCTTAAT TAAAGGGCCG AAGGGCAGTA AAGTTCGTCT GGAAATTTTA
CCTGCTGGTA AAGGGACCAA GACCCGCACT GTAACATTGA CCCGTGAACG TATTCGTCTC
GAAGACCGCG CGGTTAAAAT GTCGGTGAAG ACCGTCGGTA AAGAGAAAGT CGGCGTGCTG
GATATTCCGG GCTTCTATGT GGGTTTGACA GACGATGTCA AAGTTCAACT GCAGAAACTG
GAAAAACAGA ATGTCAGCAG TGTGATCATC GACCTGCGTA GCAATGGCGG TGGGGCGCTG
ACCGAAGCGG TATCGCTCTC AGGTCTGTTT ATTCCTGCGG GTCCCATTGT TCAGGTCCGC
GATAACAACG GCAAGGTTCG TGAAGACAGC GATACCGACG GGCAGGTGTT CTATAAAGGC
CCGCTGGTGG TACTGGTTGA CCGTTTCAGT GCTTCGGCTT CAGAAATCTT TGCCGCGGCA
ATGCAGGATT ACGGTCGTGC GCTGGTTGTG GGTGAACCGA CGTTTGGTAA AGGCACCGTT
CAGCAATACC GTTCATTGAA CCGTATTTAC GATCAGATGT TACGTCCTGA ATGGCCAGCG
CTGGGTTCTG TGCAGTACAC GATCCAGAAA TTCTATCGCG TTAACGGCGG CAGTACGCAA
CGTAAAGGCG TAACGCCAGA CATCATCATG CCGACGGGTA ATGAAGAAAC GGAAACGGGT
GAGAAATTCG AAGATAACGC GCTGCCGTGG GATAGCATTG ATGCCGCGAC TTATGTGAAA
TCAGGCGATT TAACGGCCTT TGGGCCGGAG CTGCTGAAGG AACATAATGC GCGTATCGCG
AAAGATCCTG AGTTCCAGAA CATCATGAAG GATATCGCGC GCTTCAACGC TATGAAGGAC
AAGCGCAATA TCGTTTCTCT GAACTACGCT GTGCGTGAGA AAGAGAATAA TGAAGATGAT
GCGACGCGTC TGGCGCGTTT GAACGAACGC TTTAAACGCG AAGGTAAACC GGAGTTGAAG
AAGCTGGATG ATCTACCTAA AGATTACCAG GCGCCGGATC CTTATCTGGA TGAGACGGTG
AATATCGCAC TCGATCTGGC GAAGCTTGAA AAAGCCAGAC CCGCGGAACA ACCCGCTCCC
GTCAAGTAA
 
Protein sequence
MNMFFRLTAL AGLLAIAGQT FAVEDITRAD QIPVLKEETQ HATVSERVTS RFTRSHYRQF 
DLDQAFSAKI FDRYLNLLDY SHNVLLASDV EQFAKKKTEL GDELRSGKLD VFYDLYNLAQ
KRRFERYQYA LSVLEKPMDF TGNDTYNLDR SKAPWPKNEA ELNALWDSKV KFDELSLKLA
GKTDKEIRET LTRRYKFAIR RLAQTNSEDV FSLAMTAFAR EIDPHTNYLS PRNTEQFNTE
MSLSLEGIGA VLQMDDDYTV INSMVAGGPA AKSKAISVGD KIVGVGQTGK PMVDVIGWRL
DDVVALIKGP KGSKVRLEIL PAGKGTKTRT VTLTRERIRL EDRAVKMSVK TVGKEKVGVL
DIPGFYVGLT DDVKVQLQKL EKQNVSSVII DLRSNGGGAL TEAVSLSGLF IPAGPIVQVR
DNNGKVREDS DTDGQVFYKG PLVVLVDRFS ASASEIFAAA MQDYGRALVV GEPTFGKGTV
QQYRSLNRIY DQMLRPEWPA LGSVQYTIQK FYRVNGGSTQ RKGVTPDIIM PTGNEETETG
EKFEDNALPW DSIDAATYVK SGDLTAFGPE LLKEHNARIA KDPEFQNIMK DIARFNAMKD
KRNIVSLNYA VREKENNEDD ATRLARLNER FKREGKPELK KLDDLPKDYQ APDPYLDETV
NIALDLAKLE KARPAEQPAP VK