Gene EcHS_A1921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1921 
Symbolprc 
ID5592467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1931745 
End bp1933793 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content51% 
IMG OID640921064 
Productcarboxy-terminal protease 
Protein accessionYP_001458615 
Protein GI157161297 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000149225 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATGT TTTTTAGGCT TACCGCGTTA GCTGGCCTGC TTGCAATAGC AGGCCAGACC 
TTCGCTGTAG AAGATATCAC GCGTGCTGAT CAAATTCCGG TATTAAAGGA AGAGACGCAG
CATGCGACGG TAAGTGAGCG CGTAACGTCG CGCTTCACCC GTTCTCATTA TCGCCAGTTC
GACCTCGATC AGGCATTTTC GGCCAAAATC TTTGACCGCT ACCTGAATCT GCTCGATTAC
AGCCACAACG TGCTGCTGGC AAGCGATGTT GAACAGTTCG CGAAAAAGAA AACCGAGTTA
GGCGATGAAC TGCGTTCAGG CAAACTCGAC GTTTTCTACG ATCTCTACAA TCTGGCGCAA
AAGCGCCGTT TTGAGCGTTA CCAGTACGCT TTGTCGGTAC TGGAAAAGCC GATGGATTTC
ACCGGCAACG ACACTTATAA CCTTGACCGC AGCAAAGCGC CCTGGCCGAA AAACGAGGCT
GAGTTGAACG CGCTGTGGGA CAGTAAAGTC AAATTCGACG AGTTAAGCCT GAAGCTGGCA
GGAAAAACGG ATAAAGAAAT TCGTGAAACC CTGACACGCC GCTACAAATT TGCCATTCGT
CGTCTGGCGC AAACCAACAG CGAAGATGTT TTCTCGCTGG CAATGACGGC GTTTGCGCGT
GAAATCGACC CGCATACCAA CTATCTTTCC CCGCGTAATA CCGAACAGTT CAACACTGAA
ATGAGTTTGT CGCTGGAAGG TATTGGCGCA GTGCTGCAAA TGGATGATGA CTACACCGTT
ATCAATTCGA TGGTGGCAGG TGGTCCGGCA GCGAAGAGTA AAGCTATCAG CGTTGGTGAC
AAAATTGTCG GTGTTGGTCA AACAGGCAAG CCGATGGTTG ACGTGATTGG CTGGCGTCTT
GATGATGTGG TTGCCTTAAT TAAAGGGCCG AAGGGCAGTA AAGTTCGTCT GGAAATTTTA
CCTGCTGGTA AAGGGACCAA GACCCGCACT GTAACATTGA CCCGTGAACG TATTCGTCTC
GAAGACCGCG CGGTTAAAAT GTCGGTGAAG ACCGTCGGTA AAGAGAAAGT CGGCGTGCTG
GATATTCCGG GCTTCTATGT GGGTTTGACA GACGATGTCA AAGTTCAACT GCAGAAACTG
GAAAAACAGA ATGTCAGCAG TGTGATCATC GACCTGCGTA GCAATGGCGG TGGGGCGTTG
ACCGAAGCGG TATCGCTCTC AGGTCTGTTT ATTCCTGCGG GTCCCATTGT TCAGGTCCGC
GATAACAACG GCAAGGTTCG TGAAGACAGC GATACCGACG GGCAGGTGTT CTATAAAGGC
CCGCTGGTGG TACTGGTTGA CCGTTTCAGT GCTTCGGCTT CAGAAATCTT TGCCGCGGCA
ATGCAGGATT ACGGTCGTGC GCTGGTTGTG GGTGAACCGA CGTTTGGTAA AGGCACCGTT
CAGCAATACC GTTCATTGAA CCGTATTTAC GATCAGATGT TACGTCCTGA ATGGCCAGCG
CTGGGTTCTG TGCAGTACAC GATCCAGAAA TTCTATCGCG TTAACGGCGG CAGTACGCAA
CGTAAAGGCG TAACGCCAGA CATCATCATG CCGACGGGTA ATGAAGAAAC GGAAACGGGT
GAGAAATTCG AAGATAACGC GCTGCCGTGG GATAGCATCG ATGCCGCGAC TTATGTGAAA
TCAGGAGATT TAACTGCCTT TGAACCGGAG CTGCTGAAGG AACATAATGC GCGTATCGCG
AAAGATCCTG AGTTCCAGAA CATCATGAAG GATATCGCGC GCTTCAACGC TATGAAGGAC
AAGCGCAATA TCGTTTCTCT GAACTACGCT GTGCGTGAGA AAGAGAATAA TGAAGATGAT
GCGACGCGTC TGGCGCGTTT GAACGAACGC TTTAAACGCG AAGGTAAACC GGAGTTGAAG
AAGCTGGATG ATCTACCGAA AGATTACCAG GAGCCGGATC CTTATCTGGA TGAGACGGTG
AATATCGCAC TCGATCTGGC GAAGCTTGAA AAAGCCAGAC CCGCGGAACA ACCCGCTCCC
GTCAAGTAA
 
Protein sequence
MNMFFRLTAL AGLLAIAGQT FAVEDITRAD QIPVLKEETQ HATVSERVTS RFTRSHYRQF 
DLDQAFSAKI FDRYLNLLDY SHNVLLASDV EQFAKKKTEL GDELRSGKLD VFYDLYNLAQ
KRRFERYQYA LSVLEKPMDF TGNDTYNLDR SKAPWPKNEA ELNALWDSKV KFDELSLKLA
GKTDKEIRET LTRRYKFAIR RLAQTNSEDV FSLAMTAFAR EIDPHTNYLS PRNTEQFNTE
MSLSLEGIGA VLQMDDDYTV INSMVAGGPA AKSKAISVGD KIVGVGQTGK PMVDVIGWRL
DDVVALIKGP KGSKVRLEIL PAGKGTKTRT VTLTRERIRL EDRAVKMSVK TVGKEKVGVL
DIPGFYVGLT DDVKVQLQKL EKQNVSSVII DLRSNGGGAL TEAVSLSGLF IPAGPIVQVR
DNNGKVREDS DTDGQVFYKG PLVVLVDRFS ASASEIFAAA MQDYGRALVV GEPTFGKGTV
QQYRSLNRIY DQMLRPEWPA LGSVQYTIQK FYRVNGGSTQ RKGVTPDIIM PTGNEETETG
EKFEDNALPW DSIDAATYVK SGDLTAFEPE LLKEHNARIA KDPEFQNIMK DIARFNAMKD
KRNIVSLNYA VREKENNEDD ATRLARLNER FKREGKPELK KLDDLPKDYQ EPDPYLDETV
NIALDLAKLE KARPAEQPAP VK