Gene Phep_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0166 
Symbol 
ID8251251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp194178 
End bp196358 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content41% 
IMG OID644933816 
Productcarboxyl-terminal protease 
Protein accessionYP_003090454 
Protein GI255530082 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTA AAATAGTAAA AGAGATGTTG AAAAAGATAT TACTGGTAAC GTTTACTGCC 
GCTGTTCTTG CATGTCATGC TGCACCTAAA CCTCAGCCAA TGGTTCAGGG GGTAAACAAT
ATCCTGCCTG ATGAAAAGCA GGCTTTGGTT TGTAAAGAAA TAGTGGGGTT GATTGAGAAC
TACAATTACA AAAAGATCAA GGTGAACGAT TCTATCTCTT CGCTTGTTTT AGACAAATAC
ATTAAGGCAC TGGATCCTTA TAAATATTAT TTCCTGGCAG CTGATATTAA GGAATTTGAA
AAATTCAGGT ATACACTTGA CGACGATTTT CGGAATGGGG AATTAAGTGC GCCATTTTAT
ATTTTTAATG TTTATTTAAA ACGCTATAAT GAATTTATAA ATTATGCATT TACCCAGATC
AAGGCAAAGC AAAATTTTAA TTCGACTGAG ACCTATGTAT ATGACCGTGA GAAAATGCCG
TGGACCACTT CTAAGGCAAC GCTCGATGAT CTATGGAGGA AACGTGTAAA ATATGAACTG
GTAAACCTCA AAATTGCCGG AACACCTGAG GACAAAAATG TAGAAACCTT AACAAAACGT
TACGAGGCAC TTAAATCGCA GGCATCCAAA CTAAATAATC AGGACGTTTT TCAAACCATT
ATGGATGCGT TTACGGAGAC CATTGATCCG CATACCAACT ATTTTAACCC GGCAAATGCA
CAACAGTTTA ATGAAGATAT GGCCCGTTCT TTTGAGGGTA TCGGGGCAAG GCTGCAACTG
GAAAATGAGA TACTTAAGAT TGCCGAGGTT ATTCCTGGCG GCCCGGCGTT TAAAAGCAAA
TTGCTAAGTG CTGGCGATAG GATCATCGGT GTTGCGCAAG GTTCTGGTGA ATTTGAAGAT
ATCATTGGCT GGAGAATAGA AAACTCGGTT TCTAAAATTA AAGGACCTAA AGGCACCAAG
GTCCGTTTAA AGATTATTCC GGTAGGTATG GAAATGTCGT CCAAGCCAGT AATCATAGAG
CTGGTGAGAG AGAAGATCGT AATGGAAGAT CAGTCGGCCA AAAAGAAAGT CCAAACCATC
GAATCAAATG GCAGGCCATA TAAAATAGGT ATCATTACTG TACCTGCTTT CTATGCTGAT
TTTAAAGCTG CAAATGCAGG CGATCCTAAT TATAAAAGTA CCACACGGGA TGTAAAACTG
CTGATCGATA CTTTAAAGAA TAAAGATAAG GTAGATGCCA TTGTAATGGA CCTTCGGGCA
AATGGTGGTG GCTCTTTGGT AGAAGCTATA GACCTTACCG GTCTGTTTAT TGACAAAGGC
CCGGTGGTAC AGGTAAAAGA TCTGAGGAAC AATATAGAAG TTAATGAGGA TACTAACCCT
GGCGTGGCCT GGAGCGGGCC TTTTGGTGTA ATGGTTGACC GTTTAAGCGC TTCAGCATCA
GAAATTTTTG CCGGGGCCAT TCAGGATTAC GGAAGAGGGA TCATCATGGG GACCCAAACT
TATGGTAAAG GTACCGTACA GTCGTCGATA GACATGAATA AGCTGGTGAA CCCTTCCATG
TTGCAAAGAC TTGCAGCCCT TGTAGGCAAG AATGCCGGGC TCACAGGAAC AAATAAGGAA
GGTGTTCAGC TGGGACAGAT CAATCTTACC ATGGCTAAAT TTTACCGTGT AACGGGAAGC
AGCACCCAGC ATAAAGGCGT GATGCCGGAT ATTGAGTTCC CATCTATCTA TCCGATGGAT
AAAATTGGTG AAGATACAGA ACCTTCGGCC TTACCATGGG ATGAGGTCAA ACGGTCTAAC
TTTACAACGG TTGCGAACTT GGCACCAGTT AAACCGGAAC TTATTGCATG GCATAAAGCA
CGGATGGCAA AGTCGCTCGA CTATAAAATA ATGGAACAGG ATATCGCGGA TGCCAAAAAA
CGCGAGGGGG AAGTTTCGGT TTCTTTAAAT GAAGTGAAAC TGAAGGCAGA AAGAGATAGT
CTGGAAGCAA AAAATTTGGC CAAGCTAAAT TCCTTAAGGG CTTCAAGAGG ATTGGCGCCA
ATGAAAAAGG GCGAGAAAAT CAAGAAAGAG GATAATTTCG ATTTTATACT GGATGAAAGC
TTAAGGATAA TGACCGATTT TATGCAGGTT ACTGATCCGG CAAGTAAAGG ATTGGTAAAG
AATCAGGGTG CTTTAAATTA G
 
Protein sequence
MDFKIVKEML KKILLVTFTA AVLACHAAPK PQPMVQGVNN ILPDEKQALV CKEIVGLIEN 
YNYKKIKVND SISSLVLDKY IKALDPYKYY FLAADIKEFE KFRYTLDDDF RNGELSAPFY
IFNVYLKRYN EFINYAFTQI KAKQNFNSTE TYVYDREKMP WTTSKATLDD LWRKRVKYEL
VNLKIAGTPE DKNVETLTKR YEALKSQASK LNNQDVFQTI MDAFTETIDP HTNYFNPANA
QQFNEDMARS FEGIGARLQL ENEILKIAEV IPGGPAFKSK LLSAGDRIIG VAQGSGEFED
IIGWRIENSV SKIKGPKGTK VRLKIIPVGM EMSSKPVIIE LVREKIVMED QSAKKKVQTI
ESNGRPYKIG IITVPAFYAD FKAANAGDPN YKSTTRDVKL LIDTLKNKDK VDAIVMDLRA
NGGGSLVEAI DLTGLFIDKG PVVQVKDLRN NIEVNEDTNP GVAWSGPFGV MVDRLSASAS
EIFAGAIQDY GRGIIMGTQT YGKGTVQSSI DMNKLVNPSM LQRLAALVGK NAGLTGTNKE
GVQLGQINLT MAKFYRVTGS STQHKGVMPD IEFPSIYPMD KIGEDTEPSA LPWDEVKRSN
FTTVANLAPV KPELIAWHKA RMAKSLDYKI MEQDIADAKK REGEVSVSLN EVKLKAERDS
LEAKNLAKLN SLRASRGLAP MKKGEKIKKE DNFDFILDES LRIMTDFMQV TDPASKGLVK
NQGALN