Gene Phep_4121 details

Gene Information

Locus tag: Phep_4121
Symbol:
ID: 8255256
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4978018
End bp: 4979718
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 42%
IMG OID: 644937786
Product: carboxyl-terminal protease
Protein accession: YP_003094374
Protein GI: 255534002
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
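
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. Both are assumptions, not something this record states; the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4978018
end_bp = 4979718

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends in one stop codon, which
# does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1701 bp) and Protein Length (566 aa).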


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.132003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAT ACAAAATTGC CGCAGTTGCT CTCCTTATTA CCTTTGCTTC TATAACCTGG 
TCTTTTAAGG AAGACCTTTT TCAGGTGTCA AAGAATCTGG ACATTTTTGC TTCTTTATAT
AAAGAAATAA ACATCAATTA TGTAGAGGAA ACAAACCCTT CCAGCCTCAT GCGCAGCAGC
ATTGATGCGA TGCTCGAAAA TCTGGACCCT TATACAGAAT ACGTTCCTGA ATCAGAAGTA
GAAGATTATA AGCTGAAATA CGTGAGTACT CAATACGGTG GCATTGGGGC CAGCACCATT
TTTATTGAGG GTAAGTTATT TGTAAATGAG GTTAATGAAG GCTATCCGGC CGATAAACAG
GGAGTAAAAC CTGGCGATCA ACTTGTAAAA ATTAATGGTA ATGAGGTTAA GGGGAAAGAC
CGGGCGCAGG TAAGCCAGTT GCTGAGGGGA CCAAAAGGCT CTGTTGTCGA ACTCCTGATC
ATTAGGGAAG GTACCTTGAT TACCAAAAAC CTGACCCGTG ATGAAATCAA ACAGCCCAAT
GTTGCCTACT CCGGCATGAC AGCAGATAAT ATTGCCTATA TCCGTTTGGA TAAATTCCTT
GAAAACTCTG CTCAGGAGGT TAAGGATGCT GCAGTTACAT TGGGTAGACA GCAGCCTAAG
GGTATGATCC TCGATTTGCG ATACAACGGT GGGGGGATAC TGCAGGAAGC TGTTAAAATT
GTCAACATTT TTGTGGATAA GGATATCCTG ATCGTGACCC AGAAAGGAAG AAATCCGCAA
AAAACCATTA CCTATAAAAC AATTAACCAG CCCTTATTTC CAAACGTTCC ACTGGTGGTG
TTAATCAGTG GATCTTCTGC CTCGGCTTCT GAAATTGTTG CTGGAGCACT GCAGGACCTC
GACAGGGCTA TAATTGTTGG ACAGAGGAGC TATGGAAAGG GGCTGGTTCA ACAAACCTTT
AACCTGCCTT ATAACAGCCT TGTTAAGGTT ACTGTAGCCA AATATTTTAC CCCCTCGGGC
AGGTGCATCC AGGCGCTTGA CTATGCGCAT AAGGATGCCA ACGGCAAAAC ACTCAAATTT
GCAGATTCGC TGATGAGTAA ATTCAGTACA AAAACCGGGA GAAATGTATA TGACGGAAAT
GGCATTTATC CTGATGTGCT GGTAAATAGC CCTAAGCTTA GCCCGGTAAC CATTTCACTG
TTGAATAAGA ACCTGTTTTT TGATTATGCC AATAATTATA AAAAGAACAA TAAAGAAATT
GCTCCGGCAG CTTCTTTTCA GCTTACGGAA AACGATTATG CCGCTTTTGT AAATACCATG
GCAGGACGGG ATTACTCATA CACCTCACGT ACAGAACGCT TATTGTCTGA CCTGAGAACA
GAGGCAGAAA AAGAGAATAA ACTGACGCTT GTTAAGGCCG ACCTTGAAGA TTTAAAAGAA
AAAATGCTTG GTGCCAGAAA AACAGACCTG ACTACCTATA AAGCAGAGAT CAAAAGAGTT
TTAGAAACCC AGATCGTAAG CCGCTACTAC TATGAAAAGG GTAAAGTGAT CCAGGCGTTT
CAGTACGATA AGGAGCTGAA TGCAGCAAAA AGTCTGTTAA ATAACAACAA TAAAATGCTG
GCCATCCTTA AAGGTGAGGG CGAATATAAA ACAATAGGCA GCCCTATAAA AACAATAGCA
GCTGCCTCTG ATAATAATTA A
 
Protein sequence
MKIYKIAAVA LLITFASITW SFKEDLFQVS KNLDIFASLY KEININYVEE TNPSSLMRSS 
IDAMLENLDP YTEYVPESEV EDYKLKYVST QYGGIGASTI FIEGKLFVNE VNEGYPADKQ
GVKPGDQLVK INGNEVKGKD RAQVSQLLRG PKGSVVELLI IREGTLITKN LTRDEIKQPN
VAYSGMTADN IAYIRLDKFL ENSAQEVKDA AVTLGRQQPK GMILDLRYNG GGILQEAVKI
VNIFVDKDIL IVTQKGRNPQ KTITYKTINQ PLFPNVPLVV LISGSSASAS EIVAGALQDL
DRAIIVGQRS YGKGLVQQTF NLPYNSLVKV TVAKYFTPSG RCIQALDYAH KDANGKTLKF
ADSLMSKFST KTGRNVYDGN GIYPDVLVNS PKLSPVTISL LNKNLFFDYA NNYKKNNKEI
APAASFQLTE NDYAAFVNTM AGRDYSYTSR TERLLSDLRT EAEKENKLTL VKADLEDLKE
KMLGARKTDL TTYKAEIKRV LETQIVSRYY YEKGKVIQAF QYDKELNAAK SLLNNNNKML
AILKGEGEYK TIGSPIKTIA AASDNN
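
As a rough illustration of how the gene and protein sequences relate, the sketch below translates a nucleotide string codon by codon and computes its GC percentage. It uses plain Python rather than any particular bioinformatics library, and the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which start codons are permitted), so the standard codon assignments are used here. Only the first and last lines of the gene sequence are included as sample input; pasting in the full sequence would work the same way.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence in this record.
first_line = "ATGAAAATAT ACAAAATTGC CGCAGTTGCT CTCCTTATTA CCTTTGCTTC TATAACCTGG"
last_line = "GCTGCCTCTG ATAATAATTA A"

print(translate(first_line))
print(translate(last_line))
print(round(gc_percent(first_line), 1))
```

The two translated fragments correspond to the start (`MKIYKIAAVALLITFASITW`) and end (`AASDNN`) of the protein sequence listed above. The last line is a valid input on its own only because the 1680 bases before it are a multiple of three, so it starts on a codon boundary.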