Gene Phep_3598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3598 
Symbol 
ID8254720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4293074 
End bp4294834 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content41% 
IMG OID644937250 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003093851 
Protein GI255533479 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.799906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGA GATCTATAGT ATTTTTTATT GCCTGCTTTT TCCTGCATAC CGCAGTTTTT 
TCGCAGGACA AGAATGTAAA GCTTCCGGCA AACTGGTATA ATCTGGACCT GATAAAAGAC
GGATATTTTG GTATCAGTAC TGAAAAGGCA TATACTGAAC TTTTACAGCA TAGAAAGCCA
AAAGAGAAGA TCATTGTGGC GGTGATTGAT GGTGGAGTGG ACATTAGCCA TGAAGATTTG
AAGGATGTAC TGTGGACAAA CAAAAAGGAG ATTGCAGGAA ATGGCATAGA TGATGATGGC
AATGGATATG CGGATGATGT ACATGGCTGG AATTTTATAG GCTCAAAAAA GGGGAACCTG
GCTTATGACA ATCTGGAGCT GGTAAGGATT TTCAGGGAGT ATCAGCCAAA ATACAGGTCT
ACAATTAAAT CAACAATTTT GGACAGCACG CAAAAAGAGG AATTTGCCTT GTATACCAAG
GTAACTGCGG CATTTGGAAA AAAATACGAT GAAGCACACC AGACTTTTGC AGTTGTGGCT
ATGATCAATA AAGTGCTGGA TTCTGTAGGG CAGATCAACC ATAAGGCAAT TCCTTCACTG
GAAGATATTG AGCGTTATAA GGCCGACAGT GAAGAAGAGG AACAATGCAA AAAGATTATC
AGGAAAGGGG CCAGGGAAAG CGGATCTATA GAGAAGTTCC ACAAGGAAAT GAAGGATGCT
TATAAACAGT ATGATGTGAT GCTGAAGTAT AACCTAAATC CTAAATATGA TGAACGTGGG
GCACTGGTGG GGGATGACTA TTCGAATGCA AAAGAGCGGT TTTATGGAAA TAATGATGTA
GCCGGACCAA ATGCGGAGCA TGGCACACAT GTTTCCGGTA TAATTGCTGC AAATAGAAAG
AACAACATAG GCATAAACGG TGTGGCCGAT AATGTGAGTA TTATGGCCAT CAGAGTAGTG
CCGGAAGGTG ATGAGCGCGA TAAGGATGTT GCCAATGGGA TAAGATATGC GGTAGATAAT
GGTGCAAGAG TAATTAATAT GAGCTTTGGA AAAGGCTTTA AATGGAATAA GGAGGTTGTT
GATGATGCTG TTAAATATGC TGAGAAAAAA GGCGTATTGC TGGTACATGC AGCCGGTAAT
GATAACCAGA ATAATGACCT GGAAGAAAAT TATCCTACTA AATATTATGA CAGTCCGGAA
GCCATAGCCT ATAAAAAGGC CCATAAGAAG CCAGACCTTA GTGCAATGTT GTTCAGGCCG
AATGCCAATC AGCAGCAAGG CCCTGGCATG GGGCGTAATG TGCCGACACT GCCCTTGAAA
CCGGTAATTG ATACCGCTAA GTTTAATTTG CCCCATGCCA ATAACTGGAT TGAGGTTGGT
GCAAGTGCTT ATAAGAACGA TGCGAGTTTG AAGGCGTCTT TTTCTAATTA CGGCAAATAT
ACCGTAGATG TTTTTGCGCC GGGTTTCATG ATTAAATCAA CTGTTCCGGG ATCTAAGTAC
GAAGAGTTTG ACGGTACCAG TATGGCTGCT CCTGTTGTTT CTGGCCTGGC TGCCTTAATT
TTGAGCTATT ATCCTGAACT TAAACCGCGT GAAGTAAGAG AGATCATTAT GAAATCTGTG
GTTAAGGTTG AGCAGAAGGT AAAGCATGAA AATTCAAGGG GTGAAAGTGA ACGGATCAGT
TTTAAGGAAC TGTGTGTAAG CGGGGGTGTT GTTAATGCTT ATGAAGCCTT AAAATTGGCA
GAACATTATA AAACAAAATA G
 
Protein sequence
MKLRSIVFFI ACFFLHTAVF SQDKNVKLPA NWYNLDLIKD GYFGISTEKA YTELLQHRKP 
KEKIIVAVID GGVDISHEDL KDVLWTNKKE IAGNGIDDDG NGYADDVHGW NFIGSKKGNL
AYDNLELVRI FREYQPKYRS TIKSTILDST QKEEFALYTK VTAAFGKKYD EAHQTFAVVA
MINKVLDSVG QINHKAIPSL EDIERYKADS EEEEQCKKII RKGARESGSI EKFHKEMKDA
YKQYDVMLKY NLNPKYDERG ALVGDDYSNA KERFYGNNDV AGPNAEHGTH VSGIIAANRK
NNIGINGVAD NVSIMAIRVV PEGDERDKDV ANGIRYAVDN GARVINMSFG KGFKWNKEVV
DDAVKYAEKK GVLLVHAAGN DNQNNDLEEN YPTKYYDSPE AIAYKKAHKK PDLSAMLFRP
NANQQQGPGM GRNVPTLPLK PVIDTAKFNL PHANNWIEVG ASAYKNDASL KASFSNYGKY
TVDVFAPGFM IKSTVPGSKY EEFDGTSMAA PVVSGLAALI LSYYPELKPR EVREIIMKSV
VKVEQKVKHE NSRGESERIS FKELCVSGGV VNAYEALKLA EHYKTK