Gene Phep_3748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3748 
Symbol 
ID8254880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4489233 
End bp4490828 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content42% 
IMG OID644937410 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003094001 
Protein GI255533629 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA TGAAAATGAA TTTTTTCTGC AGTGGGCTGG TATGCCTGGT TTTGCTGACA 
GCAATGCCAT TTATATCATT TGCCCAGAAG CCTAACTGGC AGAACCTCGA CTTAAAAGCC
GATTCTACAT TTGGCATTAG TACAGAAAAA GCTTATAACG AGTTATTAAA GGGCAAAAAA
TCTGTACCTG TAGTTGTAGC CGTTCTGGAT GGGGGAGTGG ATATTGAACA TGAAGATTTG
AAACCGGTGG TTTGGGTAAA TAAGAAAGAA AAAGCTGGTA ATGGTAAGGA CGATGATAAA
AATGGTTACA TTGATGATGT TCATGGATGG AACTTTTTGG GTTCATCAAA AGGTTCGGTG
AATTACGAAA CACTTGAACT TACACGTCTG GTACGTCGTG ACAATGCCAG GTTTGCCAAT
TCCGATGCTA CTTCAGTAAA AGCAAGTGAA CTTCCGGCCT TTGAAGCCTA TCAGAAAAAT
AAGGCAGAAT TTGAACAGGA ACTTGCTGAG GCAAAAATAG GCCTGGAGCG TTTTGAAGGT
ATCAAAAACG CCATAGATGA AGTGGTAAAG AAAATCGGTA AAGAAAATCC GACAATTCAG
GATTTCCAAT CCTTTAAGCC TTCAAATGAT ATGGAGGCCA ATATTCAGCG CATTTTGCTG
GCGCAGTTAA AGGAAAGTAG CTTTGAAGAT TTTTATAAAA ACCAGATCAT GTCCGGTTAT
GATCATTTTA AAAGTCAGGT AGATTATAAT TTGAACCTGG ATTACGATCC CCGCGGAATT
GTTGGCGATG ACCCTAACAA TAGCAAAGAA CGTTTTTATG GGAACAATGA CGTAACCGGG
CCTGATGCAA GACATGGGTC GCACGTTGCC GGTATCATTG CTGCTGTGCG TACCAATAAC
CTTGGAATTA TGGGCGTTGC TGATAACGTG CTGATTATGA GTGTCCGCAA TACCCCAAAT
GGGGATGAAC GGGATAAAGA TGTAGCCAAC TCTATCCGTT ATGCAGTAGA TAAAGGCGCT
AAGGTAATCA ACATGAGCTT CGGTAAGTCA TACTCATGGG ATAAAGCGAT TGTGGATGAA
GCGGTGAAAT ATGCAGTATC TAAAGACGTT GTACTGGTAC ATGCTGCGGG CAACGACAAT
AAGGACCTGG AAGTTGAACC AAATTTCCCT GACCCTGAAT ATATTGGTGG TGGTAAAGCG
GCCAGCTGGA TAACAGTAGG GGCTTCTGGC TGGACAAACG ATGGCACAAT AAAAGCCAGT
TTCTCTAATT ATGGAAAAAC TAAAGTTGAT GTCTTTGCGC CGGGGGTAAA TATTAATTCA
ACTGTACCGG GCTCAAAATA TGAGAAATTA AACGGAACAA GTATGGCTTC CCCTGTCGTT
GCAGGCCTTG CTGCCCTTAT CCGTTCTTAT TATCCCAAGC TAACTGCTGT ACAAGTTAAG
GATATCATTA TGAAGTCTGT GGTAAAGGTC GATCAGTCTG TCGAAATCAG GGATGAAGCA
GGTACTAAAA AAGTTCCTTT TTCAGATCTT TGTGTAAGCG GTGGCATTGT AAATGCCTAT
GATGCATTAA AACTTGCTGC CGGGTATAAG AAATAG
 
Protein sequence
MNKMKMNFFC SGLVCLVLLT AMPFISFAQK PNWQNLDLKA DSTFGISTEK AYNELLKGKK 
SVPVVVAVLD GGVDIEHEDL KPVVWVNKKE KAGNGKDDDK NGYIDDVHGW NFLGSSKGSV
NYETLELTRL VRRDNARFAN SDATSVKASE LPAFEAYQKN KAEFEQELAE AKIGLERFEG
IKNAIDEVVK KIGKENPTIQ DFQSFKPSND MEANIQRILL AQLKESSFED FYKNQIMSGY
DHFKSQVDYN LNLDYDPRGI VGDDPNNSKE RFYGNNDVTG PDARHGSHVA GIIAAVRTNN
LGIMGVADNV LIMSVRNTPN GDERDKDVAN SIRYAVDKGA KVINMSFGKS YSWDKAIVDE
AVKYAVSKDV VLVHAAGNDN KDLEVEPNFP DPEYIGGGKA ASWITVGASG WTNDGTIKAS
FSNYGKTKVD VFAPGVNINS TVPGSKYEKL NGTSMASPVV AGLAALIRSY YPKLTAVQVK
DIIMKSVVKV DQSVEIRDEA GTKKVPFSDL CVSGGIVNAY DALKLAAGYK K