Gene Phep_3074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3074 
Symbol 
ID8254191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3672334 
End bp3674106 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content41% 
IMG OID644936727 
Productsignal peptide peptidase SppA, 67K type 
Protein accessionYP_003093333 
Protein GI255532961 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00705] signal peptide peptidase SppA, 67K type
[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00250927 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAGAAT TTTTTAAATA TGTTTTTGCT ACAGTGGTAG GCGTAGTCAT TTCACTGGCC 
TTATTTTTAG TATTTTTTGT ACTATTGATC ATCGGAATTG TTAAAACATC CCTAAAGGAA
CAGAAAGTAA GTGTAAACAG CAATTCTGTA CTTTTGCTTA ACCTGGACCA AAGCATAACT
GAACGCACCG CTGACGATCC TTTGTCAAGA TTGCCCCTGG TTGGCTCAGA TGAGGATAAA
AGCATTGGTT TTAACGATGT GATCCGTGCG TTGCAAAAAG CCAAAACAGA CGACAACATT
AAATGTGTTT ACATCAATGT AACCTCACCT CAGGCGGGTT TTGCTACCAT GCGTGAAATC
AGGGATGCAC TGATTGATTT TAAAACAAGT AAAAAGAAAA TCATAGCCTA TAGTGAGGTT
TACACACAGG GAGCCTATTA CCTGGCTTCT GCAGCCGATA AGATTTATTT AAATCCCGAA
GGTGCACTGG AGTTTAAAGG CCTGAAATCT GAGACCATGT TTTTTAAAGG TGCACTGGAT
AAACTTGGCA TCGAAGCCCA GATTGTGAGG GTCGGTAATT ATAAAAGTGC TGTAGAGCCT
TTCTTTGCAG AAAAAATGAG CGACAAAAAC CGCGAACAGG TTACGGCCTA CCTGCATGGC
CTATATGATA CTTTTCTGGA AGGGATAGCC AAAAGCCGTA ACATCAGCAC CGAAACGCTT
TACCAGATTG CAGATGATTA TAAGATCAGG CAGCCTCAAG ACGCTGTTGC CTATAAAATG
GTGGATGAGC TGAAATATAA AGACCAGATT ATTGATGAAC TCAAAACATT AAGTGGAAAA
GAGAAAAAAA ATGATCTCAC AACCATTTCT ATTAATGACT ATGCCAGAAA TGTTGTTAAA
GACCAGCCTA CTTCCGCAAG CCAGAAAGTA GCCATAATTT ACGCCAATGG CGACATCATG
GGTGGCGATG GCTCCGATCA CCAGATTGGT TCGGAACGGA TCTCCAAAGC CATTCGGAAA
GCCCGTCTTG ATACCGATAT CAAAGCTGTT GTTTTAAGGG TAAATTCACC AGGTGGCAGC
GCTTTGGCCT CAGATGTGAT CTGGAGAGAA ATTGTACTCA CCAAAAAGAT AAAACCTGTT
ATCGCTTCAT TTGGAGATGT AGCAGCCTCT GGAGGCTATT ACATTGCATG CGCGGCCGAC
TCCATATTTG TACAACCCAA CACCATAACC GGATCTATTG GCGTATTTGG TATTATTCCT
AATTTCCAGA AATTATTTAA TGATAAACTT GGAATCACCT TTGATGGTGT TAAAACCGGA
AAATATGCAG ATATCATGAG TGTGGACAGA CCCATGACTG CAGGTGAAAG GCTGATTGTA
CAAAATGATG TAAACCGGGT ATACGATAGT TTTATTACCC GTGTGGCAGA TGGGCGTAAA
AGAACAAAAG CTTACATAGA CAGCATAGGT GGTGGCCGTG TTTGGGTTGG GACTGATGCA
GTCAGAATTG GACTGGCCGA CAGAACAGGT AACTTTAATG ATGCCATAAT TGCCGCAGCT
AAAAAAGCTA AAATAAAAGA TTACAGTATT GTGGAATATC CTGAAATGCG GGATCCCTTT
AAATCACTGA TGGACAACAC AACAGATAAG ATAAGAACTT ATTATGCCAA ACAAGAGCTT
GGTGCGAGCT ATCCTATCTA TCAGCAAATA AAATCAGCTG TATCCAGATC AGGAATCCAG
ACCAGGATGC CTTTTGAAAT TAAAATACAA TAA
 
Protein sequence
MREFFKYVFA TVVGVVISLA LFLVFFVLLI IGIVKTSLKE QKVSVNSNSV LLLNLDQSIT 
ERTADDPLSR LPLVGSDEDK SIGFNDVIRA LQKAKTDDNI KCVYINVTSP QAGFATMREI
RDALIDFKTS KKKIIAYSEV YTQGAYYLAS AADKIYLNPE GALEFKGLKS ETMFFKGALD
KLGIEAQIVR VGNYKSAVEP FFAEKMSDKN REQVTAYLHG LYDTFLEGIA KSRNISTETL
YQIADDYKIR QPQDAVAYKM VDELKYKDQI IDELKTLSGK EKKNDLTTIS INDYARNVVK
DQPTSASQKV AIIYANGDIM GGDGSDHQIG SERISKAIRK ARLDTDIKAV VLRVNSPGGS
ALASDVIWRE IVLTKKIKPV IASFGDVAAS GGYYIACAAD SIFVQPNTIT GSIGVFGIIP
NFQKLFNDKL GITFDGVKTG KYADIMSVDR PMTAGERLIV QNDVNRVYDS FITRVADGRK
RTKAYIDSIG GGRVWVGTDA VRIGLADRTG NFNDAIIAAA KKAKIKDYSI VEYPEMRDPF
KSLMDNTTDK IRTYYAKQEL GASYPIYQQI KSAVSRSGIQ TRMPFEIKIQ