Gene Phep_4159 details

Gene Information

Locus tag: Phep_4159
Symbol:
ID: 8255294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 5030693
End bp: 5032096
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 39%
IMG OID: 644937824
Product: TPR repeat-containing protein
Protein accession: YP_003094412
Protein GI: 255534040
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
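
The length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that does not appear in the protein; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one codon per residue, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5030693, 5032096)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```

Both results match the Gene Length (1404 bp) and Protein Length (467 aa) fields listed above.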


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00004642
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAGAGG AATTTTACTT TGATTTTAGT GATGACGCCC AACGGTCTGT TGAGCGTTAC 
GAGGAGATGA TACGCAATCA GGATCAGTAC TTTTTTGATG CCCAGGCTTT TGAGAATATT
ATTGATTACT ATATTGAGAA GAGTGACCCT GTAAAGGCCC TTCAGGTAAT AGAATATGCA
CTTAATCAGC ATCCTTATGC TGCGGTATTT CTGGTTAAAC AGGCGCAGTT GCTGTTTATA
ACCGATCAGA CGGAAAGGGC ATTCCTGTCG CTGCAGAAAG CGGAAATGCT GGAGGCTTCT
GAAGCAGAGA TCTATATTTT AAGAGGGAAT ATCTTTAATA GCCTTGAGCG TTATTCTGAA
GCATTGGATA ATTTCCAGAA AGCATTGGAA TTTGCTGAAA CAACAGATGA AATTCTTTTG
CAGATTGCTT ATGTATACCA GAATATGCTG GACTATGAAA GCGCGATTAT ATATATTAAA
CAGAGCCTGG AGCAAAACAT GGAGAACAAG GACGGTTTGT ATGAGCTTGC TTTTTGTTAT
GACATTTTAG ATAAACAGGA AGAAAGTATT AAGTTTTATC AGGAATACAT AGATAATGAT
CCGTATTCTT ATGCCGCATG GTATAACCTG GCCAATTCTT ACCATAAACT GGATTTGTTC
GAGAAAGCAA TTGATGCTTA TGATTATGCC ATCTTAATTA AAGACAATTT TGCTTCTGCC
TACTACAATA AAGGGAATGC ACTGGTTCAG CTTGACCGCT ACACGGAAGC TATTGAGGTT
TATAAACAAA CCTTTGAATA CGAACCACCC AATGCCGATA CCTATTGTGC TATCGGGGAA
TGTTATGAAA AGCTTGAAAG GATGGATGAA GCGCGTTCAT ACTACAAAAA ATCTGTTAAG
ATGGACGCGA AGATGGCCGA TGCCTGGTTT GGCATAGGTG TTACCCTGAA TTTTGAGGAA
CGCTACTTTG AGTCGCTGCA TTTTTACAGA AAAGCGCTGG AACTGGATGC AGAGAATCCG
GATTTTTGGT TTGCCATGGC CGATGCCCAT TATAAGCTCG GGCAGATAGA ACAATCTGTT
GAAGCCTATT ATAAAGTTTT GGAGTACAAT CCGGTAGATG TAGAAGCCTG GCTTGATTTT
TCAACTGTAT TGTACGAGCA GGGAAAACTA CTCGAAGCTT CAGAGACGAT GTCTGATGCG
ATAAAAAATA ATCCTGATGC CGCAGAGCTA TATTATCGTA TGGTAGCTTA TTTATTTGCC
CTTGGAAAAA AGAGCGAAGC CCTCTTATAT CTTGAAACAG CCCTGGTTAC AGATCCTGAA
AAGCACTATA TTTTGTTTGA ATATTTACCT CAATTACAAG ATAACAGCTC AATAGTGGAC
GTTATCAATA GGTATATAAA ATAG
 
Protein sequence
MEEEFYFDFS DDAQRSVERY EEMIRNQDQY FFDAQAFENI IDYYIEKSDP VKALQVIEYA 
LNQHPYAAVF LVKQAQLLFI TDQTERAFLS LQKAEMLEAS EAEIYILRGN IFNSLERYSE
ALDNFQKALE FAETTDEILL QIAYVYQNML DYESAIIYIK QSLEQNMENK DGLYELAFCY
DILDKQEESI KFYQEYIDND PYSYAAWYNL ANSYHKLDLF EKAIDAYDYA ILIKDNFASA
YYNKGNALVQ LDRYTEAIEV YKQTFEYEPP NADTYCAIGE CYEKLERMDE ARSYYKKSVK
MDAKMADAWF GIGVTLNFEE RYFESLHFYR KALELDAENP DFWFAMADAH YKLGQIEQSV
EAYYKVLEYN PVDVEAWLDF STVLYEQGKL LEASETMSDA IKNNPDAAEL YYRMVAYLFA
LGKKSEALLY LETALVTDPE KHYILFEYLP QLQDNSSIVD VINRYIK
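
The relationship between the gene sequence and the protein sequence, and the way a GC content figure is derived, can be illustrated with a short script. This is a minimal sketch using only the Python standard library; the helper names (`clean`, `gc_content`, `translate`) are hypothetical. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code, and it does not model the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here).

```python
# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGAAGAGG AATTTTACTT TGATTTTAGT"))  # MEEEFYFDFS
# Last 24 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GTTATCAATA GGTATATAAA ATAG"))        # VINRYIK
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_content` on the same input gives the fraction that the GC content field reports as a rounded percentage.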