Gene Phep_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3840 
Symbol 
ID8254974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4611133 
End bp4612185 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content44% 
IMG OID644937504 
ProductTPR repeat-containing protein 
Protein accessionYP_003094093 
Protein GI255533721 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.745152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGAA AAGTATTTTT ATTGATTTTG CTGAGCTATA TACCTGCAGT ACGGGCGCAG 
CAGCCTGTGC AGGGATATGT AGATAGTAAT CTCGTGAAAA CGCTTTTTTT TGCCGGCCTG
CGTGATAAGT TGAACGAGGA TTATAGCAGG GCCAATGAAA GTTTTACCAA AATACTGGTG
CTAGACCCTA ACAATGCAGC CGTCCACTAT GAAATTGCGG TAATGAACTA CCGGCAGAAT
AAACTTTTTG AAGCTGAAAT GGCCATAAAA AAAGCCCTTG TTGCTGATGG TAACAATGTG
TGGTACTGGA TGCTGATGGC TGAGTTGTAT AAACGAAAGG GCGATATGGA AGCTCTGGTA
GAGGTTCTGA ACCAGATGAT CAGGCTTGCA CCGGATAAGG AGGCCTATTA TTACGACCGG
TCTAACGCCT GGCTACTGGC TGGGAATACA GACGCCGCTA TGAAAGGTTA TGATGAGCTG
GAAAAAAAAT TCGGCAATTC TGAAGCACTG AACCATGCCA GGCAACGGGT AACGATGGAA
AAGGATGATA CTGCAGGTGG ACAAAATGAG GGCCATCAGG CAGCAGCCTC GCTGAGCCCG
GAACAAACAA TGCTGGTACT TGGCGAAAAA TTGTACAGGC AGGGCGATCT GAAAGGGGCA
ATGGCCCAGT TTAAATCAAT ACTTAAAAAT ACCGATCAGA TTTATATGGC CTGGGAACGT
GCAATACATA TTGAAGTGGT ACTGGGTTTA TATGCCGAGG CTTTAAAAAC AGCAGATGAA
GCTTTATCTT TATATCCTAG TCAGGCAGTT CTGTATTATT ATAAGGCTGT AGCCCTGCAA
CATATAAGTA ATTATGCGGA AGCCCTGACA AATATCGAAA CTGCCTTGCA GCTGGATGAA
GGAAATGCGC TTTATATGGA GCTTTATGGC GATGTTTTGT TTTTGAAAGG AGAGCCTGCC
CAGGCATTGC TGCAATGGAA AAAGTCGAAA GCGGCAGGGA ACAGTTCTGA AAAATTAAAC
AAAAAGATCA ATGAACGGAA GTATTTGGAA TAA
 
Protein sequence
MKGKVFLLIL LSYIPAVRAQ QPVQGYVDSN LVKTLFFAGL RDKLNEDYSR ANESFTKILV 
LDPNNAAVHY EIAVMNYRQN KLFEAEMAIK KALVADGNNV WYWMLMAELY KRKGDMEALV
EVLNQMIRLA PDKEAYYYDR SNAWLLAGNT DAAMKGYDEL EKKFGNSEAL NHARQRVTME
KDDTAGGQNE GHQAAASLSP EQTMLVLGEK LYRQGDLKGA MAQFKSILKN TDQIYMAWER
AIHIEVVLGL YAEALKTADE ALSLYPSQAV LYYYKAVALQ HISNYAEALT NIETALQLDE
GNALYMELYG DVLFLKGEPA QALLQWKKSK AAGNSSEKLN KKINERKYLE