Gene Phep_3687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3687 
Symbol 
ID8254818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4400450 
End bp4402495 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content41% 
IMG OID644937348 
ProductNeprilysin 
Protein accessionYP_003093940 
Protein GI255533568 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.961749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAAT TGAATAAGCC TGCAGGTATT GCTGTGGTTG CTGGAACCAT CCTTTTTATG 
TCCGCATGTA ACAGTCCAGG GGCTAAAACT ACCGACCTTT ATTCAGGTAT TGTCCTTAAA
AACATGGATA CCACTGTAGC TCCGGGCAAT AATTTTACTG AATACGTAAA TGGCACATGG
GTGAAGAACA CCAAAATCCC CTCAGACAAA GCTTCTTTTG GGGCAATGGC CATTGTTAAT
GATAAAGCCC AGGATGATGT GAAGGCGATC ATTGAGACCG CAGCAAAGGT TAAATCTAAG
GATGGGTCTG AAGAACAAAA GATAGGTGAT TTTTACGAGT CTTATATGAA CATGAAAGTT
CGTGATGCTA TCGGGCTTGA ACCGCTTGCT GCCGAATTTA AAAAGATCGA TGCGCTTACC
ACAAATAAAG ACCTTGCCGG TTATTTTGCC TATGCCAATA AATTGGGTTT TAAAATTCCT
TTTAGCCTTG GGGTAATGGA AGATTTTAAA GATCCGAAAA AATATATGCT GTTCAGCTGG
CAAGGTGGTT TGGGTTTGCC AGATCGTGAT TACTATTTGC TTACTGATGC GAAGTCGAAA
GAGATACGCA ATAAATACAT TCAGCATATA GAAAATATGC TGAACATTGC CGGAATACAG
GATGCAAAAG CTATCGCAAA ACAGGTTATG GCTTTGGAGA CCCTGATGGC TTCCAAACAG
ATGAAAAAAG AAGATACCAG AAATATGGCG GCATTGTATA ATAAGTATGC TGTTAAGGAC
CTGAATAAAC TGCTGGTTGA TTTTGACTGG AATAAATTGC TGCTTGAGGG GGGGATCAGG
GGAGTAGACA GTTTGGTGGT TACACAGGTA GCCTATACCA AGGACCTGAA TGCTATCCTT
AAAAATACAC CTATAGATAC CTGGAAAAAC TATTTGAAAT GGGGAGTAAT TACAGAGAGT
GCAAGTATGT TGAATTCTGC CCTGGATCAG GAAAATTTTA ACTTTTATGG TACAACATTA
AGGGGTATCA AAGAACAGAA GCCGCAATGG CGCCGGGCTG TTGATGTGGT AAATGCAAAC
CTGGGTGAAA TGGTAGGTAA GTTGTATGTG GAAAAACATT TTCCTGCAGA GGCTAAAGAG
CGGATGGTAA AGTTGGTGGT CAATCTTTTA AAAGCGTATG AAGCCAGTAT TAAAGAATTG
GACTGGATGA GCCCTGAGAC CAAAAAACAG GCTTTGATAA AAATTAACAA GTTCACTCCT
AAAATTGGTT ATCCGGATAA ATGGAGAGAT TATAGTGCAT TAAAGGTGGT TAAAGGTGAT
CTTTATGGTA ACAATGTACG TGCAACAGAA TTTGAGTATA ACCGGACCAT AAACAAACTT
GGCAAACCTG TAGACCGTTC TGAATGGGGG ATGAACCCAC AAACGGTAAA TGCTTACTAC
AATCCGCCTA TGAACGAAAT TGTGTTTCCT GCAGCAATCT TACAGCCTCC TTTCTTTGAT
ATGAAAGCAG ATGATGCGGT AAATTATGGC GGTATAGGTG CTGTGATCGG GCACGAGATT
GGTCATGGTT TTGACGATCA AGGCAGTACA TTTGATGGGG ATGGTGTGAT GCGGAACTGG
TGGACAAAGA AAGACAATGA AGAGTTTAAA AAGAGAACCA ATGCACTGGT AGCACAGTAT
AGTGCGTTTA AAGTGCTTCC TGACCTGAAT GTAAATGGAA ATTTTACGCT TGGCGAAAAT
ATTGGTGACC TTGGTGGCTT AAGCATTGCA TTGAGGGCTT ATAAAGCAAG TTTAAATGGT
AAGCCTGCAC CTGTTATGGA TGGTTTTACA GGTGAACAAA GGGTGTTTAT TGGCTGGGGC
CAAGCCTGGT TAAATAAATC TACAAATGAG GCTTTAAGAA CACAAGTGGG GACAGACCCA
CATGCTCCGG CCAAATTCAG GGTAAATGGG GTGGTAAGGA ATATCCCGGA GTTTTATACT
GCATTTAATG TAAAACCTAC AGATTCTTTA TATCTAGCTC CTGAAAAAAG AGTTAAAATC
TGGTAA
 
Protein sequence
MIKLNKPAGI AVVAGTILFM SACNSPGAKT TDLYSGIVLK NMDTTVAPGN NFTEYVNGTW 
VKNTKIPSDK ASFGAMAIVN DKAQDDVKAI IETAAKVKSK DGSEEQKIGD FYESYMNMKV
RDAIGLEPLA AEFKKIDALT TNKDLAGYFA YANKLGFKIP FSLGVMEDFK DPKKYMLFSW
QGGLGLPDRD YYLLTDAKSK EIRNKYIQHI ENMLNIAGIQ DAKAIAKQVM ALETLMASKQ
MKKEDTRNMA ALYNKYAVKD LNKLLVDFDW NKLLLEGGIR GVDSLVVTQV AYTKDLNAIL
KNTPIDTWKN YLKWGVITES ASMLNSALDQ ENFNFYGTTL RGIKEQKPQW RRAVDVVNAN
LGEMVGKLYV EKHFPAEAKE RMVKLVVNLL KAYEASIKEL DWMSPETKKQ ALIKINKFTP
KIGYPDKWRD YSALKVVKGD LYGNNVRATE FEYNRTINKL GKPVDRSEWG MNPQTVNAYY
NPPMNEIVFP AAILQPPFFD MKADDAVNYG GIGAVIGHEI GHGFDDQGST FDGDGVMRNW
WTKKDNEEFK KRTNALVAQY SAFKVLPDLN VNGNFTLGEN IGDLGGLSIA LRAYKASLNG
KPAPVMDGFT GEQRVFIGWG QAWLNKSTNE ALRTQVGTDP HAPAKFRVNG VVRNIPEFYT
AFNVKPTDSL YLAPEKRVKI W