Gene Phep_0974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0974 
Symbol 
ID8252068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1141517 
End bp1143232 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content43% 
IMG OID644934629 
Productoligoendopeptidase, M3 family 
Protein accessionYP_003091258 
Protein GI255530886 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02289] oligoendopeptidase, M3 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.338378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAACC TTACCATACC CAAGAAAAAA AGAGTTTATA TCCCGCAGGA CCTTCAAATC 
AAATGGGAAA ACCTGGAAGT TATTTTAAAT GAGCTATTGG AACGTCCGAT CGGGAATGTT
CAGGAACTGG AACAATGGCT TAAAGACAAA AGTGAACTGG AAGCAGCCCT GGAAGAAGAT
TTTGCCTGGC GATACATCAG GATGAGCTGT GATACAGCAA ATGAAAGCCT GGTTAAGGAT
TTCCAGTATT TTGCTACCGA AATTGAACCT AAAATTTCTC CGGTTGCCAA CAAGCTGAAC
CAGAAATTTA ACGATAGCCC ATTTATTGAT GAACTGGACC AGGACAAGTT TTTTGTCTAT
ATCCGGGCAA TCAGAAAAGC ACTCGAAATT TACAGGGAAG AAAACGTGGA GCTGCTGACC
AGGCTGCAGG TTACCCAGCA AAAATACCAG GGCATTACAG GTGCCATGAG TGTAACCCTC
AATGGCCAGG AATATACCCT GGAACAGGCC GCGAATTTTA TTAAAGATAC AGACAGACAA
GTCAGACAAC AGGCCTGGGA AACTATACAA CAACGTCGTA TGGTGGACAA GGACGAATTG
AACATGCTGT TTGATGAGCT CATCGTGATG CGTCATGAAG TAGCTTTAAA TGCCGGTTTC
GAAAACTACC GCGATTATAT GTTCCAGGCT TTGGGTCGCT TTGACTATAG CCCTCAGGAC
TGCTATCATT TTCATGAGGC CATTGAAAAG CAGATTGTAC CAATCCTTAA AGAACAGGCA
GAAAAAAGGG CAGAACTGTT GGATATAGCC CCACTTAAGC CATGGGATAT GGAGGTGAGC
ACCACCGGAA AACCTGCGCT TAAGCCCTTC AAAAATGGCG GGGAACTGAT AGATAAAAGC
ATAGCCTGCT TCAATGTTAT TGATCCTAAG CTCGGACAAA TGTTGTCCAT CATGAAAGCA
AACAACCTTT TTGATGTGGA AAGCAGGAAA GGCAAAGCAC CAGGTGGATA TAACTACCCA
CTGGCAGAAA CCGGAGCCCC CTTTATTTTT ATGAACTCGG CCAATTCTTT GCGCGATTTA
ACAACAATGG TACATGAAGG CGGGCACGCA GTACATACCT TCCTCACTGC AAACCTGGAG
CTGAACGACT TTAAACATTG TCCTTCCGAA GTTGCAGAGT TAGCTTCAAT GAGTATGGAG
CTGATCTCTA TGGACAACTG GAACATTTAC TTCGACAATG AAGAAGACCT GATCCGTGCA
AAAAAAGAAC AGCTGGTAGA TGTACTCAAA ACCTTGCCCT GGGTGGCTGT TATCGACCAG
TTCCAGCACT GGATCTATAC CAATCCCAGT CATAATGCGG CCGACCGTGA GGAGGCCTTT
AAACAAATTT ACACCCGTTT TGGGGCAGGC TTTGCCAACT GGGATGGCCA GGAAAAGGAA
TTTGGAAACA TCTGGCAAAA ACAGCTGCAT CTTTTTGAGG TCCCTTTTTA CTACATTGAA
TATGCCATTG CCCAGTTAGG GGCCATTGCC ATCTGGAAAA ATTATAAAGA AAATCCTTCC
AAAGCGCTGG AACAATATCT TAATGCACTT TCATTAGGTT ATACAAAACC AATTAATGAA
ATTTATGAAA CTGCAGGGAT AAAATTCGAT TTTAGTTTAA GTTATATAGA ACAGCTTGCC
AGTTTTGTAA AAGATGAATT GCAGAAATTA AACTAA
 
Protein sequence
MINLTIPKKK RVYIPQDLQI KWENLEVILN ELLERPIGNV QELEQWLKDK SELEAALEED 
FAWRYIRMSC DTANESLVKD FQYFATEIEP KISPVANKLN QKFNDSPFID ELDQDKFFVY
IRAIRKALEI YREENVELLT RLQVTQQKYQ GITGAMSVTL NGQEYTLEQA ANFIKDTDRQ
VRQQAWETIQ QRRMVDKDEL NMLFDELIVM RHEVALNAGF ENYRDYMFQA LGRFDYSPQD
CYHFHEAIEK QIVPILKEQA EKRAELLDIA PLKPWDMEVS TTGKPALKPF KNGGELIDKS
IACFNVIDPK LGQMLSIMKA NNLFDVESRK GKAPGGYNYP LAETGAPFIF MNSANSLRDL
TTMVHEGGHA VHTFLTANLE LNDFKHCPSE VAELASMSME LISMDNWNIY FDNEEDLIRA
KKEQLVDVLK TLPWVAVIDQ FQHWIYTNPS HNAADREEAF KQIYTRFGAG FANWDGQEKE
FGNIWQKQLH LFEVPFYYIE YAIAQLGAIA IWKNYKENPS KALEQYLNAL SLGYTKPINE
IYETAGIKFD FSLSYIEQLA SFVKDELQKL N