Gene Phep_4203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4203 
Symbol 
ID8255339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5082065 
End bp5083147 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content46% 
IMG OID644937869 
Productpeptidase M20 
Protein accessionYP_003094456 
Protein GI255534084 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0267175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGCG ATTTATTTAA GGAAAGTATT AAGAGGGATG CGGTAGAATT ATTGAAACAA 
TTGATCAGCA TCCCATCTTT CAGTAAAGAA GAAGATAAGA CCGCAGATGC AATTGAGGCT
TTCCTGCAGC AGCGGAATAT AAAAACACAC CGTAAGCTGA ACAACATCTG GGCATACAAC
AAATATTTTG ATGCGGCTAA GCCAACCCTG CTCTTAAATT CACATCATGA TACCGTGAAG
CCAAACTCGG GCTATACGCG CGATCCTTAC GCGGCTACAG TGGAAGATGA CAAATTGTAT
GGGCTGGGTA GTAATGATGC CGGAGGCTGC CTGGTATCGC TTATAGCAAC CTTTCTTTAT
TATTATGATC AGGAAGGTTT AAATTATAAC ATCTGTCTGG CTGCTACAGC TGAAGAAGAG
ATTTCAGGCA ACAATGGCCT GGAATGCATA CTGCCCGACC TTGGGGAGCT GGAATTTGCC
ATTGTAGGAG AGCCTACCCT GATGAACCTG GCCATTGCAG AACGGGGTTT GCTGGTGCTG
GACTGTACCA GCTATGGAAA GGCCGGGCAT GCTGCACGTG AAGAAGGTGA CAATGCCATT
TATAAGGCGC TAAAGGATAT AGAATGGTTC CGCAATTATC GTTTTTCGAA GGTGTCCGAA
ATGTTTGGCC CTTTAAAAAT GTCGGTTACC ATTATTAATG CAGGCTCGCA GCACAACGTA
GTGCCCGCTT CCTGTATTTT TACAGTTGAT GTGAGGGTAA CTGATGCCTA TACCAATGAG
GAAGTGTTAA AGATTATCCG GACCAATGTG GACTGCGAGG TTGTACCCCG CTCTATCCGG
CTAAAACCTT CGTCAATTGA TAAGGAACAT CCCATCGTAC AGTCGGGTAT TGCCCTTGGA
AGGACAACTT ATGGTTCACC TACCACATCG GACCAGGCCT TATTGAGCAT TCCTTCCTTA
AAGGTTGGTC CTGGTGATTC TGCGCGGTCG CACATGGCGG ATGAGTATGT TCACCTGAGC
GAAATTGAAA AGGGGATCGG ACTTTATATA GAAATGCTGA AGCCGGTTGT TAACGGGAAA
TAA
 
Protein sequence
MESDLFKESI KRDAVELLKQ LISIPSFSKE EDKTADAIEA FLQQRNIKTH RKLNNIWAYN 
KYFDAAKPTL LLNSHHDTVK PNSGYTRDPY AATVEDDKLY GLGSNDAGGC LVSLIATFLY
YYDQEGLNYN ICLAATAEEE ISGNNGLECI LPDLGELEFA IVGEPTLMNL AIAERGLLVL
DCTSYGKAGH AAREEGDNAI YKALKDIEWF RNYRFSKVSE MFGPLKMSVT IINAGSQHNV
VPASCIFTVD VRVTDAYTNE EVLKIIRTNV DCEVVPRSIR LKPSSIDKEH PIVQSGIALG
RTTYGSPTTS DQALLSIPSL KVGPGDSARS HMADEYVHLS EIEKGIGLYI EMLKPVVNGK