Gene Phep_2434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2434 
Symbol 
ID8253541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2828668 
End bp2829813 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content45% 
IMG OID644936084 
ProductMembrane dipeptidase 
Protein accessionYP_003092700 
Protein GI255532328 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.911642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT TTCTGCCTGT CGCCCTGCTT TTCAGCTTTT CCAGTATAAT GGCCCAGCAA 
CGCCTGCTGG TTGACACCCA TAACGATGTT TTCTCTTTTC AGCTGGTCAC AAAAGCAGAT
CTGGGCAAAC TGCAGCCTGT CGGAAATTTC GACCTCATAA GGGCAGCAAA AGGCGGATTA
AATGCCCAGG TATTTTCCAT CTGGTGTGGT GAAGACTATG GCAAAGGAAA GGGTTTTGCA
CATGCAAACA GGGAAATCGA TTCACTTTAT GCCTTAATTG CCCGTTATCC GGATAAGATG
GCACTGGTAA AAAGCCCGCA AGAATTGAAA AAAGTGCATC AGCAAGGGCG ATTTGCGGCA
ATGATTGGTG TTGAAGGCGG ACACATGATC GAAGACAGGA TGGATTATCT GGACAGCCTG
ATCAAAAGGG GGCTGGTCTA TTTAACACTC ACCTGGAACA ACAGCACCTC CTGGGCCAGC
TCGGCCAGAG ATGAAACTAC AGGAAAAGGT ATGCGGCAGG CGGGTTTAAA TGAGCTGGGT
AAACAGATCG TAAAAACGCT GAACAAGAAT GGCGTGCTGG TAGATGTTTC ACACGCAGGT
GAAAAAACAT TTTATGATGT GCTGGCTACC AGCACCAAAC CCGTTATTGC TTCGCACAGC
AATGCGTATG CATTGACACC CCACCGGCGT AACCTCAAAG ATGAACAGTT AAAGGCACTG
GCAAAAAACG GCGGGGTGGT ATTTGTTAAC TTCTATAGTG GTTTCCTTGA TAGCAGTTAT
GACCATCGTG TGCAGGATTT TCTTCAGCTG CATGAAAAAG AGCTGGATTC CCTGAACAAC
ATTTACACAA AGGATTTTGC GCCAACCATG TTAAACGTAA TGTATAAAAA AGAAGCTGAT
CAGATCCGTC CGCCATTAAG CGCACTGGTC AAACACATCG ATTATATGGT AAAACTAATT
GGTGTCGACC ATGTTGGTAT CGGCTCAGAT TTCGATGGCT CTGAATCATT TCCGGCAGGA
ATGGATTCTG TTGCCGATTA TCCAAAACTG GAAACCGCGC TTAAAAACAT AGGTTACAAA
AAGGAATGGA TAGATAAGAT CTTCGGAGGG AACTTTGTCA GGGTCTGGGA AGCCAGTAAA
AAATAA
 
Protein sequence
MKKFLPVALL FSFSSIMAQQ RLLVDTHNDV FSFQLVTKAD LGKLQPVGNF DLIRAAKGGL 
NAQVFSIWCG EDYGKGKGFA HANREIDSLY ALIARYPDKM ALVKSPQELK KVHQQGRFAA
MIGVEGGHMI EDRMDYLDSL IKRGLVYLTL TWNNSTSWAS SARDETTGKG MRQAGLNELG
KQIVKTLNKN GVLVDVSHAG EKTFYDVLAT STKPVIASHS NAYALTPHRR NLKDEQLKAL
AKNGGVVFVN FYSGFLDSSY DHRVQDFLQL HEKELDSLNN IYTKDFAPTM LNVMYKKEAD
QIRPPLSALV KHIDYMVKLI GVDHVGIGSD FDGSESFPAG MDSVADYPKL ETALKNIGYK
KEWIDKIFGG NFVRVWEASK K