Gene Phep_2364 details


Gene Information

Locus tag: Phep_2364
Symbol:
ID: 8253471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2760225
End bp: 2761679
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 42%
IMG OID: 644936014
Product: aminoacyl-histidine dipeptidase
Protein accession: YP_003092630
Protein GI: 255532258
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
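
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2760225, 2761679)
print(length)                  # 1455
print(protein_length(length))  # 484
```

Both values match the Gene Length and Protein Length fields of the record.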


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.807685
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.043104
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTTG AAAATTTAGA ACCAAAGGCA CTATGGAACA ATTTTGTTGC TTTAAATGCC 
ATACCCCGTG CTTCAAAAAA AGAAACCCAG GTGATAAAAT TTATGCTTGC TTTTGGCAAA
CAGCTCGGAC TGGAAACCAT TAAAGACCAT GTTGGAAATG TTGTGATCAA AAAGCCGGCA
ACTGAGGGCA TGAAGGATAA ACAAACTGTC ATCCTGCAGT CGCACCTGGA TATGGTCCAT
CAAAAAAATA GCGATACAGA TTTTGATTTC GATAGGGAAG GCATTAAGAT GTATGTTGAT
GGAGATTGGG TAAAAGCCAG AGGCACCACC CTTGGGGCTG ATAATGGAAT TGGTGTAGCT
ACAATTATGG CTATACTGGC GGCTTCCGAC CTGGTGCACC CTACCATTGA AGCCTTGTTT
ACCATAGATG AAGAAACAGG TATGACCGGT GCAAAGGAAC TTGACCCGTC TAATTTATCT
GGAAAAATAT TGTTAAACCT CGATACTGAA GAAGACACAG AATTAACCAT TGGTTGTGCT
GGCGGCATAG ATACAACAAC TACATATCAT TATCATACCC ATCCTGTTGC CAAAAACAGC
ACTGCGTTCC AGATATCAAT TAAAGGCCTG ATTGGCGGGC ATTCCGGAAT GGACATTCAT
AAGGGACGTG CCAATGCCAA TAAGCTGATG AACCGTCTGC TATATAATGG AAATAAAGTT
TTAGACCTGC AGCTGGCCAG CGTTGAAGGT GGAAGCCTGA GAAATGCGAT CCCACGTGAG
TCTGTTGCGG TAGTTGCAGT TTCGGGAAAT CAGAAAAAGG CATTTCTTTC CTTTATAGCA
GATTTTACGG AGGTCATTAA AGCCGAATAC CATGCAATAG AACCTTTTAT GAAAATTACT
GCAGAAGAAA CAGTACTGCC TGCAGAGGTT TTGGAAAAGG AAGAATACAT GGAGATCATC
AATACACTAT ATGCAGTGCC AAATGGCGTG TTCAGGATGA GCCCTGAAAT TCCGGGACTT
GTTGAAGCGT CATCAAATCT GGCAAAAGTG ATCATTAAAG ACGGGGAATT TATTACCTTA
TCGTTACAGC GGAGCAGTGT AGAAAGCACA AAGGAAGATG TTGCTATTGC AGTGGGGGCA
GCTTTTGAGA ATATGGGTTG TAAAGTTAAC AGCAGTGGCG ATTATCCTGG GTGGAAGCCC
AATGCTGCTT CAGAAATACT GTCGCTGATG CGGAGTTTGT ATAAGGTTAA TTTTAAGACT
GAACCCAATG TAAATGCCTG TCATGCCGGT TTGGAATGTG GTATTTTAGG CGCCCATTTA
TCAGAAATGG ACATGATTTC TTTTGGTCCC AATATCCATG GCGCACATTC GCCAGACGAA
CGCGTTCAGA TTTCTTCGGT GAACAAGTTC TGGAACTATC TTTTGAAAGT ACTGGAAGAA
ATACCTGCTC GTTAA
 
Protein sequence
MKVENLEPKA LWNNFVALNA IPRASKKETQ VIKFMLAFGK QLGLETIKDH VGNVVIKKPA 
TEGMKDKQTV ILQSHLDMVH QKNSDTDFDF DREGIKMYVD GDWVKARGTT LGADNGIGVA
TIMAILAASD LVHPTIEALF TIDEETGMTG AKELDPSNLS GKILLNLDTE EDTELTIGCA
GGIDTTTTYH YHTHPVAKNS TAFQISIKGL IGGHSGMDIH KGRANANKLM NRLLYNGNKV
LDLQLASVEG GSLRNAIPRE SVAVVAVSGN QKKAFLSFIA DFTEVIKAEY HAIEPFMKIT
AEETVLPAEV LEKEEYMEII NTLYAVPNGV FRMSPEIPGL VEASSNLAKV IIKDGEFITL
SLQRSSVEST KEDVAIAVGA AFENMGCKVN SSGDYPGWKP NAASEILSLM RSLYKVNFKT
EPNVNACHAG LECGILGAHL SEMDMISFGP NIHGAHSPDE RVQISSVNKF WNYLLKVLEE
IPAR
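
The gene and protein sequences can be cross-checked against each other, and the GC content recomputed, with a short script. This is a minimal sketch using only the Python standard library. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard set of assignments, which translation table 11 shares for internal codons; table 11 differs from the standard code in its permitted start codons, which does not matter here because this gene begins with ATG. The example uses only the first row of the gene sequence; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGAAAGTTG AAAATTTAGA ACCAAAGGCA CTATGGAACA ATTTTGTTGC TTTAAATGCC"
print(translate(first_row))  # MKVENLEPKALWNNFVALNA
```

The printed residues match the first 20 residues of the protein sequence above. Calling `gc_percent` on the full gene sequence gives a value to compare with the GC content field of the record.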