Gene Phep_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2046 
Symbol 
ID8253150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2363605 
End bp2364735 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content50% 
IMG OID644935694 
Productpeptidase S58 DmpA 
Protein accessionYP_003092313 
Protein GI255531941 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.173525 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00567247 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAAGAA AGACCTTGTT CCTCTTGTGT TTATTGTCTC CCCTGATTGG CGCTGCTCAA 
AAAAAAATGA GGGCCAGAGA CTATGGCATC AACATTGGTG TGTTACCTGT TGGCCTGTTC
AATGCCATTA CCGATGTTCC CGGGGTTAAA GTTGGCCATA CCACCCTTAT TAAGGGCAAT
CATATCCGAA CAGGCGTTAC CGCCATATTG CCGCACTCGG GCAACCTTTT TCAGCAAAAA
GTACCGGCTG CCATATTTGC CGGAAACGGA TTTGGCAAGC TTGCCGGAAG CACACAGGTC
ATGGAGCTGG GGAACCTGGA AAGCCCCGTT GTGCTCACCA ATACCTTAAA TGTGGCAACC
GCTATGGATG CTGTAGTTGG CTATACCCTG CAGCAAAAAG GAAATGAGAA AGTGCAATCT
GTAAATGCGC TTGTGGGTGA AACCAACGAT GGCTATTTAA ACGACATCAG GGGAAGGCAT
GTTAGCCGGC AGGATGTGCT CCAGGCTATC CAGACTGCTA CAGGCGGAAA TGTGACCGAA
GGGAATGTTG GCGCCGGCAC TGGCACTGTC TGTTTCGGTT TTAAAGGCGG TATCGGCACT
TCATCCAGAA AATTACCCAA AAGCATGGGT GGCTATACCA TTGGTGTAAT TGTACAAACC
AATTTTGGCG GTGTATTGCA GATTGCAGGT GCCCCTGTTG GTAAAGAGCT GGGTACTTTT
ACTTTCAGCA ACCAGCTGCT GAACAACGTA GACGGATCCT GCATGATTGT AGTAGCTACG
GATGCGCCCG TAGACAGCCG GAACCTGGAG CGTCTGGCGA AACGGGCATT TATGGGACTG
GCCAAAACAG GGGGCATTGC CTCAAACGGC AGTGGCGATT ATGTTATTGC ATTCTCTACG
GCCGAACAGC TGAGAATTGC CCACAGCCCT GCCAGCCCAA CACAGGGCAC CGAACTGTTG
ACAAACGATT ACACCTCGGC TTTGTTTATG GGGGCTATAG AAGCGACAGA AGAAGCCATC
ATCAATTCCC TTTTTGCAGC AGAAAACATG AAAGGCAACG GCAAGGAAGT CGCCGCCCTT
CCGGCCGATA AAGTTATCCC GATCTTAAAA CATTACAACA CCGTAAAATA A
 
Protein sequence
MLRKTLFLLC LLSPLIGAAQ KKMRARDYGI NIGVLPVGLF NAITDVPGVK VGHTTLIKGN 
HIRTGVTAIL PHSGNLFQQK VPAAIFAGNG FGKLAGSTQV MELGNLESPV VLTNTLNVAT
AMDAVVGYTL QQKGNEKVQS VNALVGETND GYLNDIRGRH VSRQDVLQAI QTATGGNVTE
GNVGAGTGTV CFGFKGGIGT SSRKLPKSMG GYTIGVIVQT NFGGVLQIAG APVGKELGTF
TFSNQLLNNV DGSCMIVVAT DAPVDSRNLE RLAKRAFMGL AKTGGIASNG SGDYVIAFST
AEQLRIAHSP ASPTQGTELL TNDYTSALFM GAIEATEEAI INSLFAAENM KGNGKEVAAL
PADKVIPILK HYNTVK