Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2046 |
Symbol | |
ID | 8253150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2363605 |
End bp | 2364735 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644935694 |
Product | peptidase S58 DmpA |
Protein accession | YP_003092313 |
Protein GI | 255531941 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.173525 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00567247 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAAGAA AGACCTTGTT CCTCTTGTGT TTATTGTCTC CCCTGATTGG CGCTGCTCAA AAAAAAATGA GGGCCAGAGA CTATGGCATC AACATTGGTG TGTTACCTGT TGGCCTGTTC AATGCCATTA CCGATGTTCC CGGGGTTAAA GTTGGCCATA CCACCCTTAT TAAGGGCAAT CATATCCGAA CAGGCGTTAC CGCCATATTG CCGCACTCGG GCAACCTTTT TCAGCAAAAA GTACCGGCTG CCATATTTGC CGGAAACGGA TTTGGCAAGC TTGCCGGAAG CACACAGGTC ATGGAGCTGG GGAACCTGGA AAGCCCCGTT GTGCTCACCA ATACCTTAAA TGTGGCAACC GCTATGGATG CTGTAGTTGG CTATACCCTG CAGCAAAAAG GAAATGAGAA AGTGCAATCT GTAAATGCGC TTGTGGGTGA AACCAACGAT GGCTATTTAA ACGACATCAG GGGAAGGCAT GTTAGCCGGC AGGATGTGCT CCAGGCTATC CAGACTGCTA CAGGCGGAAA TGTGACCGAA GGGAATGTTG GCGCCGGCAC TGGCACTGTC TGTTTCGGTT TTAAAGGCGG TATCGGCACT TCATCCAGAA AATTACCCAA AAGCATGGGT GGCTATACCA TTGGTGTAAT TGTACAAACC AATTTTGGCG GTGTATTGCA GATTGCAGGT GCCCCTGTTG GTAAAGAGCT GGGTACTTTT ACTTTCAGCA ACCAGCTGCT GAACAACGTA GACGGATCCT GCATGATTGT AGTAGCTACG GATGCGCCCG TAGACAGCCG GAACCTGGAG CGTCTGGCGA AACGGGCATT TATGGGACTG GCCAAAACAG GGGGCATTGC CTCAAACGGC AGTGGCGATT ATGTTATTGC ATTCTCTACG GCCGAACAGC TGAGAATTGC CCACAGCCCT GCCAGCCCAA CACAGGGCAC CGAACTGTTG ACAAACGATT ACACCTCGGC TTTGTTTATG GGGGCTATAG AAGCGACAGA AGAAGCCATC ATCAATTCCC TTTTTGCAGC AGAAAACATG AAAGGCAACG GCAAGGAAGT CGCCGCCCTT CCGGCCGATA AAGTTATCCC GATCTTAAAA CATTACAACA CCGTAAAATA A
|
Protein sequence | MLRKTLFLLC LLSPLIGAAQ KKMRARDYGI NIGVLPVGLF NAITDVPGVK VGHTTLIKGN HIRTGVTAIL PHSGNLFQQK VPAAIFAGNG FGKLAGSTQV MELGNLESPV VLTNTLNVAT AMDAVVGYTL QQKGNEKVQS VNALVGETND GYLNDIRGRH VSRQDVLQAI QTATGGNVTE GNVGAGTGTV CFGFKGGIGT SSRKLPKSMG GYTIGVIVQT NFGGVLQIAG APVGKELGTF TFSNQLLNNV DGSCMIVVAT DAPVDSRNLE RLAKRAFMGL AKTGGIASNG SGDYVIAFST AEQLRIAHSP ASPTQGTELL TNDYTSALFM GAIEATEEAI INSLFAAENM KGNGKEVAAL PADKVIPILK HYNTVK
|
| |