Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1241 |
Symbol | |
ID | 8252339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1460548 |
End bp | 1462713 |
Gene Length | 2166 bp |
Protein Length | 721 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644934894 |
Product | peptidase S9B dipeptidylpeptidase IV domain protein |
Protein accession | YP_003091519 |
Protein GI | 255531147 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.349009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAC TGATTTTACT GATCTATCTG TTGGCTGTAG CATCTGGTAG TTTTGCACAG CAGCAGCAGC AATTGAGCAT GCAGGATGCG ATGAGCAATG CCCGGACAAC GCTGGCACCC GAAAACTTAT CCCAACTTCA ATTTATTTAT GGAACAGAGG ATTATGTATA TGCCAAACGT ATCGGCAATA GTCCGGTTTG GTTAAGTGGC AATGCCAAAT CAAAGGAAGA CCAGCCTTTT CTGACCTTAA CACAATTAAA CCAGAAATTA AGGAACGCGA AGAAGGATAC TTTGAAGATG ATGCCCGTGA TCCAGTTTAA CCAGGGGCCG GAATGGATTT TAAACCTTAA CGGTAGTAAA GTAGCGATCA ACCCGGTTAA AAATACGGTA GATGTATTGG TAGACCAGTC GTTAATGGCA AAAACAAACG CAGAGGAAAG CAAGGCCGGT TATGTGGCTT ATCTGGATAA TTTCAACTTG TTTGTTGCTA AAGACGGGGA TCGGAAACAG GTTACTACTG ATGGTAACAG TGATATTGTT TACGCTTCTT CGGTACACAG GGAAGAGTTC GGGATCAGTA AAGGAACTTT CTGGAGTAAT AATGGTAAGG TGCTTGCTTT CTACAGAATG GACCAGCGAA TGGTTACAGA TTATCCGATC ATCGACTGGA CCAGCCGGCC TGCTCACAAT GTAAACATCA AATATCCTAT GGCGGGTGAC AAGAGCCATC ATGTTACTGT GGGGGTGTAT CATGCAGAAA CTAAAGCTGT AGTGTATTTG AAAACCGGCG AGCCGGCAGA GCAGTATTTA ACAAATATTG CCTGGAGTCC GGATGATAAA TATGTTTATA TAGCGGTATT GAACCGTGGA CAAAATCACA TGAAGCTAAA CCAGTACGAC GCGGCTACAG GCGATTTTGT GAAAACCTTA TTTGAAGAGA AAGATGATAA ATATGTAGAG CCACTGGTGC CGATGTTATT CCTGAAAAAT GATCCTTCAA AATTTATATG GCAAAGCAAC AGGGATGGCT GGAACCATTT ATACCTGTAC GATTTAAAAG GCAGGGTGGT AAAACAACTA ACCAGGGGGG CATGGGAAGT GCTGGAGGTA AAAGGTTTTG ATGCTAAAGG TGAGCGGCTG TTTTACGTTT CAACGGAAGA GTCGCCGGTA ACCAGGAATT TATATGTATT AAATGTGAAA TCTGGTCAGT CGCGCAGGCT TACATCTGCT TTTGCGGTAC ACAATACGCA GGTAAGCATT TCCGGAAATA CTGTAATTGA TGTTTACAGT ACACCTGATG TGCCCAGGGT GATCCAGCTT GTAGAAACAC CTGGTTCAAA AGCTAAGTTA TTGTTGAAGT CTGCAAACCC CTTGTCGGCT TATGCTACAG AAAACTCATC GATATTTACC ATTAAAAGTA AATCGGGTGA GGACTTGTAT ATGAACCTGT ACAAGCCGGT AAATTATGAT GCCGGTAAAA AATATCCTGT AGTGGTTTAC TGGTATGGCG GTCCGCATGC ACAGCTGATC ACCAACAGCT GGAATGCCGG TGCAGGCGAT TACTGGTCGC GGTATATGGC GCAACGGGGT TATGTAGTGC TTACGGTTGA TGTAAGGGGT AGCGACAACA GGGGCAGGGC CTTTGAACAA TCTATGTTCC GCAGGGCAGG TGAGGTACAG ATGGAAGATA TGATGAGTGC CGTGGATTAT CTGAAAGCTC AGCCTTATGT AGATGCAGCC AACATGGGCT TATTTGGCTG GAGCTTTGGT GGCTTTGCCA CTACAGATTT TATGCTGACC CACCCGGGTG TGTTTAAAGC TGCCGTAGCT GGCGGGCCGG TAATAAACTG GGCCTTTTAT GAGATCATGT ATACCGAACG TTATATGGAT ACCCCACAGG AAAACCCTGA AGGTTATGCC GCGACTTACC TGAGTAACCG TGTTGATCAG CTGAAAGGAA AGTTATTGCT TATCCATGGA TTACAGGATC CGGTTGTAGT ACAGCAGCAT TCGGTCGATT TTGTGAAACA TGCGGTTGAT AAAGGTGTAC AGGTAGATTA CATGATCTAT CCTGGTCATG AGCACAATGT ATTGGGTAAA GACCGGGTGC AGCTGTATCA GAAAGTAACG GATTATTTTG AACTGTACCT GAAAGGGGGA AAATAA
|
Protein sequence | MKRLILLIYL LAVASGSFAQ QQQQLSMQDA MSNARTTLAP ENLSQLQFIY GTEDYVYAKR IGNSPVWLSG NAKSKEDQPF LTLTQLNQKL RNAKKDTLKM MPVIQFNQGP EWILNLNGSK VAINPVKNTV DVLVDQSLMA KTNAEESKAG YVAYLDNFNL FVAKDGDRKQ VTTDGNSDIV YASSVHREEF GISKGTFWSN NGKVLAFYRM DQRMVTDYPI IDWTSRPAHN VNIKYPMAGD KSHHVTVGVY HAETKAVVYL KTGEPAEQYL TNIAWSPDDK YVYIAVLNRG QNHMKLNQYD AATGDFVKTL FEEKDDKYVE PLVPMLFLKN DPSKFIWQSN RDGWNHLYLY DLKGRVVKQL TRGAWEVLEV KGFDAKGERL FYVSTEESPV TRNLYVLNVK SGQSRRLTSA FAVHNTQVSI SGNTVIDVYS TPDVPRVIQL VETPGSKAKL LLKSANPLSA YATENSSIFT IKSKSGEDLY MNLYKPVNYD AGKKYPVVVY WYGGPHAQLI TNSWNAGAGD YWSRYMAQRG YVVLTVDVRG SDNRGRAFEQ SMFRRAGEVQ MEDMMSAVDY LKAQPYVDAA NMGLFGWSFG GFATTDFMLT HPGVFKAAVA GGPVINWAFY EIMYTERYMD TPQENPEGYA ATYLSNRVDQ LKGKLLLIHG LQDPVVVQQH SVDFVKHAVD KGVQVDYMIY PGHEHNVLGK DRVQLYQKVT DYFELYLKGG K
|
| |