Gene Information |
Locus tag | Plim_2538 |
Symbol | |
ID | 9139249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 3297854 |
End bp | 3300874 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | Dipeptidyl-peptidase IV |
Protein accession | YP_003630562 |
Protein GI | 296122784 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATC GCTCTCTCTA TTTATCAGCA GTGGTTGCGA GTGCTGTCGT TCTTTCCATG CAGTTCTCTC CTGGCCATCG GGTCTCTGCT GCCGACAATC CGGCTGAGAA ACTCACCCTC GATCGAATTT TTAACTCGAA AGATTTCGAC GAACAACGTG TGGGGAGCTT CCAATGGAGC CGGCTCTCGA ACAGCTATTT TTCATTCGAG AAAGAGAATC CTCAGGCGAA GTTTGTCAGC CTGATGCGGA TTAACATCCA GACGGGGGCG AAAGAAGTTG TCATTGCAGG GAACCTGCTC ATTCCACCAG GAAGCGAACA GCCACTGGCC GTGCAGCGTT TTCAGTTCAC CCAGGACGAA TCAAAACTGC TGATCTATAC CAACAGCCAG AAAGTCTGGC GGCAGAATAC TCGTGGCGAC TACTGGCTGT TTGATCTCAA ACAGCAGAAG TTGACCAAGC TGGGCGGGAC AACTCCCCCC GCGCAGATGA TGTTTGCCAA GATCTCTCCC GATCAAACGA AAGTCGCTTT CGTCTATGCT CATAACCTCT ATGTTCAATC ACTGACGGAC TGGACGGTGA CTCCTTTAAC CACGGATGGT TCGGAGACGT TGATCAATGG GACTTCCGAT TGGGTGAACG AAGAGGAACT CGCGATTCGA GATGGCTGGC GCTGGAGTCC GGATAGCCAG TCGATTGCCT TTTATCAGTT TGATACGACC GGTGTGCCAC GCTTCACCAT GATTGATCAT GGCCAGCAGA ACTATCCCAA GGTGATCACA TTCCCTTACC CGAAAGTGGG TGAAAAGAAT TCATCGACGC GTGTTGGTGT GGTCAACATC TCTGGTGGCA AGCCCACCTG GATTGAACTC CCCGGTGATC CTCGCGAGCA CTACATCCCG CAGGTCGAAT GGACACCGGC GGGTGGCCAA CTCCTGATTC AACAGATGAA TCGTCCGCAG AACCGCAATA TCGTCTTCCT GGTCGATGTG ACTTCCGGAA AACTTCGTAC GGTGATGACA GAAGTCGAAG AGACCTGGAT TGAGAACGAC AATCCCGTCA AATGGCTGGC AGGCGGCAAG GAATTCCTCT GGATCAGTGA GCGTTCGGGC TGGAGGCACG TCTACCGTGC GAATCTGGAA AGTGGCGAAC TTCAGGTCAT CACTAGGGGA TCCTTCGATG TGATTGATGT CGAAGCCATC GATGAGACCA AAGGGATTCT CTACTTCGCT GCTTCGCCGG AGAATCCGAC TCAGCGCTAT CTGTATCAGG TGCCCCTTCG AGGTGGCGAA ATTCAGCGGG TGACTCCCGC CGGGCAATCC GGCTGGCATA CCTATCAGAT CGCACCCTCC TTTGAAGTTG CGATCCATAC TTTCTCGAAT CTGACGACCC CTCCGCTAAC TGAAGTTGTT CGACTTCCCA GCCATGAGGT CGTTCGTACG CTCGCTGATA ATCAGGTGCT TAAAGACAAG CTGGCACAGT GGAAGTTCCC GCAGCCGGAA CTCTTCCGGG TTGATATTGG AAATGGCATT GAACTCGATG GCTGGCGGTT TGCACCGGCA CACACAAAAG GTGAAAAGCA GCATCCTCTA TTCCTGCATG TCTATGGAGA GCCGCATGGG CAGGTGGTGC GTGATGTCTG GATGGGGAAA CGCGGCTTCT GGTACAGCAT GCTGGCCCAG GAGGGCTATA TCGTCGCGGC TGTTGATAAT CGCGGCACCA TGTCTCCCCG CGGGCGTGAC TTCCGCAAAT GCGTTTATAA GCAGATCGGG CTTCTCGCCT CACAAGAGCA GGCACTGGCT 
GTCAAAGCCC TGTTGGCGAA GTGGCCCTTT GCAGATCCTG CCCGCGTAGG GATCTGGGGC TGGAGTGGTG GTGGCAGTAT GAGCCTCAAT GCACTCTTCC GCTATCCCGA GATTTACAAG ATGGCGATTG CTGTCGCGCC AATGCCTAAT CAGAAGCTCT ACGATACGAT CTATCAGGAG CGCTATATGG GGTTATTAGG CGATAACCAG GAGGGATACA AGCAGGGCTC GCCGACGACG TTTGCGAAAC AACTGCAAGG TGATCTGCTG CTGATTCATG GGACGGGCGA CGACAATTGC CATTATCAGG CAACCGAGCA GTTGATGAAC GAATTGATTG CGCATGGAAA GCAGTTTTCA GTCATGCCCT ATCCCAGCCG GTCGCATAGT ATCAGTGAAG GGCGGGGAAC GAACTTTCAT CTCTATCAGC TCATGACGAA TTATATCCAT GAGAAGCTCC CTCTAAAGAG TTCCCCGGCA AGTGAAAAAG CACCTGTTCC AGTAGCGGTT GCAGACAAGA CCCAGTCAGG CATTTCGCAA GCTTCGAAGG AACCTGTGGA GAAAGAAAAA TCCGGGGTTG AAGCCTACGA TGAGGCTTTC ATTCAAGGCT GGACAGTTCG CATTCATCGG CAACTCAAGG TCGAGAATTC CGAGAAGCTG AAGAAGGCAC TGGAACTTCT GGAGGCACAA CTCAAAGAAG TCGTGCAGGT GGTACCGCCT ATGGCAGTAC AGGAGCTGAA GAAAGTCACA ATATGGTTGT CGCCAGAATA TAAGGGGATT CCTCCCAAGG CTGTGTATCA CCCCAGTCGT CAATGGCTGG TGGCGAACAA TCGACTTCCC GAAATGGCCC GCGCGGTTGA ATTCACCAAC GTGCTCATTT TTGAAGAGGA GTCTCGTCGC ATGCCCAATA TCACCCTGCA TGAACTGGCA CATGCTTACC ATGACCGCGT GCTGACGGGG GGCTATGCAA ATGCTGAAAT CATTCGTGCC TATGAGGCTG CCAAAGAATC AGGAAAGTAT GAGCAGGTCG AACAGAGATT TGGTAATGGT CGTTCAGTAA AGACCAGAGC TTATGCGATG ACCAGTCCGA TGGAATACTT CTCCGAGTCG AGCGAAGCCT ACTTCTCAAC GAATGATTTC TTCCCATTCA ACCGCGAAGA ACTTGTTCAG CATGATCCCA GTTTAGTGCC TGTGCTGAAG GAGCTTTGGG GTGAAAGGTA G
|
Protein sequence | MNYRSLYLSA VVASAVVLSM QFSPGHRVSA ADNPAEKLTL DRIFNSKDFD EQRVGSFQWS RLSNSYFSFE KENPQAKFVS LMRINIQTGA KEVVIAGNLL IPPGSEQPLA VQRFQFTQDE SKLLIYTNSQ KVWRQNTRGD YWLFDLKQQK LTKLGGTTPP AQMMFAKISP DQTKVAFVYA HNLYVQSLTD WTVTPLTTDG SETLINGTSD WVNEEELAIR DGWRWSPDSQ SIAFYQFDTT GVPRFTMIDH GQQNYPKVIT FPYPKVGEKN SSTRVGVVNI SGGKPTWIEL PGDPREHYIP QVEWTPAGGQ LLIQQMNRPQ NRNIVFLVDV TSGKLRTVMT EVEETWIEND NPVKWLAGGK EFLWISERSG WRHVYRANLE SGELQVITRG SFDVIDVEAI DETKGILYFA ASPENPTQRY LYQVPLRGGE IQRVTPAGQS GWHTYQIAPS FEVAIHTFSN LTTPPLTEVV RLPSHEVVRT LADNQVLKDK LAQWKFPQPE LFRVDIGNGI ELDGWRFAPA HTKGEKQHPL FLHVYGEPHG QVVRDVWMGK RGFWYSMLAQ EGYIVAAVDN RGTMSPRGRD FRKCVYKQIG LLASQEQALA VKALLAKWPF ADPARVGIWG WSGGGSMSLN ALFRYPEIYK MAIAVAPMPN QKLYDTIYQE RYMGLLGDNQ EGYKQGSPTT FAKQLQGDLL LIHGTGDDNC HYQATEQLMN ELIAHGKQFS VMPYPSRSHS ISEGRGTNFH LYQLMTNYIH EKLPLKSSPA SEKAPVPVAV ADKTQSGISQ ASKEPVEKEK SGVEAYDEAF IQGWTVRIHR QLKVENSEKL KKALELLEAQ LKEVVQVVPP MAVQELKKVT IWLSPEYKGI PPKAVYHPSR QWLVANNRLP EMARAVEFTN VLIFEEESRR MPNITLHELA HAYHDRVLTG GYANAEIIRA YEAAKESGKY EQVEQRFGNG RSVKTRAYAM TSPMEYFSES SEAYFSTNDF FPFNREELVQ HDPSLVPVLK ELWGER
|
| |
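The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `strip_blocks`) are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(seq.split()).upper()


# Values taken from the record above (Start bp, End bp).
length = gene_length(3297854, 3300874)
print(length)                  # 3021, matching "Gene Length"
print(protein_length(length))  # 1006, matching "Protein Length"

# First two 10-base blocks of the gene sequence, joined for downstream tools.
print(strip_blocks("ATGAATTATC GCTCTCTCTA"))
```

Under these assumptions the computed values agree with the Gene Length (3021 bp) and Protein Length (1006 aa) fields. The strand field is not needed for the length check, since length is independent of orientation.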