Gene Plim_2538 details


Gene Information

Locus tag: Plim_2538
Symbol:
ID: 9139249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 3297854
End bp: 3300874
Gene Length: 3021 bp
Protein Length: 1006 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: Dipeptidyl-peptidase IV
Protein accession: YP_003630562
Protein GI: 296122784
COG category:
COG ID:
TIGRFAM ID:
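
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 3297854
END_BP = 3300874

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3021
print(protein_length(length_bp))  # 1006
```

Both results agree with the Gene Length and Protein Length fields listed in the record.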


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTATC GCTCTCTCTA TTTATCAGCA GTGGTTGCGA GTGCTGTCGT TCTTTCCATG 
CAGTTCTCTC CTGGCCATCG GGTCTCTGCT GCCGACAATC CGGCTGAGAA ACTCACCCTC
GATCGAATTT TTAACTCGAA AGATTTCGAC GAACAACGTG TGGGGAGCTT CCAATGGAGC
CGGCTCTCGA ACAGCTATTT TTCATTCGAG AAAGAGAATC CTCAGGCGAA GTTTGTCAGC
CTGATGCGGA TTAACATCCA GACGGGGGCG AAAGAAGTTG TCATTGCAGG GAACCTGCTC
ATTCCACCAG GAAGCGAACA GCCACTGGCC GTGCAGCGTT TTCAGTTCAC CCAGGACGAA
TCAAAACTGC TGATCTATAC CAACAGCCAG AAAGTCTGGC GGCAGAATAC TCGTGGCGAC
TACTGGCTGT TTGATCTCAA ACAGCAGAAG TTGACCAAGC TGGGCGGGAC AACTCCCCCC
GCGCAGATGA TGTTTGCCAA GATCTCTCCC GATCAAACGA AAGTCGCTTT CGTCTATGCT
CATAACCTCT ATGTTCAATC ACTGACGGAC TGGACGGTGA CTCCTTTAAC CACGGATGGT
TCGGAGACGT TGATCAATGG GACTTCCGAT TGGGTGAACG AAGAGGAACT CGCGATTCGA
GATGGCTGGC GCTGGAGTCC GGATAGCCAG TCGATTGCCT TTTATCAGTT TGATACGACC
GGTGTGCCAC GCTTCACCAT GATTGATCAT GGCCAGCAGA ACTATCCCAA GGTGATCACA
TTCCCTTACC CGAAAGTGGG TGAAAAGAAT TCATCGACGC GTGTTGGTGT GGTCAACATC
TCTGGTGGCA AGCCCACCTG GATTGAACTC CCCGGTGATC CTCGCGAGCA CTACATCCCG
CAGGTCGAAT GGACACCGGC GGGTGGCCAA CTCCTGATTC AACAGATGAA TCGTCCGCAG
AACCGCAATA TCGTCTTCCT GGTCGATGTG ACTTCCGGAA AACTTCGTAC GGTGATGACA
GAAGTCGAAG AGACCTGGAT TGAGAACGAC AATCCCGTCA AATGGCTGGC AGGCGGCAAG
GAATTCCTCT GGATCAGTGA GCGTTCGGGC TGGAGGCACG TCTACCGTGC GAATCTGGAA
AGTGGCGAAC TTCAGGTCAT CACTAGGGGA TCCTTCGATG TGATTGATGT CGAAGCCATC
GATGAGACCA AAGGGATTCT CTACTTCGCT GCTTCGCCGG AGAATCCGAC TCAGCGCTAT
CTGTATCAGG TGCCCCTTCG AGGTGGCGAA ATTCAGCGGG TGACTCCCGC CGGGCAATCC
GGCTGGCATA CCTATCAGAT CGCACCCTCC TTTGAAGTTG CGATCCATAC TTTCTCGAAT
CTGACGACCC CTCCGCTAAC TGAAGTTGTT CGACTTCCCA GCCATGAGGT CGTTCGTACG
CTCGCTGATA ATCAGGTGCT TAAAGACAAG CTGGCACAGT GGAAGTTCCC GCAGCCGGAA
CTCTTCCGGG TTGATATTGG AAATGGCATT GAACTCGATG GCTGGCGGTT TGCACCGGCA
CACACAAAAG GTGAAAAGCA GCATCCTCTA TTCCTGCATG TCTATGGAGA GCCGCATGGG
CAGGTGGTGC GTGATGTCTG GATGGGGAAA CGCGGCTTCT GGTACAGCAT GCTGGCCCAG
GAGGGCTATA TCGTCGCGGC TGTTGATAAT CGCGGCACCA TGTCTCCCCG CGGGCGTGAC
TTCCGCAAAT GCGTTTATAA GCAGATCGGG CTTCTCGCCT CACAAGAGCA GGCACTGGCT
GTCAAAGCCC TGTTGGCGAA GTGGCCCTTT GCAGATCCTG CCCGCGTAGG GATCTGGGGC
TGGAGTGGTG GTGGCAGTAT GAGCCTCAAT GCACTCTTCC GCTATCCCGA GATTTACAAG
ATGGCGATTG CTGTCGCGCC AATGCCTAAT CAGAAGCTCT ACGATACGAT CTATCAGGAG
CGCTATATGG GGTTATTAGG CGATAACCAG GAGGGATACA AGCAGGGCTC GCCGACGACG
TTTGCGAAAC AACTGCAAGG TGATCTGCTG CTGATTCATG GGACGGGCGA CGACAATTGC
CATTATCAGG CAACCGAGCA GTTGATGAAC GAATTGATTG CGCATGGAAA GCAGTTTTCA
GTCATGCCCT ATCCCAGCCG GTCGCATAGT ATCAGTGAAG GGCGGGGAAC GAACTTTCAT
CTCTATCAGC TCATGACGAA TTATATCCAT GAGAAGCTCC CTCTAAAGAG TTCCCCGGCA
AGTGAAAAAG CACCTGTTCC AGTAGCGGTT GCAGACAAGA CCCAGTCAGG CATTTCGCAA
GCTTCGAAGG AACCTGTGGA GAAAGAAAAA TCCGGGGTTG AAGCCTACGA TGAGGCTTTC
ATTCAAGGCT GGACAGTTCG CATTCATCGG CAACTCAAGG TCGAGAATTC CGAGAAGCTG
AAGAAGGCAC TGGAACTTCT GGAGGCACAA CTCAAAGAAG TCGTGCAGGT GGTACCGCCT
ATGGCAGTAC AGGAGCTGAA GAAAGTCACA ATATGGTTGT CGCCAGAATA TAAGGGGATT
CCTCCCAAGG CTGTGTATCA CCCCAGTCGT CAATGGCTGG TGGCGAACAA TCGACTTCCC
GAAATGGCCC GCGCGGTTGA ATTCACCAAC GTGCTCATTT TTGAAGAGGA GTCTCGTCGC
ATGCCCAATA TCACCCTGCA TGAACTGGCA CATGCTTACC ATGACCGCGT GCTGACGGGG
GGCTATGCAA ATGCTGAAAT CATTCGTGCC TATGAGGCTG CCAAAGAATC AGGAAAGTAT
GAGCAGGTCG AACAGAGATT TGGTAATGGT CGTTCAGTAA AGACCAGAGC TTATGCGATG
ACCAGTCCGA TGGAATACTT CTCCGAGTCG AGCGAAGCCT ACTTCTCAAC GAATGATTTC
TTCCCATTCA ACCGCGAAGA ACTTGTTCAG CATGATCCCA GTTTAGTGCC TGTGCTGAAG
GAGCTTTGGG GTGAAAGGTA G
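
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be stripped before the sequence can be used for checks such as length or GC content. The following is a minimal sketch using only the Python standard library. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above; paste the full block to check the whole gene.
first_row = "ATGAATTATC GCTCTCTCTA TTTATCAGCA GTGGTTGCGA GTGCTGTCGT TCTTTCCATG"

seq = clean_sequence(first_row)
print(len(seq))                    # 60 (six blocks of 10 bases)
print(round(gc_fraction(seq), 2))  # GC fraction of this row only
```

Applied to the full sequence, `len(seq)` should match the Gene Length field and `gc_fraction(seq)` can be compared with the GC content field, assuming the page reports GC content for this same sequence.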
 
Protein sequence
MNYRSLYLSA VVASAVVLSM QFSPGHRVSA ADNPAEKLTL DRIFNSKDFD EQRVGSFQWS 
RLSNSYFSFE KENPQAKFVS LMRINIQTGA KEVVIAGNLL IPPGSEQPLA VQRFQFTQDE
SKLLIYTNSQ KVWRQNTRGD YWLFDLKQQK LTKLGGTTPP AQMMFAKISP DQTKVAFVYA
HNLYVQSLTD WTVTPLTTDG SETLINGTSD WVNEEELAIR DGWRWSPDSQ SIAFYQFDTT
GVPRFTMIDH GQQNYPKVIT FPYPKVGEKN SSTRVGVVNI SGGKPTWIEL PGDPREHYIP
QVEWTPAGGQ LLIQQMNRPQ NRNIVFLVDV TSGKLRTVMT EVEETWIEND NPVKWLAGGK
EFLWISERSG WRHVYRANLE SGELQVITRG SFDVIDVEAI DETKGILYFA ASPENPTQRY
LYQVPLRGGE IQRVTPAGQS GWHTYQIAPS FEVAIHTFSN LTTPPLTEVV RLPSHEVVRT
LADNQVLKDK LAQWKFPQPE LFRVDIGNGI ELDGWRFAPA HTKGEKQHPL FLHVYGEPHG
QVVRDVWMGK RGFWYSMLAQ EGYIVAAVDN RGTMSPRGRD FRKCVYKQIG LLASQEQALA
VKALLAKWPF ADPARVGIWG WSGGGSMSLN ALFRYPEIYK MAIAVAPMPN QKLYDTIYQE
RYMGLLGDNQ EGYKQGSPTT FAKQLQGDLL LIHGTGDDNC HYQATEQLMN ELIAHGKQFS
VMPYPSRSHS ISEGRGTNFH LYQLMTNYIH EKLPLKSSPA SEKAPVPVAV ADKTQSGISQ
ASKEPVEKEK SGVEAYDEAF IQGWTVRIHR QLKVENSEKL KKALELLEAQ LKEVVQVVPP
MAVQELKKVT IWLSPEYKGI PPKAVYHPSR QWLVANNRLP EMARAVEFTN VLIFEEESRR
MPNITLHELA HAYHDRVLTG GYANAEIIRA YEAAKESGKY EQVEQRFGNG RSVKTRAYAM
TSPMEYFSES SEAYFSTNDF FPFNREELVQ HDPSLVPVLK ELWGER