Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_4195 |
Symbol | |
ID | 9140916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 5363786 |
End bp | 5366863 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003632202 |
Protein GI | 296124424 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.17632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAC GTAAATCTGC CTCCACTCTT GCTCAACCAC AACCAATCCC TGGGGTTGAT GATGCTCGAA TTGAAGCGGA AACAACTCAA CTGGGTGAAT TGATCTGGCG ACGACTCGCT GCCAGACAAC CATCATTCTT TGAACAGCGC TGGTGGGATG ATCGCATTCT GGCAGCCGCC ATGAATGATG AATCTCTCAA GGTGCAGATG TTCCGGTTTG TGGATGTGTT GCCGCGACTG AAAACGCATC AATCGATGAC GCGTCATCTT CAGGAATACT TTGACGAGAT TCGTGAGCAT CTTCCGTGGG CGATCCAACT GGTCGGATTT GGTGTGGAGC AGGTCGCTCC TAATTCGATT TTATCCCGAA CACTCGCATT TAATGCCCGG AATAACGCTC AACGGATTGC CAAGCGTCTG ATGGGTGGTG AATCGACCGA TGATGTTCTC AAGACGATTC ATAAACTTTA TCGACAAGGC TATCTCTTCT CGCTCAATTA CCTGAGCTCG AAAGTTGTCA GTCAGGCCGA GGCAGATCAG TACCAGCAGC GATATCTCGA CATGCTTTCG TCATTGGGCG CGGAGGTCAG CAACTGGCCA GCCCATACGT CATATGCCCA TGCTGCTGCG GTCGAGGGTA AGTCAGCTGC GACTGCGGAG ATTCCTCAAC TGCAGCTTTC GCTCAAACTG TCGTCGTTAA CCAGCGACTT TCGCCCACTC GATGCCCAAG GGACCATGCG TGTGGTGCTG GAGCGATTGC GGCCCATTCT CCGTCTGGCC ATCGAGCAGC AGGCGATTGT GCAGATCGAG CTGGAGCATT CGACGACGAA TCGATTGATT CTCGATATTG TTCAGCAGGT TCTCTCGGAA AAAGAATTCC AGAGCTGGGC CGATTGCGGC ATCACTTTGC CGGCCTATCT AAAAACTGCG GAAAGCGATC TGACGGCACT GGCAACCTGG GTTCAAAAAC GCGGTACACC GATTCGCATC TGCCTGACCA AGGGAGAGTA TTGGAATCAG GAAATCGCTC TGGCTCAGTC AAAAAGCTGG CCAGTGGCTG TTTTTGAAGA AGAATGGCAG ATCGACGAGA ACTACGAAAA ACTCTCCCGA GCTTTGATCG ATCAGCCGGA GTTATTCAAG CCCGTATTTG CAGGCCAGAG TTTAAGAAGC CTCTGTTACG TTCTGGCCTA TGCACAGGCC CGCAATTTGC CCCGCTCCCG AGTTGAACTG CAATTGCGAT ACGGTTTGGC CGATGAGCAG GCCCAGGCAT TTGCCGAACT GGGTTGCCGG GTCCGCATTG ATACGCCGGT CGGTCATCAT GTGAAAGGAA TGGCCCGCCT GGCGCGTCAC TTCCTGGAAA ACTCTGCGAA TGATTCGTTT CTCAGGCAAG GCTATTCCGC CGAAGTCTCC ATCGAGGATC TGCTTATGAA TCCCACCGTC GCCGGCCAGA CGGCCCGGGT CCGAAAAACC TCGAAGATGT GGACCGTTCC ACCCCAGGGC TTCGTGAATG AACCAGTGAC TGATTTTTCA ATGCCTGCTC ACCGTGAAGC GATGCAGGGG GCAATTGAAA AGGTCGAACA TCAATTCGGG CAAACCTACT CGTTGATCAT CAATGGTCGC CGGGAAGATA CTCGGCAGAA TCTGACAGTT CGTTCCCCTT CCGATAAGTC GAAAGTGCTG GGACTGGTCG CTTCAGCAAG TCCGGAGCAG GCTCTGGCAG CCATTGATTC GGCCCGACGG GCTTTCGTGC GGTGGTCAGT GATTGAAGCC AGTTATCGAG CTGAATATCT CGAACTGATT GCGCGTGAAC TCAGGCAGCG CCGATTTGAA CTGGCTGCCT GGCAGATCTT TGAATGTGGC AAGCCCTGGG CGGAAGCCGA TGCTGACGTG GCCGAAGCCA TCGACTTCTG CAATTACTAC GCGATGCAGA TGCGCGAACT AGCGGAACCC CAAAGGTTCG ATATTGCCGA CGAAGAGAAC GCTTACTTCT ACCGTCCGCG TGGTGTGGTG GTGGTGATCT CTCCCTGGAA TTTCCCCATG GCTGTCCTGA CCGGGATGGT CGCTGCTGCC CTGGTGGCTG GAAACACCGT CATCATCAAG CCTGCTGAAC AGGCTTCGGT CACGGCTGCG AAACTGATGG AAATCCTCCA GGCCTGTGGT ATTCCGGATG GTGTCATCAC CTTCCTGCCC GGTATTGGGG AAGAGATTGG CCCGGTTCTC GCTGGGAGCC CTGATGTCGA TCTTGTGGCC TTTACCGGTT CCACGGAAGT CGGACTGACA GTGAATCAGT CGGCAGCACA GGTCCATGCG GCTGCACAAT CGATCAAGCG GGTCATTACG ACCTTGAGTG GTCACAATGC TATCATCGTC GATGCCGATG CCGACCTGGA TGATGCCGTC ACGGGTGTGA TTGAAAGTGC CTTCAGTTAT GCCGGGCAGA AGTGCTCATC GTGCTCGCGA GTCGTTGTGA TTGGTGAAAT CTACGACGAG TTCATCAAAC GACTCGTGGC CGCCACTTCT GATTTGAAGC TGGCCCGTGC AGAAGATCCC GCCTGCCAGA TAGGGCCCGT GATTGATGAA GAATCTTGCA AGCGACTTCT GGCACTCATC GAAGATGCTA AGCAGACCTG CGAAGTGATT CTTGCGATGG AAACGGGCTC ACTGGCTAAG AAGGGATACT TCGTCGGCCC GCACATCTTT GCGAATATTC CGAGTGAGTC ACGCCTAAAT AAAGAAGAGA TCTGCGGCCC GATTCTCCTC GTCTACAAAG CCGCCGATCT TTCGGAAGCC TTGGGGATGG CCAACAGCGT GCCTTATGCT CTGGCAGGTG GTTTGTATAG TCGCAGCCCG GCGAATCTTA AACGGGCCAA ACAGCAATTT CTGGCAGGAA ATCTCTATCT GAATACTCCG GTCACGACCG GTCTCGTCGC CCGGCAACCT TTCGGTGGCT TCAAACTCTC CGGGATTGGC AGCAAAACGG GCGGGCCAGA TTACCTGCCA CAGTTCATGG TGCCAGTCAA TGTGACAGAA AACACATCCC GACGCGGCTT CAGTCACGAG ACAACGACTG AAAGTTAA
|
Protein sequence | MAKRKSASTL AQPQPIPGVD DARIEAETTQ LGELIWRRLA ARQPSFFEQR WWDDRILAAA MNDESLKVQM FRFVDVLPRL KTHQSMTRHL QEYFDEIREH LPWAIQLVGF GVEQVAPNSI LSRTLAFNAR NNAQRIAKRL MGGESTDDVL KTIHKLYRQG YLFSLNYLSS KVVSQAEADQ YQQRYLDMLS SLGAEVSNWP AHTSYAHAAA VEGKSAATAE IPQLQLSLKL SSLTSDFRPL DAQGTMRVVL ERLRPILRLA IEQQAIVQIE LEHSTTNRLI LDIVQQVLSE KEFQSWADCG ITLPAYLKTA ESDLTALATW VQKRGTPIRI CLTKGEYWNQ EIALAQSKSW PVAVFEEEWQ IDENYEKLSR ALIDQPELFK PVFAGQSLRS LCYVLAYAQA RNLPRSRVEL QLRYGLADEQ AQAFAELGCR VRIDTPVGHH VKGMARLARH FLENSANDSF LRQGYSAEVS IEDLLMNPTV AGQTARVRKT SKMWTVPPQG FVNEPVTDFS MPAHREAMQG AIEKVEHQFG QTYSLIINGR REDTRQNLTV RSPSDKSKVL GLVASASPEQ ALAAIDSARR AFVRWSVIEA SYRAEYLELI ARELRQRRFE LAAWQIFECG KPWAEADADV AEAIDFCNYY AMQMRELAEP QRFDIADEEN AYFYRPRGVV VVISPWNFPM AVLTGMVAAA LVAGNTVIIK PAEQASVTAA KLMEILQACG IPDGVITFLP GIGEEIGPVL AGSPDVDLVA FTGSTEVGLT VNQSAAQVHA AAQSIKRVIT TLSGHNAIIV DADADLDDAV TGVIESAFSY AGQKCSSCSR VVVIGEIYDE FIKRLVAATS DLKLARAEDP ACQIGPVIDE ESCKRLLALI EDAKQTCEVI LAMETGSLAK KGYFVGPHIF ANIPSESRLN KEEICGPILL VYKAADLSEA LGMANSVPYA LAGGLYSRSP ANLKRAKQQF LAGNLYLNTP VTTGLVARQP FGGFKLSGIG SKTGGPDYLP QFMVPVNVTE NTSRRGFSHE TTTES
|
| |