Gene Plim_4195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4195 
Symbol 
ID9140916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5363786 
End bp5366863 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content54% 
IMG OID 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003632202 
Protein GI296124424 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.17632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAC GTAAATCTGC CTCCACTCTT GCTCAACCAC AACCAATCCC TGGGGTTGAT 
GATGCTCGAA TTGAAGCGGA AACAACTCAA CTGGGTGAAT TGATCTGGCG ACGACTCGCT
GCCAGACAAC CATCATTCTT TGAACAGCGC TGGTGGGATG ATCGCATTCT GGCAGCCGCC
ATGAATGATG AATCTCTCAA GGTGCAGATG TTCCGGTTTG TGGATGTGTT GCCGCGACTG
AAAACGCATC AATCGATGAC GCGTCATCTT CAGGAATACT TTGACGAGAT TCGTGAGCAT
CTTCCGTGGG CGATCCAACT GGTCGGATTT GGTGTGGAGC AGGTCGCTCC TAATTCGATT
TTATCCCGAA CACTCGCATT TAATGCCCGG AATAACGCTC AACGGATTGC CAAGCGTCTG
ATGGGTGGTG AATCGACCGA TGATGTTCTC AAGACGATTC ATAAACTTTA TCGACAAGGC
TATCTCTTCT CGCTCAATTA CCTGAGCTCG AAAGTTGTCA GTCAGGCCGA GGCAGATCAG
TACCAGCAGC GATATCTCGA CATGCTTTCG TCATTGGGCG CGGAGGTCAG CAACTGGCCA
GCCCATACGT CATATGCCCA TGCTGCTGCG GTCGAGGGTA AGTCAGCTGC GACTGCGGAG
ATTCCTCAAC TGCAGCTTTC GCTCAAACTG TCGTCGTTAA CCAGCGACTT TCGCCCACTC
GATGCCCAAG GGACCATGCG TGTGGTGCTG GAGCGATTGC GGCCCATTCT CCGTCTGGCC
ATCGAGCAGC AGGCGATTGT GCAGATCGAG CTGGAGCATT CGACGACGAA TCGATTGATT
CTCGATATTG TTCAGCAGGT TCTCTCGGAA AAAGAATTCC AGAGCTGGGC CGATTGCGGC
ATCACTTTGC CGGCCTATCT AAAAACTGCG GAAAGCGATC TGACGGCACT GGCAACCTGG
GTTCAAAAAC GCGGTACACC GATTCGCATC TGCCTGACCA AGGGAGAGTA TTGGAATCAG
GAAATCGCTC TGGCTCAGTC AAAAAGCTGG CCAGTGGCTG TTTTTGAAGA AGAATGGCAG
ATCGACGAGA ACTACGAAAA ACTCTCCCGA GCTTTGATCG ATCAGCCGGA GTTATTCAAG
CCCGTATTTG CAGGCCAGAG TTTAAGAAGC CTCTGTTACG TTCTGGCCTA TGCACAGGCC
CGCAATTTGC CCCGCTCCCG AGTTGAACTG CAATTGCGAT ACGGTTTGGC CGATGAGCAG
GCCCAGGCAT TTGCCGAACT GGGTTGCCGG GTCCGCATTG ATACGCCGGT CGGTCATCAT
GTGAAAGGAA TGGCCCGCCT GGCGCGTCAC TTCCTGGAAA ACTCTGCGAA TGATTCGTTT
CTCAGGCAAG GCTATTCCGC CGAAGTCTCC ATCGAGGATC TGCTTATGAA TCCCACCGTC
GCCGGCCAGA CGGCCCGGGT CCGAAAAACC TCGAAGATGT GGACCGTTCC ACCCCAGGGC
TTCGTGAATG AACCAGTGAC TGATTTTTCA ATGCCTGCTC ACCGTGAAGC GATGCAGGGG
GCAATTGAAA AGGTCGAACA TCAATTCGGG CAAACCTACT CGTTGATCAT CAATGGTCGC
CGGGAAGATA CTCGGCAGAA TCTGACAGTT CGTTCCCCTT CCGATAAGTC GAAAGTGCTG
GGACTGGTCG CTTCAGCAAG TCCGGAGCAG GCTCTGGCAG CCATTGATTC GGCCCGACGG
GCTTTCGTGC GGTGGTCAGT GATTGAAGCC AGTTATCGAG CTGAATATCT CGAACTGATT
GCGCGTGAAC TCAGGCAGCG CCGATTTGAA CTGGCTGCCT GGCAGATCTT TGAATGTGGC
AAGCCCTGGG CGGAAGCCGA TGCTGACGTG GCCGAAGCCA TCGACTTCTG CAATTACTAC
GCGATGCAGA TGCGCGAACT AGCGGAACCC CAAAGGTTCG ATATTGCCGA CGAAGAGAAC
GCTTACTTCT ACCGTCCGCG TGGTGTGGTG GTGGTGATCT CTCCCTGGAA TTTCCCCATG
GCTGTCCTGA CCGGGATGGT CGCTGCTGCC CTGGTGGCTG GAAACACCGT CATCATCAAG
CCTGCTGAAC AGGCTTCGGT CACGGCTGCG AAACTGATGG AAATCCTCCA GGCCTGTGGT
ATTCCGGATG GTGTCATCAC CTTCCTGCCC GGTATTGGGG AAGAGATTGG CCCGGTTCTC
GCTGGGAGCC CTGATGTCGA TCTTGTGGCC TTTACCGGTT CCACGGAAGT CGGACTGACA
GTGAATCAGT CGGCAGCACA GGTCCATGCG GCTGCACAAT CGATCAAGCG GGTCATTACG
ACCTTGAGTG GTCACAATGC TATCATCGTC GATGCCGATG CCGACCTGGA TGATGCCGTC
ACGGGTGTGA TTGAAAGTGC CTTCAGTTAT GCCGGGCAGA AGTGCTCATC GTGCTCGCGA
GTCGTTGTGA TTGGTGAAAT CTACGACGAG TTCATCAAAC GACTCGTGGC CGCCACTTCT
GATTTGAAGC TGGCCCGTGC AGAAGATCCC GCCTGCCAGA TAGGGCCCGT GATTGATGAA
GAATCTTGCA AGCGACTTCT GGCACTCATC GAAGATGCTA AGCAGACCTG CGAAGTGATT
CTTGCGATGG AAACGGGCTC ACTGGCTAAG AAGGGATACT TCGTCGGCCC GCACATCTTT
GCGAATATTC CGAGTGAGTC ACGCCTAAAT AAAGAAGAGA TCTGCGGCCC GATTCTCCTC
GTCTACAAAG CCGCCGATCT TTCGGAAGCC TTGGGGATGG CCAACAGCGT GCCTTATGCT
CTGGCAGGTG GTTTGTATAG TCGCAGCCCG GCGAATCTTA AACGGGCCAA ACAGCAATTT
CTGGCAGGAA ATCTCTATCT GAATACTCCG GTCACGACCG GTCTCGTCGC CCGGCAACCT
TTCGGTGGCT TCAAACTCTC CGGGATTGGC AGCAAAACGG GCGGGCCAGA TTACCTGCCA
CAGTTCATGG TGCCAGTCAA TGTGACAGAA AACACATCCC GACGCGGCTT CAGTCACGAG
ACAACGACTG AAAGTTAA
 
Protein sequence
MAKRKSASTL AQPQPIPGVD DARIEAETTQ LGELIWRRLA ARQPSFFEQR WWDDRILAAA 
MNDESLKVQM FRFVDVLPRL KTHQSMTRHL QEYFDEIREH LPWAIQLVGF GVEQVAPNSI
LSRTLAFNAR NNAQRIAKRL MGGESTDDVL KTIHKLYRQG YLFSLNYLSS KVVSQAEADQ
YQQRYLDMLS SLGAEVSNWP AHTSYAHAAA VEGKSAATAE IPQLQLSLKL SSLTSDFRPL
DAQGTMRVVL ERLRPILRLA IEQQAIVQIE LEHSTTNRLI LDIVQQVLSE KEFQSWADCG
ITLPAYLKTA ESDLTALATW VQKRGTPIRI CLTKGEYWNQ EIALAQSKSW PVAVFEEEWQ
IDENYEKLSR ALIDQPELFK PVFAGQSLRS LCYVLAYAQA RNLPRSRVEL QLRYGLADEQ
AQAFAELGCR VRIDTPVGHH VKGMARLARH FLENSANDSF LRQGYSAEVS IEDLLMNPTV
AGQTARVRKT SKMWTVPPQG FVNEPVTDFS MPAHREAMQG AIEKVEHQFG QTYSLIINGR
REDTRQNLTV RSPSDKSKVL GLVASASPEQ ALAAIDSARR AFVRWSVIEA SYRAEYLELI
ARELRQRRFE LAAWQIFECG KPWAEADADV AEAIDFCNYY AMQMRELAEP QRFDIADEEN
AYFYRPRGVV VVISPWNFPM AVLTGMVAAA LVAGNTVIIK PAEQASVTAA KLMEILQACG
IPDGVITFLP GIGEEIGPVL AGSPDVDLVA FTGSTEVGLT VNQSAAQVHA AAQSIKRVIT
TLSGHNAIIV DADADLDDAV TGVIESAFSY AGQKCSSCSR VVVIGEIYDE FIKRLVAATS
DLKLARAEDP ACQIGPVIDE ESCKRLLALI EDAKQTCEVI LAMETGSLAK KGYFVGPHIF
ANIPSESRLN KEEICGPILL VYKAADLSEA LGMANSVPYA LAGGLYSRSP ANLKRAKQQF
LAGNLYLNTP VTTGLVARQP FGGFKLSGIG SKTGGPDYLP QFMVPVNVTE NTSRRGFSHE
TTTES