Gene Plim_1972 details


Gene Information

Locus tag: Plim_1972
Symbol:
ID: 9138674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 2562990
End bp: 2566322
Gene Length: 3333 bp
Protein Length: 1110 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: heme-binding protein
Protein accession: YP_003630001
Protein GI: 296122223
COG category:
COG ID:
TIGRFAM ID:
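
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated 3333 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2562990
end_bp = 2566322

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Both values agree with the Gene Length and Protein Length fields of the record.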


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCGAT TGAGTTTTCG AGTCGCCATG CTGGCTTGGG TTTTTGCCGC AGCGTTACCT 
TTGATTCCGG AATCTTTTGG GCAGGAGACC TATTCGCCAC CGATCAGTGA AAAGTCAGAT
GAAGCGGAAC TGGCGAAATC TCGATTCAAA TTCCCCAAAG GCGTTGAGGT CAAGCTGGCT
GCTTCAGAGC CGGAAATTGC CAACCCTGTC GCTTTCTGTT TCGACGAACA GGGAAATCTC
TATGTGGCTG AGACATTCCG GCAGAGCAAA GGTGTAGAAG ACAATCGCAG CCATATGAAC
TGGTTGCAGG ATGACCTTTC TGCACAGACT GTCGCCGACC GCGTGGCCTA CATGCGAAAG
CACATTCCCG ATGCTGACAA GCGCTACACC AAAGAACACG ATCGCATTCG AAAACTCGAA
GATCGCGATG GTGATGGGGT CTACGAAACC GCCACTGTCT TTGCTGATGG GTTCAACGAT
ATCGCAGACG GGACAGGTGC GGGCATACTC GCATGGAATG GAAACGTCTA TTACACCTGC
ATCCCCAAAC TCTGGATGCT CACCGATACC AACGGCGATG GCGTAGCTGA TCAGAAAAAA
GTGCTCAGCG AAGGCTATGG GGTAAGGTTC GCTTTTCGTG GGCACGATTC CCACGGCTTA
GCCATTGGTC CCGATGGCAG GCTCTATTTC AGCATTGGTG ACCGCGGCTA CAACATCACA
ACTCCTGAAG GCAAGCTCGT TAATCCCGAC CGTGGGGCTG TTTTTCGATG CGAACCCGAC
GGCAGCAACC TTGAAGTCTT CGCCACCGGA TTACGCAATC CTCAGGAACT GGCGTTTGAT
AATCGCGGAC GCCTTTTCAC TGGTGACAAC AACTCAGATG GTGGCGACAA AGCCCGTTGG
ACATACGTCA TGCGTCAGAG CGATACCGGC TGGCGCATGA ACTACCAATA TCTCAACGAT
CGTGGTCCCT GGAATCGTGA GAAGCTCTGG CATCCGGCTC ATCCGGAACA GGCTGCCTAC
ATTGTTCCAC CCATTTTGAA CTTTTCCGAT GGGCCATCAG GTCTGACTCA TAATCCGGGA
ACCGGCCTGC CCGCCAAGTA TGACGACTGG TTTTTTCTAG CCGATTTTCG CGGGACACCT
GCTATCAGCG GGATCCGCGC ACTGGTGAAT AAGCCCAAGG GGGCTGGATT TGAGATTGCC
GAATCAGAAA TGTTCATCTG GGGCATTCTC GCGACTGATG TCGATTTCGG ATACGACAGC
AATCTCTACG TCACCGACTG GGTCAACGGT TGGGAAGGCC TCGGTAAAGG CCGCGTTTAT
CGATTTGCCG ATGCTGCAAA CACCTCCGGT AGTGAAGTCG CAAAACTCTT CCGCGAAGGT
TTTACAGATC GATCGATCGA GGATCTGACA AAGCTGTTGT CTCACGCTGA TTATCGCGTG
CGGCAGCGAG CACAGTTTGC CCTGGTTTCC AGGCAAGCCA CCCAAGTGCT GGCTCAACAG
GCGACAAAGA GCTCGTCCAC TTTTGCCAGA CTGCATGCCA TCTGGGGCCT GGGGCAACTG
GCGCGACAAA AACTTCCGAC TGCAGAACTG CTTCTCCCGC TCCTGGCGGA TCCTTCAGCC
GAAGTTCGCG AAGCCACTTG CGTCGCTTTA GGTGATCTTC GCTATCAACC TGCTGCTGCC
CAGTTGGGAA AGCTGCTGAA AGATGCCGAT GCTCATGTCC AGGCGCAGGC AGCGATTGCC
CTGGGCCAGT TGAAAGGGCA CGGTCAGGAA AAAGCCTTGG TTGAAGCTCT GACATTAAGC
AACAACACAG ATCCGTTTTT GCGACACGCT CTGGTTGTGG GTCTGACAGG TGTGGGTGAT
TCCCAGCAGA TTGCGGCACT TTTGAAGAAT CCCTCCCCTG CCGTCCGCCT GGCTGCCGTG
GTGGCATTAA GGCGTATGGA ATCTGCCGAT TTAGGTATGG CCCTCTCCGA TGCCGACCCC
CTCATCGTGC TCGAAGCGGC CAGAGCCATC CATGACCTCC CGACGACGAC TCATCTGGAA
AAACTGGCGG CTCTGCCCAT CGCTGGAACA ACTTCTGACG CGCTTGCCCG CCGGATTCTG
AATGCGAGCT ATCGTATCGG CAACGTTGAA TCAGCCTTAC GGGTGGCACA GGTTGCAGCG
AATTCTTCGG TGTCAGACTC ATTGCGGATC GAAGCGATTG AAGAGCTCCT CAACTGGAAT
AGCCCTGCTG TGCTTGATCG TGTCTTGGGT GATTATCGCC CACTGGCCAC ACGAAATGTG
GAGATTGCTG ACGCCATCAG ACCACTTCTG ACGTCGATGC TTGCCAGCCC GACCAAAGTT
CGAGAAGCAG CCACCAAGCT CGCCGCCAAG TATGGCATCG AGGAAGTTCA GCCCATTCTG
CGAGAAGCGG CTCTCTCGGT GAAAGCCGAA AGTTCCGAAC GATTGGCAGC ACTATCCGCT
TTGAAATCAT TGAAAGATTC TCAGCTGACA GAAATCGCCT CGAAGCTGAT TGATGATGCG
AACCCGGAGG TTCGAGCCGG TGCTGCTGCC GTCCTTGTCA AACTGGATGG TGCTCGATCT
TTGAAATACC TGGAAACATT GACGCCCCAA TCCCCCTCCG TCGAGGTGCA ACAGGGGATC
GCCACGTTGG CCAGTCTGTC GGATGAACAA GCACAGAAAA CGCTGGACGC CTGGTTCCTT
CGACTGGCAG ATCGGTCAGC ACCTCCGGAA GTCTGGCTGG ACTTAATTGA AGCTGCTCAG
AAGAAAAAAT CGGAAGTCTC GAAAAAGGCA CTCGCCAGTT TTGAATCTTC ACGAGACGCC
AATGATCATC TCTCGAAACA TCGTGAACTT GTGGCTGGTG GTGATATCGA ACGTGGTCGA
GATATTTTCT TCAATCGCAG TGAAGTGAGT TGCCAGCGTT GCCACAAAGT CGGTTCTCAA
GGTGGCGAAG TGGGGCCTGT CCTGACAAAA ATTGGTGCCG AAAAATCGTC AGAGTATCTG
CTCGAAGCAA TTGTCGATCC CAATCGAGTG ATTGCCAAGG GCTTTGAAAC AGCCATTCTC
GGTATGGAAG ATGGCCGTGT GCTGGTGGGA ATTATCAAGT CTGAGAGTAA TGGAAAACTG
ATTCTGCAGC CCGCTGAAGG GCAACCCATC ACGGTCAATG TGGCGGAAAT CGAAGAACGC
TCAGTGGGTA AATCCGGCAT GCCCGAAGAT CTTGCGGGCA AGATTTCCCG CCGAGATTTA
AGAGACCTTG TCGCTTACCT GGCAAGTCTG AAGCGGGATG TTGATGCCGC TGCTCACGGC
TCAAGTTCGG CCCATGGTGC GGCACATCCG TAA
 
Protein sequence
MDRLSFRVAM LAWVFAAALP LIPESFGQET YSPPISEKSD EAELAKSRFK FPKGVEVKLA 
ASEPEIANPV AFCFDEQGNL YVAETFRQSK GVEDNRSHMN WLQDDLSAQT VADRVAYMRK
HIPDADKRYT KEHDRIRKLE DRDGDGVYET ATVFADGFND IADGTGAGIL AWNGNVYYTC
IPKLWMLTDT NGDGVADQKK VLSEGYGVRF AFRGHDSHGL AIGPDGRLYF SIGDRGYNIT
TPEGKLVNPD RGAVFRCEPD GSNLEVFATG LRNPQELAFD NRGRLFTGDN NSDGGDKARW
TYVMRQSDTG WRMNYQYLND RGPWNREKLW HPAHPEQAAY IVPPILNFSD GPSGLTHNPG
TGLPAKYDDW FFLADFRGTP AISGIRALVN KPKGAGFEIA ESEMFIWGIL ATDVDFGYDS
NLYVTDWVNG WEGLGKGRVY RFADAANTSG SEVAKLFREG FTDRSIEDLT KLLSHADYRV
RQRAQFALVS RQATQVLAQQ ATKSSSTFAR LHAIWGLGQL ARQKLPTAEL LLPLLADPSA
EVREATCVAL GDLRYQPAAA QLGKLLKDAD AHVQAQAAIA LGQLKGHGQE KALVEALTLS
NNTDPFLRHA LVVGLTGVGD SQQIAALLKN PSPAVRLAAV VALRRMESAD LGMALSDADP
LIVLEAARAI HDLPTTTHLE KLAALPIAGT TSDALARRIL NASYRIGNVE SALRVAQVAA
NSSVSDSLRI EAIEELLNWN SPAVLDRVLG DYRPLATRNV EIADAIRPLL TSMLASPTKV
REAATKLAAK YGIEEVQPIL REAALSVKAE SSERLAALSA LKSLKDSQLT EIASKLIDDA
NPEVRAGAAA VLVKLDGARS LKYLETLTPQ SPSVEVQQGI ATLASLSDEQ AQKTLDAWFL
RLADRSAPPE VWLDLIEAAQ KKKSEVSKKA LASFESSRDA NDHLSKHREL VAGGDIERGR
DIFFNRSEVS CQRCHKVGSQ GGEVGPVLTK IGAEKSSEYL LEAIVDPNRV IAKGFETAIL
GMEDGRVLVG IIKSESNGKL ILQPAEGQPI TVNVAEIEER SVGKSGMPED LAGKISRRDL
RDLVAYLASL KRDVDAAAHG SSSAHGAAHP
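
The relationship between the two sequences can be spot-checked by translating part of the gene sequence. The sketch below is illustrative and not part of the original record: it builds a codon table by hand, on the assumption that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and translates the first and last lines of the gene sequence. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGATCGAT TGAGTTTTCG AGTCGCCATG CTGGCTTGGG TTTTTGCCGC AGCGTTACCT"
last_line = "TCAAGTTCGG CCCATGGTGC GGCACATCCG TAA"

print(translate(first_line))
print(translate(last_line))
```

The two results correspond to the first 20 and last 10 residues of the protein sequence listed above. Both lines can be translated independently here because each 60-base line holds a whole number of codons.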