Gene Dole_2191 details

Gene Information

Locus tag: Dole_2191
Symbol:
ID: 5695037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2662702
End bp: 2665665
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 59%
IMG OID: 641264795
Product: formate C-acetyltransferase
Protein accession: YP_001530072
Protein GI: 158522202
COG category: [C] Energy production and conversion
COG ID: [COG1882] Pyruvate-formate lyase
TIGRFAM ID: [TIGR01774] pyruvate formate-lyase
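
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2662702
end_bp = 2665665

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2964 bp) and Protein Length (987 aa).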


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCAA ACCGTAAAGG CATGACCAAT ATGCTGGCCA ACCTGTTTTT CCGGTTTTTT 
GCCGCCAACT TCAACCTTCG GCCCGCGTTG AACCGTTACC TTTCGTGCGA TGACGGGCCC
ATCAATTTTA CCTTCGGCAT TCGCACCGAA AGCGGCAGCG TGGAACAGGC TGTTCAGTTT
GAAAACGGCC GAGTCCGTGT ACTGAAAAAA ATGCCGGAAA AACCCGACGC TCAACTGGTG
TTTGTGGACG AAGCCGCGGT AAAGGAGGCG GCCACCCAGC CCCCCAACAA GCTGATGCTG
GCCCTGATGG AAAACCGCAT GGTCACCCGG GGCAACCTGG GCTACCTCCA GCTGATGAAC
TTTTACCTGT CGCTGCTGTT AAAAAAGGTA CAGGTGGGAA AGCTGCAAAA AGAGGCCAAA
CGGGAAAAGC GGGATTACGA TCCAGCCGAA GCGTCTCAGA AAAGCATTAA CAAAAAAAAC
AAAGACCGGC TGTCGGCCAC GGCCACGGAC CCCGGGGTCC GGTTTCTTGC CGACCCCTAC
CTGGCGGATT ACAGCCTGGA CGACTTTCCC CGGCTCAAAA CCTTTCTGGA TATTCATTTA
AAGACCCGAC CCGCCCTCTG CCACGAGCGG CCCGAAATCC TGACCCGGTG GTACAGGGAA
AATGGTTTTG AAACCGACAA AAACGGCGAG CCCTGGGTGG CCGAGATTCG CCAGGCCCAT
GCCTTAAAAT ACCTGATGGA AAACCGCAAA CCCCTCATTC GGGAAAACGA CCTGGTGGCT
GGCACCACCA CCGCCCAAGA GATCGGCGTG GTGCTCTACC CGGACGCCCA CGGCACCATG
ATCTGGGGTG AACTGCTTAC CGCCCCCCAC AGGCCCTTAA ACCCCTACGA CGTTTCCCCG
AACACCATTG ACGTGCTGCA CAACTCCGTG TTTCCCTTCT GGCTGAACCG CAATTTCCGC
GAATGGGTGC GGGACAACAA AGGCTATCCT GACTGTCAGA AACTCGACGA GCGGTTTGCC
GCCTGCTTTT TGTGGAAAAC CGTGGCCCTT TCCCACACCA TTCTTGATTA CCCGACCCTG
CTCTCTCTGG GCACAAAAGG CATCATGGCC CAAATCGACG CCGAGATTGG CCGCACCAGC
GAGGCGGACG CGGAAAAGCA CACCACCCTG CGGGCCATGA AGCTGACCCT GGAGGGCATC
AACGCCTACG CGAAAAACCT GGCCGCTGAA GCCCACCGGC TGGCAGGCAT GGAAAAAGAC
CCGGCCCGCA AAAAAGAGCT GGTCCGGCTG GCCGACATCT GCCGAAAGGT GCCGGAAAAC
CCGGCCGCCA CCCTGGATGA AGCGATCAAC GCCATCTGGA TCGTCTGGGT GGGGGTTCAC
ATGGAAAACA CCAACGCCGG TTTTTCCCTG GGCCGCATGG ACCAGTGGCT GCAACCCTAT
TTTGAGGCGG ACATGGCAGC CCTGAAAAAC GAAAACGAAC AGAAGGACTA TATCCGCCGC
GCCATCGAGC TTGTCGGGTG CTTCTACATG CGATGCACCG ACCACCTGCC CCTGATTCCG
GACATCGGCA ACTACCTGTT CGGCGGCAGC TCATCGGACC AGGCCATCAC CCTGGGCGGG
GTGACCCCGG ACGGCGAGGA CGCGGTCAAC GACATGACCT ACATCTTTTT AAAGGTCACG
GAGATGCTCT CCATTCGGGA CCCCAACGTC AATGCCCGGT ACAACCGCGA AAAAAACAGC
GACGCTTATC TTGCGCGGCT CTGCGAGGTG AACCTGAACA CCGCGGCCAC CCCCTCCATT
CACAATGACG AGGCGGTCAT GGCCTCGCTG GCCGAATTCA ACTACCCGGC CGAGCACCTG
CGGGACTGGG CCGCCACCGG GTGCGTGGAG CCCACCCTTT CCGGCCGTCA CATCGGCCAC
ACCAACTGCA TGATGTTCAA CATGGTGGCG GCCCTTGAAA TGGCCTTGAA CAACGGGCGC
CACCCCCTGA TGCGCTGGGA CCTGGGCCCC AAAACAGGGG ACGTGACCAC CGGCCATTTC
AAGGATTTTG AATCGTTTTT CGAGGCCTTT GCAACCCAGC TTGGATTTCT GGCCGACCAG
ACCTGCGAGT ACAACAACCT GCTGGGCCAG GCCCACCAGA CAATCCGGCC CACGCCATAC
ATCTCGGCCC TGATCCAGGG CCCGAAAGAA AAAGGCAGGG ATGTCACCAA AGGCGGGGCC
CTTTACAACT CGTCAGGCGT GGCCTGCATC GGTCTTGCCG ACATCACCGA CTCCATGATG
GTGATCAAGA AACTGGTGTT CGACGGCAAG CAGGTCTCCT TTGGCGATCT TCACCGGGCC
CTGGCCGCCA ATTTTGAAAA CGAACCGGCC CTGCTGGCGA TCATCAAAAA CAAAATTCCC
CTGTTCGGGT CCGGCGACAA AGAGGCCCTG GCCATGGCCA ACCGCATCAT CCGGCTGGCC
CACGACATTT TCGGGGCCCA CACCAACTAC CGGGGCGGCC CCTATACCGC GGGGTTCTGG
TCCATGTCCA ACCACGTGGC CTTCGGCACC CTGACCGGAG CACTGCCGTC GGGACGACTG
GCCGGCAAGG CCTTTACACC GGGCCTGACC CCTGAACCCC ATTCTTCCCC CAGTATTTTA
GACAACCTGC GCGACGTGGC CGGCCTGGAC CCCACCGCCA TGAACAACAA CATCGCCTTT
AACGTCAAGG TGGTGCCGGC GCCGGGCGAG TCCCATGCCC AGACCGTCAA CACCCTCTGC
TCCTATACCA AGGCCTATAC CGGCCTGGGC GGCATGCAGA TGCAACTCAA CGTGGTGTCG
TCGGACACGC TGCGCGACGC CATGGAACAC CCGGAAAACT ACCAGAACCT GCTGGTGCGC
ATATCGGGAT ACAACGCCTA TTTTGTCTCG CTGAACCGGG AGATGCAGAT AGAGCTGATT
GAACGGGCTG AATTCGGCAT GTGA
 
Protein sequence
MNANRKGMTN MLANLFFRFF AANFNLRPAL NRYLSCDDGP INFTFGIRTE SGSVEQAVQF 
ENGRVRVLKK MPEKPDAQLV FVDEAAVKEA ATQPPNKLML ALMENRMVTR GNLGYLQLMN
FYLSLLLKKV QVGKLQKEAK REKRDYDPAE ASQKSINKKN KDRLSATATD PGVRFLADPY
LADYSLDDFP RLKTFLDIHL KTRPALCHER PEILTRWYRE NGFETDKNGE PWVAEIRQAH
ALKYLMENRK PLIRENDLVA GTTTAQEIGV VLYPDAHGTM IWGELLTAPH RPLNPYDVSP
NTIDVLHNSV FPFWLNRNFR EWVRDNKGYP DCQKLDERFA ACFLWKTVAL SHTILDYPTL
LSLGTKGIMA QIDAEIGRTS EADAEKHTTL RAMKLTLEGI NAYAKNLAAE AHRLAGMEKD
PARKKELVRL ADICRKVPEN PAATLDEAIN AIWIVWVGVH MENTNAGFSL GRMDQWLQPY
FEADMAALKN ENEQKDYIRR AIELVGCFYM RCTDHLPLIP DIGNYLFGGS SSDQAITLGG
VTPDGEDAVN DMTYIFLKVT EMLSIRDPNV NARYNREKNS DAYLARLCEV NLNTAATPSI
HNDEAVMASL AEFNYPAEHL RDWAATGCVE PTLSGRHIGH TNCMMFNMVA ALEMALNNGR
HPLMRWDLGP KTGDVTTGHF KDFESFFEAF ATQLGFLADQ TCEYNNLLGQ AHQTIRPTPY
ISALIQGPKE KGRDVTKGGA LYNSSGVACI GLADITDSMM VIKKLVFDGK QVSFGDLHRA
LAANFENEPA LLAIIKNKIP LFGSGDKEAL AMANRIIRLA HDIFGAHTNY RGGPYTAGFW
SMSNHVAFGT LTGALPSGRL AGKAFTPGLT PEPHSSPSIL DNLRDVAGLD PTAMNNNIAF
NVKVVPAPGE SHAQTVNTLC SYTKAYTGLG GMQMQLNVVS SDTLRDAMEH PENYQNLLVR
ISGYNAYFVS LNREMQIELI ERAEFGM
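
The sequences above are printed in blocks of 10 characters, so the spacing has to be removed before any analysis. The sketch below shows one possible way to do that, translate a gene fragment, and compute GC content. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons, which this sketch ignores). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and are not taken from the source database.

```python
# Standard codon assignments, written in the usual T, C, A, G order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGAATGCAA ACCGTAAAGG CATGACCAAT ATGCTGGCCA ACCTGTTTTT CCGGTTTTTT"
last_line = "GAACGGGCTG AATTCGGCAT GTGA"

print(translate(first_line))  # first 20 residues of the protein
print(translate(last_line))   # last 7 residues, stop codon excluded
```

Passing the full gene sequence to `gc_percent` would give a figure to compare against the listed GC content; the function reports an unrounded percentage.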