Gene Dole_2037 details


Gene Information

Locus tag: Dole_2037
Symbol:
ID: 5694880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2469037
End bp: 2470731
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 60%
IMG OID: 641264638
Product: acetolactate synthase, large subunit, biosynthetic type
Protein accession: YP_001529918
Protein GI: 158522048
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID: [TIGR00118] acetolactate synthase, large subunit, biosynthetic type
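
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2469037
END_BP = 2470731
GENE_LENGTH_BP = 1695
PROTEIN_LENGTH_AA = 564


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1695
print(expected_protein_length(GENE_LENGTH_BP))  # 564
```

Both results agree with the Gene Length and Protein Length fields of the record.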


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.182642
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTCA CAGGCGCGCA GATCATGATG AAGGTGCTCA AGGAAGAGAA GGTGGAGACC 
ATCTTCGGCT ATCCCGGCGG CGCGGTGCTG GATGTTTATA ACGAGCTGCT CAACACCGAT
TTTGCACATA TCCTGGTCCG CCAGGAACAG GGGGCCGTTC ACGCGGCCGA CGCCTATGCC
CGGGTCTCCG GCAAAACCGG GGTCTGCCTG GTCACCTCCG GCCCCGGTGC CACCAACACC
ATCACCGGCA TTGCTTCTGC CTACTGCGAC TCCATTCCGG TGGTGATTTT CACCGGCCAG
GTGCCAACGC CCCTGATCGG CAACGACGCC TTTCAGGAGG TGGATATCGT GGGCATCTCC
CGGCCCTGCA CCAAGCACAA CTACCTGGTC AAGGATGTCA AAGACCTGGC CGGGGTCATT
CGGGAGGCCT TTTACATCGC CCGTTCCGGC CGGCCGGGAC CGGTGCTGAT CGACATGCCC
AAGGACGTGA TCAACGCAAA GACCACCTAT GAGCCGCCGA AGCCGATGGC GCTCAAGTCT
TACAACCCGA CCTACGAGCC CAACGTCAAA CAGCTGAAAA AGGTGATCGA CCTGGTGAAA
ACCGCCAAAA AGCCGGTGAT CTTTTCCGGC GGGGGCATCA TCTTTTCCGG CGCGGCCAAA
GAACTGACCC GGTTTGCAAA AAAAGCCCGC ATTCCGGTCA CCTCGACGCT GATGGGTCTG
GGCGCCTTTC CGGCCACCGA CGACCTGTGG CTGGGCATGC CGGGCATGCA CGGCACCTAC
CGGGCCAACC TGTCCCTTTC CTCATGCGAC CTTCTCATCG CGGTGGGGGT GCGGTTTGAT
GACCGGGTTA CCGGCAAGAC CAGCGAGTTT GCGGCCAACG CCACCATTGT GCATATCGAC
ATCGACCCCA CCTCCATTCA GAAAAACGTG AGGGTGGCCA TTCCCATCGT GGGCGACTGC
AAAGCGGCCA TGGCCCGGCT CAATAAAATG GCTGATGAGG ACAAAGACCT GCTGAGCGAC
AAGGTCAAAA AAGAGCGGAC CGCCTGGGCC AAACAGATTG CGGACTGGAA AAAAACCAAG
CCCCTGGCCT ACACCCAGAC GGATGTGATC AAACCCCAGT ACGTGGTGGA ACAGCTTTAT
GAACTGACAA AGGGCCAGGC CATCATCACC ACCGAGGTGG GCCAGAACCA GATGTGGGCG
GCCCAGTATT ACCATTATAC CTGGCCCGGC CAGTTCATCA CCTCCGGCGG ACTGGGCGTG
ATGGGGTTCG GCCTGCCCGC GGCCGTGGGC GCCCAGGTGG CGGCCCCGGA CAAAGTGGTC
ATTGATATTG CCGGCGACGG CAGCATTCAG ATGAACATTC AGGAGATGAT GACAGCGGTG
AGCCACAACC TGCCGGTGAA AATCGCCATT CTCAACAACG GATTCCTCGG CATGGTGCGC
CAGTGGCAGG AGCTGTTCTA CGACCGGCGC TATGCCTGGA CCGACATGGC CGCGGCCCCG
GACTTTGTCA AGCTGGCCGA GGCCTACGGT GCGGTGGGCC TACGGGCCAC CAAACCCAGC
GAGGTGGCGA AGGTGATTAA AAAGGCCCTG GCAACGCCCA AACCGGTGAT CATGGACTTT
GTGGTGGAAA AAGAAGAAAA CGTCTATCCC ATGGTGCCGG CCGGCTCTCC CATTACCAAC
ATGATCCTTG TATAA
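
The GC content field of the record (60%) is a property of the full gene sequence. As a hedged illustration of how such a figure could be computed, the sketch below defines a hypothetical helper, `gc_percent`, and applies it only to the first 30 bases of the sequence above, so its result is not expected to match the whole-gene value.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters; this drops the spaces and line breaks
    # used to lay out the sequence in blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First three 10-base blocks of the gene sequence.
fragment = "ATGCAATTCA CAGGCGCGCA GATCATGATG"
print(gc_percent(fragment))  # 50.0
```

Passing the complete 1695 bp sequence to the same function would give the whole-gene percentage for comparison with the reported value.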
 
Protein sequence
MQFTGAQIMM KVLKEEKVET IFGYPGGAVL DVYNELLNTD FAHILVRQEQ GAVHAADAYA 
RVSGKTGVCL VTSGPGATNT ITGIASAYCD SIPVVIFTGQ VPTPLIGNDA FQEVDIVGIS
RPCTKHNYLV KDVKDLAGVI REAFYIARSG RPGPVLIDMP KDVINAKTTY EPPKPMALKS
YNPTYEPNVK QLKKVIDLVK TAKKPVIFSG GGIIFSGAAK ELTRFAKKAR IPVTSTLMGL
GAFPATDDLW LGMPGMHGTY RANLSLSSCD LLIAVGVRFD DRVTGKTSEF AANATIVHID
IDPTSIQKNV RVAIPIVGDC KAAMARLNKM ADEDKDLLSD KVKKERTAWA KQIADWKKTK
PLAYTQTDVI KPQYVVEQLY ELTKGQAIIT TEVGQNQMWA AQYYHYTWPG QFITSGGLGV
MGFGLPAAVG AQVAAPDKVV IDIAGDGSIQ MNIQEMMTAV SHNLPVKIAI LNNGFLGMVR
QWQELFYDRR YAWTDMAAAP DFVKLAEAYG AVGLRATKPS EVAKVIKKAL ATPKPVIMDF
VVEKEENVYP MVPAGSPITN MILV
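
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the usual NCBI ordering of bases (T, C, A, G), whose amino acid assignments are the same for table 11 as for the standard code, and it ignores alternative start codons (this gene begins with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code; '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace between blocks."""
    dna = "".join(seq.upper().split())
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop at the first stop codon
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence in this record.
print(translate("ATGCAATTCA CAGGCGCGCA GATCATGATG"))  # MQFTGAQIMM
print(translate("ATGATCCTTG TATAA"))                  # MILV
```

The two outputs match the first ten and last four residues of the protein sequence listed above.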