Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2037 |
Symbol | |
ID | 5694880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 2469037 |
End bp | 2470731 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641264638 |
Product | acetolactate synthase, large subunit, biosynthetic type |
Protein accession | YP_001529918 |
Protein GI | 158522048 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.182642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTCA CAGGCGCGCA GATCATGATG AAGGTGCTCA AGGAAGAGAA GGTGGAGACC ATCTTCGGCT ATCCCGGCGG CGCGGTGCTG GATGTTTATA ACGAGCTGCT CAACACCGAT TTTGCACATA TCCTGGTCCG CCAGGAACAG GGGGCCGTTC ACGCGGCCGA CGCCTATGCC CGGGTCTCCG GCAAAACCGG GGTCTGCCTG GTCACCTCCG GCCCCGGTGC CACCAACACC ATCACCGGCA TTGCTTCTGC CTACTGCGAC TCCATTCCGG TGGTGATTTT CACCGGCCAG GTGCCAACGC CCCTGATCGG CAACGACGCC TTTCAGGAGG TGGATATCGT GGGCATCTCC CGGCCCTGCA CCAAGCACAA CTACCTGGTC AAGGATGTCA AAGACCTGGC CGGGGTCATT CGGGAGGCCT TTTACATCGC CCGTTCCGGC CGGCCGGGAC CGGTGCTGAT CGACATGCCC AAGGACGTGA TCAACGCAAA GACCACCTAT GAGCCGCCGA AGCCGATGGC GCTCAAGTCT TACAACCCGA CCTACGAGCC CAACGTCAAA CAGCTGAAAA AGGTGATCGA CCTGGTGAAA ACCGCCAAAA AGCCGGTGAT CTTTTCCGGC GGGGGCATCA TCTTTTCCGG CGCGGCCAAA GAACTGACCC GGTTTGCAAA AAAAGCCCGC ATTCCGGTCA CCTCGACGCT GATGGGTCTG GGCGCCTTTC CGGCCACCGA CGACCTGTGG CTGGGCATGC CGGGCATGCA CGGCACCTAC CGGGCCAACC TGTCCCTTTC CTCATGCGAC CTTCTCATCG CGGTGGGGGT GCGGTTTGAT GACCGGGTTA CCGGCAAGAC CAGCGAGTTT GCGGCCAACG CCACCATTGT GCATATCGAC ATCGACCCCA CCTCCATTCA GAAAAACGTG AGGGTGGCCA TTCCCATCGT GGGCGACTGC AAAGCGGCCA TGGCCCGGCT CAATAAAATG GCTGATGAGG ACAAAGACCT GCTGAGCGAC AAGGTCAAAA AAGAGCGGAC CGCCTGGGCC AAACAGATTG CGGACTGGAA AAAAACCAAG CCCCTGGCCT ACACCCAGAC GGATGTGATC AAACCCCAGT ACGTGGTGGA ACAGCTTTAT GAACTGACAA AGGGCCAGGC CATCATCACC ACCGAGGTGG GCCAGAACCA GATGTGGGCG GCCCAGTATT ACCATTATAC CTGGCCCGGC CAGTTCATCA CCTCCGGCGG ACTGGGCGTG ATGGGGTTCG GCCTGCCCGC GGCCGTGGGC GCCCAGGTGG CGGCCCCGGA CAAAGTGGTC ATTGATATTG CCGGCGACGG CAGCATTCAG ATGAACATTC AGGAGATGAT GACAGCGGTG AGCCACAACC TGCCGGTGAA AATCGCCATT CTCAACAACG GATTCCTCGG CATGGTGCGC CAGTGGCAGG AGCTGTTCTA CGACCGGCGC TATGCCTGGA CCGACATGGC CGCGGCCCCG GACTTTGTCA AGCTGGCCGA GGCCTACGGT GCGGTGGGCC TACGGGCCAC CAAACCCAGC GAGGTGGCGA AGGTGATTAA AAAGGCCCTG GCAACGCCCA AACCGGTGAT CATGGACTTT GTGGTGGAAA AAGAAGAAAA CGTCTATCCC ATGGTGCCGG CCGGCTCTCC CATTACCAAC ATGATCCTTG TATAA
|
Protein sequence | MQFTGAQIMM KVLKEEKVET IFGYPGGAVL DVYNELLNTD FAHILVRQEQ GAVHAADAYA RVSGKTGVCL VTSGPGATNT ITGIASAYCD SIPVVIFTGQ VPTPLIGNDA FQEVDIVGIS RPCTKHNYLV KDVKDLAGVI REAFYIARSG RPGPVLIDMP KDVINAKTTY EPPKPMALKS YNPTYEPNVK QLKKVIDLVK TAKKPVIFSG GGIIFSGAAK ELTRFAKKAR IPVTSTLMGL GAFPATDDLW LGMPGMHGTY RANLSLSSCD LLIAVGVRFD DRVTGKTSEF AANATIVHID IDPTSIQKNV RVAIPIVGDC KAAMARLNKM ADEDKDLLSD KVKKERTAWA KQIADWKKTK PLAYTQTDVI KPQYVVEQLY ELTKGQAIIT TEVGQNQMWA AQYYHYTWPG QFITSGGLGV MGFGLPAAVG AQVAAPDKVV IDIAGDGSIQ MNIQEMMTAV SHNLPVKIAI LNNGFLGMVR QWQELFYDRR YAWTDMAAAP DFVKLAEAYG AVGLRATKPS EVAKVIKKAL ATPKPVIMDF VVEKEENVYP MVPAGSPITN MILV
|
| |