Gene Information |
Locus tag | Mext_4864 |
Symbol | |
ID | 5835035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5434826 |
End bp | 5436601 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641370662 |
Product | acetolactate synthase, large subunit, biosynthetic type |
Protein accession | YP_001642303 |
Protein GI | 163854260 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
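The coordinate and length fields above are related by simple arithmetic, sketched below. The sketch assumes the start and end positions are 1-based and inclusive (which agrees with the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(5434826, 5436601)
print(length, protein_length_aa(length))  # 1776 591
```

The printed values match the Gene Length and Protein Length fields of the record.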
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG AGGTCATGAC CGGGGCCCAG ATGGTCATCC GGGCGTTCCA GGATCAGGGC GTGGATACCC TGTTCGGCTA TCCGGGCGGC GCGGTGCTGC CGATCTACGA CGCGCTCTTC AACGAGACCA AGATCCAGCA CGTGCTGGTG CGCCATGAGC AGGGCGCGGT CCACGCGGCC GAGGGCTATG CCCGTTCGTC GGGCAAGGTC GGCGTCGTCC TCGTCACGTC CGGCCCCGGT GCCACCAACA TCGTCACTGG CCTCACCGAC GCGATGCTGG ACTCGATCCC GCTCGTCGCG GTGACGGGTC AGGTTCCGAC GCACCTGATC GGCTCCGACG CGTTCCAGGA ATGCGACACG GTCGGCATCA CCCGGCACTG CACGAAGCAC AACTACCTCG TCAAATCGAT CGAGGACCTG CCGCGCATCC TGCACGAGGC GTTCTACGTC GCCAGCCATG GGCGCCCCGG CCCGGTCGTG ATCGACCTGC CGAAGGACAT CCAGTTCGCG AGCGGCGTCT ATTCCCGCCC GGCGCAGGAC GGCCACAAGA CCTACAACCC GCCGGTGAAG GGCGATTCCG ACAAGATCCG CGCAGCGGTC GAGCTGATGG CGAGCGCGCG CCGTCCGGTC TTCTACACCG GCGGCGGCGT CATCAATTCG GGCCCGGAAG CCTCGCGGCT GCTGCGCGAG CTGGTCGCCG AGACCGGCTT CCCGGTCACG TCCACGCTGA TGGGCCTCGG TGCCTTCCCG GCTTCCGATG ACAAGTTCCT CGGAATGCTG GGCATGCACG GTACCTACGA GGCCAACCTC GCGATGCATG ACTGCGACGT GATGATCAAC ATCGGTGCGC GCTTCGACGA CCGCATCACC GGCCGCCTCG ACGCCTTCGC GCCGTTCTCG AAGAAGATCC ACGTCGATGT CGACGCCTCC TCGATCAACA AGGTCGTGAA GGTCGATGTC GGCATCCTCG GCGATTGCGC CGCGGTGCTG GAGGAGATGC TCGCCCAGTG GCGCGCGCTG CCGAAGCAGC CCGACAAGGA TCGGATGGAG GACTGGTTCA ACAAGATCAG CCGCTGGAAG TCGCGCGACT GCCTCGCCTA CTGGCCGTCG GGCACGATCA TCAAGCCGCA ATACGCGGTG CAGCGGCTCT ACGAGGCGTG CAAGGATCGC AAGACCTTCG TCACGACCGA GGTCGGCCAG CACCAGATGT GGGCGGCGCA GTACTTCAAG TTCGATGAGC CGAACCGCTG GATGACCTCG GGTGGCCTCG GCACCATGGG CTACGGCCTG CCGGCGGCGA TCGGCACCCA GCTCGCCAAC CCGGACGGCC TCGTCATCGA CATCGCGGGC GAAGCCTCGA TCCTGATGAA CATGCAGGAG ATGTCGACGG CGGTGCAGTA CCGGCTGCCG GTCAAGATCT TCATCCTGAA CAACGAGTAT ATGGGCATGG TGCGCCAGTG GCAGGAGCTG CTGCACGGCT CGCGCTACTC GCAGAGCTAC TCGGAGAGCT TGCCGGACTT CGTGAAGCTC GCCGAGGCCT ACGGCGCCAA GGGTATCCGT TGCGAGAAGC CCGGTGAGCT CGATGCGGCG ATCCAGGAGA TGCTCGACTA CGACGGTCCG GTCATCTTCG ACTGCATCGT CGACAAGAAG GAAAACTGCT TCCCGATGAT CCCGTCGGGC AAGGCACACA ACGAGATGCT GCTGTCCGAC TATCTCGGTG AGACCGGGGT GGAGATCGGC GACGTCATCT CGGCCGAGGG CAAGATGCTG GTCTGA
Protein sequence | MSGEVMTGAQ MVIRAFQDQG VDTLFGYPGG AVLPIYDALF NETKIQHVLV RHEQGAVHAA EGYARSSGKV GVVLVTSGPG ATNIVTGLTD AMLDSIPLVA VTGQVPTHLI GSDAFQECDT VGITRHCTKH NYLVKSIEDL PRILHEAFYV ASHGRPGPVV IDLPKDIQFA SGVYSRPAQD GHKTYNPPVK GDSDKIRAAV ELMASARRPV FYTGGGVINS GPEASRLLRE LVAETGFPVT STLMGLGAFP ASDDKFLGML GMHGTYEANL AMHDCDVMIN IGARFDDRIT GRLDAFAPFS KKIHVDVDAS SINKVVKVDV GILGDCAAVL EEMLAQWRAL PKQPDKDRME DWFNKISRWK SRDCLAYWPS GTIIKPQYAV QRLYEACKDR KTFVTTEVGQ HQMWAAQYFK FDEPNRWMTS GGLGTMGYGL PAAIGTQLAN PDGLVIDIAG EASILMNMQE MSTAVQYRLP VKIFILNNEY MGMVRQWQEL LHGSRYSQSY SESLPDFVKL AEAYGAKGIR CEKPGELDAA IQEMLDYDGP VIFDCIVDKK ENCFPMIPSG KAHNEMLLSD YLGETGVEIG DVISAEGKML V
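The protein sequence can be cross-checked against the gene sequence by translating it with the amino acid assignments of translation table 11, and the GC content field can be recomputed by counting bases. The following is a minimal sketch, not the database's own method. It assumes an unambiguous A/C/G/T sequence that begins with ATG (alternative start codons permitted by table 11 are not handled), and the helper names are hypothetical. Only the first 30 bases of the gene sequence are used here to keep the example short.

```python
# Amino acid assignments for NCBI translation table 11, with codons ordered
# by base in the sequence T, C, A, G at each position. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that separate 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGAGCGGCG AGGTCATGAC CGGGGCCCAG"
print(translate(prefix))             # MSGEVMTGAQ
print(round(gc_content(prefix), 2))  # 0.7
```

The translated prefix agrees with the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would allow the Protein Length and GC content fields to be checked in the same way.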