Gene Information |
Locus tag | Rleg_2791 |
Symbol | |
ID | 8013736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2769229 |
End bp | 2771019 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644825362 |
Product | acetolactate synthase 3 catalytic subunit |
Protein accession | YP_002976591 |
Protein GI | 241205495 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
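The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2769229
END_BP = 2771019


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # matches the Gene Length field (1791 bp)
print(length_aa, "aa")  # matches the Protein Length field (596 aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.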
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.111041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.823193 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGG ACAATCAGGC GGCAGGCAAT CGGATGACGG GAGCGGAGAT CGTTCTCAAG GCGCTGAAGG ACAATGGCGT CGAACATATT TTCGGCTATC CCGGCGGCGC GGTCCTGCCG ATCTATGACG AGATCTTCCA GCAGGAAGAT GTCAAGCACA TCCTCGTCCG CCACGAGCAG GGGGCAGGCC ATGCGGCCGA AGGTTACGCC CGCTCCACCG GCAAGGTCGG CGTCATGCTG GTCACCTCGG GTCCGGGCGC TACCAATGCC GTCACGCCGC TGCAGGACGC GCTGATGGAT TCGATCCCGC TCGTCTGCCT GACCGGCCAG GTTCCGACCC CACTGATCGG CTCCGACGCC TTCCAGGAAT GCGATACGGT CGGCATCACC CGGCCCTGCA CCAAGCACAA CTGGCTGGTC AAGGATGTCA ACCAGCTTGC CGCCGTCATT CACGAGGCCT TCCGCATCGC CCAGTCCGGC CGTCCAGGCC CCGTCGTCGT CGATATTCCG AAAGACGTGC AGTTTGCGAC CGGCACCTAT ACGCCGCCCG CCGATTACGC GATTCAGAAG AGCTACCAGC CGAAGATCCA GGGCGACCTC AACCAGATCC ATGCCGCGAT CGAACTGATG GCGAATGCGC GCCGTCCGAT CATCTATTCC GGCGGCGGCG TCATCAATTC CGGCCCCGAG GCTTCCAAGC TGCTGCGCGA GCTGGTCGAG CTCACCGATT TCCCGATCAC CTCGACGCTG ATGGGCCTCG GCGCCTATCC TGCTTCGGGC AAGAACTGGC TGAAGATGCT CGGCATGCAC GGCTCCTACG AAGCCAACAT GGCGATGCAC GACTGCGACG TCATGGTCTG CATCGGCGCC CGTTTCGACG ACCGCATCAC CGGCCGTCTC AATGCCTTTT CGCCGAACTC GAAGAAGATC CATATCGATA TCGATCCATC CTCGATCAAC AAGAACGTCC GAGTCGATAT CGGCATCCGC GGCGATGTCG GCCATGTCCT CGAAGACATG GTCCGCCTGT GGCGGGCGCT GCCGAAGAAG CCGGAGAAGG GTCGCCTCGA CGACTGGTGG ACCGATATCG CCCGCTGGCG GGCACGCAAC TCCTTCGCCT ATACGAAGAG CAATGACGTC ATCATGCCCC AATATGCGCT GGAGCGGCTC TTTGCCCACA CCAAGGACCG GGATACCTAC ATCACCACCG AGGTCGGCCA GCACCAGATG TGGGCGGCGC AGTTCTTCGG TTTCGAGCAG CCGAACCGCT GGATGACCTC GGGCGGCCTC GGTACGATGG GCTACGGCCT GCCGGCCGCG CTCGGCGTGC AGATCGCCCA TCCCGACAGC CTCGTCATCG ACATTGCCGG CGACGCCTCG ATCCAGATGT GTATCCAGGA AATGTCGGCG GCGATCCAGC ACGACGCGCC GATCAAGATC TTCATCATGA ACAACCAATA TATGGGCATG GTGCGTCAGT GGCAGCAGCT GTTGCACGGC AATCGCCTGT CGAATTCCTA TACGGAGGCG ATGCCCGATT TCGTCAAGCT GGCGGAGGCC TATGGTGCCG TCGGCCTGCG CTGCGAAAAG CCGGATGCGC TTGATGACAC CATTCTGGAG ATGATCGAGG TCAGAAAGCC TGTCATCTTC GATTGCCGTG TCGCCAATCT CGCCAATTGC TTCCCGATGA TCCCCTCGGG CAAGGCCCAT AACGAAATGC TGTTGCCGGA CGAAGCCACC GACGAAGCGG TCGCCAATGC GATCGACGCC AAGGGCCGCG CGCTCGTCTG A
|
Protein sequence | MSTDNQAAGN RMTGAEIVLK ALKDNGVEHI FGYPGGAVLP IYDEIFQQED VKHILVRHEQ GAGHAAEGYA RSTGKVGVML VTSGPGATNA VTPLQDALMD SIPLVCLTGQ VPTPLIGSDA FQECDTVGIT RPCTKHNWLV KDVNQLAAVI HEAFRIAQSG RPGPVVVDIP KDVQFATGTY TPPADYAIQK SYQPKIQGDL NQIHAAIELM ANARRPIIYS GGGVINSGPE ASKLLRELVE LTDFPITSTL MGLGAYPASG KNWLKMLGMH GSYEANMAMH DCDVMVCIGA RFDDRITGRL NAFSPNSKKI HIDIDPSSIN KNVRVDIGIR GDVGHVLEDM VRLWRALPKK PEKGRLDDWW TDIARWRARN SFAYTKSNDV IMPQYALERL FAHTKDRDTY ITTEVGQHQM WAAQFFGFEQ PNRWMTSGGL GTMGYGLPAA LGVQIAHPDS LVIDIAGDAS IQMCIQEMSA AIQHDAPIKI FIMNNQYMGM VRQWQQLLHG NRLSNSYTEA MPDFVKLAEA YGAVGLRCEK PDALDDTILE MIEVRKPVIF DCRVANLANC FPMIPSGKAH NEMLLPDEAT DEAVANAIDA KGRALV
|
| |
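The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for internal codons. The alternative start codons that table 11 permits are not handled, since this gene starts with ATG. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGAGCACGG ACAATCAGGC GGCAGGCAAT"))
print(translate("AAGGGCCGCG CGCTCGTCTG A"))
```

The two fragments correspond to the first ten and last six residues of the protein sequence listed in the record.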