Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1134 |
Symbol | |
ID | 8012253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1109938 |
End bp | 1111788 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644823717 |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_002974968 |
Protein GI | 241203872 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.474291 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGA CGATACGGTT GACGATGGCG CAAGCCGTCA CGCATTTCCT CAAGGTACAG ATGACGATCG TCGACGGCAA GAAAGTGCCG ATCTTCGGCG GCGTCTGGGC GATCTTCGGT CACGGCAACG TCGCCGGCAT CGGCGAGGCG CTCTATCAGG TGCGCGATGA ATTGACGACC TATCGCGCTC ACAACGAACA GGGCATGGCG CATGCCGCGA TCGCCTATGC CAAGGCGAAT TTCCGCACGC GCTTCATGGC CTGCACGAGT TCGATCGGCC CCGGCGCGCT GAACATGGTG ACGGCCGCCG GCGTGGCGCA TGTCAATCGT ATTCCCGTTC TCTTCCTGCC GGGCGACGTC TTCGCCAACC GCGCGCCGGA TCCGGTGCTG CAGCAGATCG AGGATTTCGG CGACGGCACG GTTTCAGCCA ATGATGCCTT CCGTTCGGTT TCGCGCTATT TCGACCGCAT CACCCGGCCC GAGCAGATCA TCGCGGCGCT GAAGCGCGCC ATGCAGGTCC TGACCGACCC CTTGGATTGC GGCCCGGTGA CATTGTCGCT CTGCCAGGAC GTTCAGGCAG AAGCCTTCGA TTATCCGGAA AGCCTGTTCG ATGAAAAAGT CTGGACGACC CGCCGGCCGC AGCCGGATGC AGACGAGCTG GCGAATGCCA TCGCGCTGAT CAAGGCGTCG CAGAAGCCGG TGATCGTTGC CGGCGGCGGC GTGCTTTATT CGCAGGCGAC GAAGGAGCTT ACCGCCTTTG CCGAGGCCCA CGGCCTTCCC GTCGTCGTCA GCCAGGCCGG CAAGTCGGCG ATCAACGAGA CCCACCCGCT GGCACTCGGC TCGGTCGGCG TCACCGGCAC GTCGGCGGCG AATGCGATCG CCGAAGAGAC GGATCTCGTT ATCGCCGTCG GCACGCGCTG CCAGGATTTC ACTACCGGCT CCTGGGCGCT GTTCAAGAAT GACAGCCTGA AGATGATCGG CCTCAATATC GCCGCCTATG ACGCGGTGAA GCACGACAGC TATCCGCTGG TGACAGACGC CCGCGAAGGG CTGAAGGCGC TTTCGGCCGG ACTTTCGGGC TGGAAGGCGC CGGCCGCCCT CACCGAGAAG GCGGCTGCGG AAAAGAAGGT CTGGATGGAG GCTGCGGCCA GGGCAATGGC CACGACCAAT GCCGCCCTGC CCTCCGATGC GCAGGTGATC GGCGCGGTGG CGCGCACGAT CGGCGGTGAG AATACGACGG TGCTTTGCGC TGCCGGCGGC CTTCCCGGTG AATTGCACAA GCTCTGGCCG GCGACGGCGC CGGGCAGCTA TCACATGGAA TACGGCTTTT CCTGCATGGG CTACGAGATC GCCGGCGGGC TTGGCGCCAA GATGGCGCGT CCCGAACGGG ATGTGGTCGT CATGGTCGGC GACGGTTCCT ACATGATGAT GAATTCCGAG ATCGCCACCT CGGTCATGCT CGGCCTCAAG CTCAACATCG TCTTGCTCGA CAATCGCGGC TATGGCTGCA TCAATCGGCT GCAGATGGGA ACCGGCGGCG CCAACTTCAA CAATCTGCTG AAGGACTCCT ACCACGAGGT GATGCCGGAG ATCGATTTCC GCGCGCATGC CGAAAGCATG GGCGCCATCG CCGTCAAGGT CGCCTCGATC GCCGAGCTGG AGCAGGCGAT CGCCGACTCG AGGAAGAACG ATCGCACCTC GGTCTTCGTT ATCGACACCG ATCCGCTTAT TACCACGGAG GCCGGCGGCC ACTGGTGGGA TGTCGCGGTG CCGGAGGTCA GCCCGCGCGA AGAGGTCAAC GAAGCGCGCA AGGGCTATGT CGAAGCACGC GCCGCCCAGC GTATCGGCTG A
|
Protein sequence | MGKTIRLTMA QAVTHFLKVQ MTIVDGKKVP IFGGVWAIFG HGNVAGIGEA LYQVRDELTT YRAHNEQGMA HAAIAYAKAN FRTRFMACTS SIGPGALNMV TAAGVAHVNR IPVLFLPGDV FANRAPDPVL QQIEDFGDGT VSANDAFRSV SRYFDRITRP EQIIAALKRA MQVLTDPLDC GPVTLSLCQD VQAEAFDYPE SLFDEKVWTT RRPQPDADEL ANAIALIKAS QKPVIVAGGG VLYSQATKEL TAFAEAHGLP VVVSQAGKSA INETHPLALG SVGVTGTSAA NAIAEETDLV IAVGTRCQDF TTGSWALFKN DSLKMIGLNI AAYDAVKHDS YPLVTDAREG LKALSAGLSG WKAPAALTEK AAAEKKVWME AAARAMATTN AALPSDAQVI GAVARTIGGE NTTVLCAAGG LPGELHKLWP ATAPGSYHME YGFSCMGYEI AGGLGAKMAR PERDVVVMVG DGSYMMMNSE IATSVMLGLK LNIVLLDNRG YGCINRLQMG TGGANFNNLL KDSYHEVMPE IDFRAHAESM GAIAVKVASI AELEQAIADS RKNDRTSVFV IDTDPLITTE AGGHWWDVAV PEVSPREEVN EARKGYVEAR AAQRIG
|
| |