Gene Rleg_1134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1134 
Symbol 
ID8012253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1109938 
End bp1111788 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content64% 
IMG OID644823717 
Productthiamine pyrophosphate protein central region 
Protein accessionYP_002974968 
Protein GI241203872 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.474291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGA CGATACGGTT GACGATGGCG CAAGCCGTCA CGCATTTCCT CAAGGTACAG 
ATGACGATCG TCGACGGCAA GAAAGTGCCG ATCTTCGGCG GCGTCTGGGC GATCTTCGGT
CACGGCAACG TCGCCGGCAT CGGCGAGGCG CTCTATCAGG TGCGCGATGA ATTGACGACC
TATCGCGCTC ACAACGAACA GGGCATGGCG CATGCCGCGA TCGCCTATGC CAAGGCGAAT
TTCCGCACGC GCTTCATGGC CTGCACGAGT TCGATCGGCC CCGGCGCGCT GAACATGGTG
ACGGCCGCCG GCGTGGCGCA TGTCAATCGT ATTCCCGTTC TCTTCCTGCC GGGCGACGTC
TTCGCCAACC GCGCGCCGGA TCCGGTGCTG CAGCAGATCG AGGATTTCGG CGACGGCACG
GTTTCAGCCA ATGATGCCTT CCGTTCGGTT TCGCGCTATT TCGACCGCAT CACCCGGCCC
GAGCAGATCA TCGCGGCGCT GAAGCGCGCC ATGCAGGTCC TGACCGACCC CTTGGATTGC
GGCCCGGTGA CATTGTCGCT CTGCCAGGAC GTTCAGGCAG AAGCCTTCGA TTATCCGGAA
AGCCTGTTCG ATGAAAAAGT CTGGACGACC CGCCGGCCGC AGCCGGATGC AGACGAGCTG
GCGAATGCCA TCGCGCTGAT CAAGGCGTCG CAGAAGCCGG TGATCGTTGC CGGCGGCGGC
GTGCTTTATT CGCAGGCGAC GAAGGAGCTT ACCGCCTTTG CCGAGGCCCA CGGCCTTCCC
GTCGTCGTCA GCCAGGCCGG CAAGTCGGCG ATCAACGAGA CCCACCCGCT GGCACTCGGC
TCGGTCGGCG TCACCGGCAC GTCGGCGGCG AATGCGATCG CCGAAGAGAC GGATCTCGTT
ATCGCCGTCG GCACGCGCTG CCAGGATTTC ACTACCGGCT CCTGGGCGCT GTTCAAGAAT
GACAGCCTGA AGATGATCGG CCTCAATATC GCCGCCTATG ACGCGGTGAA GCACGACAGC
TATCCGCTGG TGACAGACGC CCGCGAAGGG CTGAAGGCGC TTTCGGCCGG ACTTTCGGGC
TGGAAGGCGC CGGCCGCCCT CACCGAGAAG GCGGCTGCGG AAAAGAAGGT CTGGATGGAG
GCTGCGGCCA GGGCAATGGC CACGACCAAT GCCGCCCTGC CCTCCGATGC GCAGGTGATC
GGCGCGGTGG CGCGCACGAT CGGCGGTGAG AATACGACGG TGCTTTGCGC TGCCGGCGGC
CTTCCCGGTG AATTGCACAA GCTCTGGCCG GCGACGGCGC CGGGCAGCTA TCACATGGAA
TACGGCTTTT CCTGCATGGG CTACGAGATC GCCGGCGGGC TTGGCGCCAA GATGGCGCGT
CCCGAACGGG ATGTGGTCGT CATGGTCGGC GACGGTTCCT ACATGATGAT GAATTCCGAG
ATCGCCACCT CGGTCATGCT CGGCCTCAAG CTCAACATCG TCTTGCTCGA CAATCGCGGC
TATGGCTGCA TCAATCGGCT GCAGATGGGA ACCGGCGGCG CCAACTTCAA CAATCTGCTG
AAGGACTCCT ACCACGAGGT GATGCCGGAG ATCGATTTCC GCGCGCATGC CGAAAGCATG
GGCGCCATCG CCGTCAAGGT CGCCTCGATC GCCGAGCTGG AGCAGGCGAT CGCCGACTCG
AGGAAGAACG ATCGCACCTC GGTCTTCGTT ATCGACACCG ATCCGCTTAT TACCACGGAG
GCCGGCGGCC ACTGGTGGGA TGTCGCGGTG CCGGAGGTCA GCCCGCGCGA AGAGGTCAAC
GAAGCGCGCA AGGGCTATGT CGAAGCACGC GCCGCCCAGC GTATCGGCTG A
 
Protein sequence
MGKTIRLTMA QAVTHFLKVQ MTIVDGKKVP IFGGVWAIFG HGNVAGIGEA LYQVRDELTT 
YRAHNEQGMA HAAIAYAKAN FRTRFMACTS SIGPGALNMV TAAGVAHVNR IPVLFLPGDV
FANRAPDPVL QQIEDFGDGT VSANDAFRSV SRYFDRITRP EQIIAALKRA MQVLTDPLDC
GPVTLSLCQD VQAEAFDYPE SLFDEKVWTT RRPQPDADEL ANAIALIKAS QKPVIVAGGG
VLYSQATKEL TAFAEAHGLP VVVSQAGKSA INETHPLALG SVGVTGTSAA NAIAEETDLV
IAVGTRCQDF TTGSWALFKN DSLKMIGLNI AAYDAVKHDS YPLVTDAREG LKALSAGLSG
WKAPAALTEK AAAEKKVWME AAARAMATTN AALPSDAQVI GAVARTIGGE NTTVLCAAGG
LPGELHKLWP ATAPGSYHME YGFSCMGYEI AGGLGAKMAR PERDVVVMVG DGSYMMMNSE
IATSVMLGLK LNIVLLDNRG YGCINRLQMG TGGANFNNLL KDSYHEVMPE IDFRAHAESM
GAIAVKVASI AELEQAIADS RKNDRTSVFV IDTDPLITTE AGGHWWDVAV PEVSPREEVN
EARKGYVEAR AAQRIG