Gene Rleg_5572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5572 
Symbol 
ID8016463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp154202 
End bp156004 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content62% 
IMG OID644827739 
Productthiamine pyrophosphate protein domain protein TPP-binding 
Protein accessionYP_002978939 
Protein GI241518311 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.307179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAATG CCGCTGACAT ATTGATAGAC ACACTGATTG AATGGGACGT GAAGGTCATC 
TTCGGCCTTC CTGGAGACGG TATTAACGGG GTCATGGAGG CGTTGCGCAA GCGCCAGGAC
CAGATCCGCT TCATCCAGGT CCGGCATGAG GAATCGGCCG CTTTCATGGC GAGTGCCTAC
GCTAAATTCA CCGGAAATCT CGGCGTTTGC CTTGCAACGT CCGGACCAGG CGGCACGAAT
CTGCTGACGG GACTTTATGA TGCCAAGCTC GATCAGATGC CGGTGCTTGC CATCACCGGG
ACGCAGTATC ATGACCTGAT CGAAACCTTT ACCCAGCAGG ACGTCGACTT GACCCGCGTG
TTCGACAATG TCGCGCTCTA CAATGCGCAT GTCAGTGACG CGTCGCACAT GGAGAACGTC
GCCAGCCTCG CCTGCCGCTC GGCGCTCTCG CGCCGGGGCG TCGCACATCT CTCCATCGCC
AACGACGTCC AGGAGATGGA CGGGAAATCG CGATCGAAAC GGAACCGCCC GAAACACGTA
CCGAACCGCT ATTTCCACGG CCGCCAAGTG CCAGAGGAGA TCGAGCTCGA TCGTGCGGCC
CGGATACTGA ACGATGCCCG CAAGGTCGCC ATCCTCGCCG GCCGGGGTGT TGTCGGCGCG
GCGGAGGAAT TGCGCGAGAT CGCCGACCTG CTCGCCGCGC CGGTGGCCAA AGCCCTGCTC
GGCAAGACGG CTCTCGCCGA CGACGATCCC TTGACGACCG GCGGGATTGG AATTCTCGGC
ACCGCTCCCT CGCAGGAGAT CATGGAACAG TGCGACGCGG TGCTGATTGT CGGTTCCTCC
TTTCCCTACA TCGAATATTA CCCGCGGCCC GAAGCCGCTC GCGGCGTGCA GATCGACAGC
GATCCTCAGC GAATTGGCCT ACGATTTCCC GTCGATGCCG GTCTGGTGGG GGACGCGCGG
GAAACGCTAC GGCTCCTTAG GCCTAGGCTG ACCAGGAAAA CAGATCGGTC GTTCCTGGAA
AAGGCGCAAA CGGCGATGTC CGAATGGCGC CGAAAGATGG AGACGATGGA GACGGAGCGG
AGCGCTCCGT TAAAGCCACA GGCGGTGGTT CGAGCTTTCG GCAGGCGCAT CGCGGCCGAT
GGCGTGCTGG TGGCCGATTC CGGCCAGAAC ACCGAGCTGG CGGCGCGCCA CATCGACCTC
CGCACGACAA ACCAGTTCGC AGTCTCCGGC GCGCTCGCCT CGATGGCTTC CGGCCTGCCT
TATGCGATAG CGGCGGGCGC CGCCGATCCG AAGCGGCCAA TCTACGCGGT CATTGGCGAT
GGCGGATTCG GCATGCAGCT CGGCGAGTTC TCGACCGCCG TTCGCATGGC TCTCCCGCTG
AAATTGCTGG TGATCTGCAA CGGCATGCTC AACCAGATCG CCTGGGAGCA GATGATGTTC
CTTGGCAACC CGCAATTTGC CTGCGAACTG GCGCCCATCG ACTTCGCCAA GGCGGCGGAG
GCAATGGGAG GCCGCGGCTT CACAATTCGG CGTTTCGATC AGATAGAGCC GACTTTGACG
GAGGCTTTCT CCGTCGCCGG TCCGGTGGTC ATTCAAGCGA TGGTTGACCA ATACGAGCCG
ATGATGCCTC CGAAGATGCC CAAAGACTAT GCCAAGAGCT TTCGCCAGGC TCTTCCGGAG
ACGCCGGGTC ATCGGGCGAT CGAGGCGAAT GTGTCGCGCT CGCCGCTTCG CGAGATGATG
GACGCCGGAG AGCAGGAAAA GGAACTGTCC GAAAACACGA CCGATTTGCC GGGCATACCT
TGA
 
Protein sequence
MPNAADILID TLIEWDVKVI FGLPGDGING VMEALRKRQD QIRFIQVRHE ESAAFMASAY 
AKFTGNLGVC LATSGPGGTN LLTGLYDAKL DQMPVLAITG TQYHDLIETF TQQDVDLTRV
FDNVALYNAH VSDASHMENV ASLACRSALS RRGVAHLSIA NDVQEMDGKS RSKRNRPKHV
PNRYFHGRQV PEEIELDRAA RILNDARKVA ILAGRGVVGA AEELREIADL LAAPVAKALL
GKTALADDDP LTTGGIGILG TAPSQEIMEQ CDAVLIVGSS FPYIEYYPRP EAARGVQIDS
DPQRIGLRFP VDAGLVGDAR ETLRLLRPRL TRKTDRSFLE KAQTAMSEWR RKMETMETER
SAPLKPQAVV RAFGRRIAAD GVLVADSGQN TELAARHIDL RTTNQFAVSG ALASMASGLP
YAIAAGAADP KRPIYAVIGD GGFGMQLGEF STAVRMALPL KLLVICNGML NQIAWEQMMF
LGNPQFACEL APIDFAKAAE AMGGRGFTIR RFDQIEPTLT EAFSVAGPVV IQAMVDQYEP
MMPPKMPKDY AKSFRQALPE TPGHRAIEAN VSRSPLREMM DAGEQEKELS ENTTDLPGIP