Gene Rleg_6797 details


Gene Information

Locus tag: Rleg_6797
Symbol:
ID: 8022727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 238422
End bp: 239627
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 65%
IMG OID: 644833664
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_002984798
Protein GI: 241666714
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02430] beta-ketoadipyl CoA thiolase
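
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS whose last codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 238422, End bp 239627.
gene_len = gene_length_bp(238422, 239627)
print(gene_len)                     # 1206
print(protein_length_aa(gene_len))  # 401
```

Both results agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields listed in the record.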


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAG CCTTTATCTG CGACTATATC AGAACGCCGA TCGGCCGTTT CGCCGGCTCG 
CTCTCCCAGG TGCGGGCCGA CGATCTCGGT GCCATCCCGC TGAAGGCACT GATGCAACGA
AATGCCGCCG TCGATTGGGA AGCCGTCGAC GATGTGATCT TCGGCTGCGC CAACCAGGCG
GGTGAGGACA ACCGCAATGT CGCGCGCATG TCGGCTCTGC TCGCCGGCCT GCCGATCGCC
GTCCCCGGCA CGACGATCAA CCGGCTCTGC GGCTCCGGCA TGGATGCGGT GATCACGGCC
GCACGCGCCA TCCGCGCGGG CGAAGCCGAG CTGATGATCG CTGGTGGCGT CGAGAGCATG
TCACGCGCGC CGTTCGTCAT GCCGAAGGCC GAGACGGCCT TTTCACGGGC CGCCGAAATC
CATGACACGA CGATCGGCTG GCGCTTCGTC AACCCGCTGA TGAAGAAGCA GTACGGCGTC
GATTCCATGC CGGAGACTGG CGAGAATGTC GCCGAGGACT ATCATGTCAG CCGCGAGGAT
CAGGATGCCT TCGCGGTGCG AAGCCAGGCG AAGGCGGCCG CTGCCCAGGC GAACGGACGG
TTGGCGAAGG AGATCACCCC GGTGACCATC TCGCAGCGCA AGGGCGATCC TGTTATCGTC
GACAAGGACG AGCATCCGCG CGCAACCACG ATCGAAACGC TGGCGAAACT CGCGACACCT
TTCAAAAAAG AAGGCGGCAC GGTGACAGCA GGCAATGCCT CCGGCGTCAA TGACGGGGCG
GCGGCGCTGA TCGTCGCTTC GGAAGCGGCG GCGCGGAAAT ACGGCCTGAC GCCGATCGCC
CGCATCCTCG GCGGCGCGGC TGCCGCCGTT CCGCCAAGGG TGATGGGCGT CGGGCCGATC
CCGGCCTCGC GCAAGCTGAT GGCACGGCTC GGCATGACCG CGGATCAGTT CGACGTGATC
GAACTCAACG AGGCCTTTGC CAGCCAGGGG CTGGCGGTGC TGCGCGCGCT CGGCATTGCC
GATGATGATG CGCGGGTGAA CCGCAATGGC GGCGCGATCG CGCTCGGCCA TCCGCTTGGC
ATGTCGGGTG CACGCATCAC CGGCACGGCT GCCCTCGAGC TTTTGCAGAC CGGCGGACAA
TATTCGCTGT CGACCATGTG CATCGGCGTC GGGCAGGGGA TTGCGATAGC ACTTAAAAGG
GTTTGA
 
Protein sequence
MTEAFICDYI RTPIGRFAGS LSQVRADDLG AIPLKALMQR NAAVDWEAVD DVIFGCANQA 
GEDNRNVARM SALLAGLPIA VPGTTINRLC GSGMDAVITA ARAIRAGEAE LMIAGGVESM
SRAPFVMPKA ETAFSRAAEI HDTTIGWRFV NPLMKKQYGV DSMPETGENV AEDYHVSRED
QDAFAVRSQA KAAAAQANGR LAKEITPVTI SQRKGDPVIV DKDEHPRATT IETLAKLATP
FKKEGGTVTA GNASGVNDGA AALIVASEAA ARKYGLTPIA RILGGAAAAV PPRVMGVGPI
PASRKLMARL GMTADQFDVI ELNEAFASQG LAVLRALGIA DDDARVNRNG GAIALGHPLG
MSGARITGTA ALELLQTGGQ YSLSTMCIGV GQGIAIALKR V
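
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is one way to do both in plain Python, assuming the sequence is pasted as a string with the 10-base groups and line breaks left in. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the standard compact codon string is used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
first_30 = "ATGACCGAAG CCTTTATCTG CGACTATATC"
print(translate(first_30))  # MTEAFICDYI
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow a comparison with the listed protein sequence and the 65% GC content field. The sketch does not handle ambiguous bases such as N.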