Gene Rleg_3617 details


Gene Information

Locus tag: Rleg_3617
Symbol:
ID: 8014467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3657492
End bp: 3658544
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 63%
IMG OID: 644826181
Product: Threonine aldolase
Protein accession: YP_002977401
Protein GI: 241206305
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2008] Threonine aldolase
TIGRFAM ID:
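
The start and end coordinates, gene length, and protein length in the record above are mutually consistent. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon. Both are assumptions, not statements taken from the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3657492
end_bp = 3658544

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1053
print(protein_length_aa)  # 350
```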


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCTTTG CTTCCGATAA TTGGGCCGGC GCCCACAAAT CCATTGCCGA ACGTCTGCTG 
ACGGAATCGA CCGGCTTTGC CGCCGCCTAT GGCGCCGGCG ATCTCGACAG GAAGGTCGAG
GCCCGTTTTT CCGAGATCTT CGAGCGCGAG GTTTCGGTTT TCTTCGTCGC CACCGGCACG
GCCGCCAACT CCCTGTCGCT GGCAAGCGTC CAGCGGCCCG GCGGCATCAC CTTCTGCCAT
TCGGAGGCCC ATGTGATCGA GGATGAATGC GGCGCGCCGG AATTTTTCTC CGGCGCTGCC
CGTCTCGTTG CCATTGACGG CGAAGCCGGC AAGATCGATC CGGCGAAGCT TTCGGCGAAG
ATCGCAAGCT TTCCCGAAGA CGCCGTCCAT CATGGCCGCG CCAGCGCGGT GACCATCACC
CAGGCGACTG AGATCGGCAC CGTCTATTCC TTGCCGGAGA TCGGCGAGAT CGCCGCCATA
TCAAGGAAGC GCAATCTGCC GCTCCACATG GATGGTGCCC GCTTTGCCAA CGCCCTGGTC
GCACTCGGCG CCACCCCGGC CGAAATGACC TGGAAGCGCG GCGTCGACAT GCTGTCCTTC
GGCGGCACCA AGAACGGCTG CTGGTGTGCC GAAGCGATCG TCTTCTTCAA TCCGGATCGA
GCCCGGGAAA TGCCCTTCAT CCGCAAACGC GCTGCCCAGC TCTTCTCCAA GTCGCGTTTC
ATCGCCGCCC AGTTCGATGC CTATTTCGAA GATGGCCTCT GGCTCGATCT CGCCCGCCAT
TCGAACGGCA TGGCCGACCG GCTGCGCGCC GGCATCGGCA CGAGCAACTC CGCCCGCCTC
GCCTGGCCGA CCGCATCCAA CGAAGTCTTT GCCGTCGTCA GCAAGAGTGC CGTGAAAATT
GCGGAGGAAA AGGGCGCGAA GTTTTACGAA TGGCCGGTCC CGGCGGCAAC GCCCGAGCTC
GTTTCCGAAA GCGAAACCCT GATCCGCCTC GTCACCAGCT TCGCGACCAC CGAAGCGGAT
GTCGATGGCT TCTTGAAGTG CCTGGCCGCC TGA
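
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below is one possible way to strip that grouping and compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last lines of the listing are used as sample input; to check the full gene, paste in the whole listing.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First and last lines of the gene sequence listing, copied verbatim.
first_line = "ATGTTCTTTG CTTCCGATAA TTGGGCCGGC GCCCACAAAT CCATTGCCGA ACGTCTGCTG"
last_line = "GTCGATGGCT TCTTGAAGTG CCTGGCCGCC TGA"

print(clean_sequence(first_line)[:3])   # start codon: ATG
print(clean_sequence(last_line)[-3:])   # final codon: TGA
print(round(gc_percent(first_line), 1))  # GC percentage of the first 60 bases only
```

Applied to the complete listing, `gc_percent` should give a value that can be compared with the GC content reported in the record.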
 
Protein sequence
MFFASDNWAG AHKSIAERLL TESTGFAAAY GAGDLDRKVE ARFSEIFERE VSVFFVATGT 
AANSLSLASV QRPGGITFCH SEAHVIEDEC GAPEFFSGAA RLVAIDGEAG KIDPAKLSAK
IASFPEDAVH HGRASAVTIT QATEIGTVYS LPEIGEIAAI SRKRNLPLHM DGARFANALV
ALGATPAEMT WKRGVDMLSF GGTKNGCWCA EAIVFFNPDR AREMPFIRKR AAQLFSKSRF
IAAQFDAYFE DGLWLDLARH SNGMADRLRA GIGTSNSARL AWPTASNEVF AVVSKSAVKI
AEEKGAKFYE WPVPAATPEL VSESETLIRL VTSFATTEAD VDGFLKCLAA