Gene Rleg_0002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0002 
Symbol 
ID8011254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1743 
End bp2825 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content62% 
IMG OID644822593 
ProductSaccharopine dehydrogenase 
Protein accessionYP_002973853 
Protein GI241202757 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1748] Saccharopine dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.717423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0043893 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTTTCG AAAAGATCGC CGTTCTCGGC CTGGGCAAGG TCGGACGGCT GGCGGCGACG 
CTGTTGCATG AAGGCGGCTT CGAGGTCATC GGCGTCGATG CGCAATTGCC GCTGAGCGAC
GTCCCCTTCA AGTGCCGCAT CGGCGATATC TCCGATCCTC AAGTGATCGG CGAACTGCTC
TCGAATGTCG AGGCGGTGCT GTCCTGCCTG CCCTATCATT TGAATATCGA GCTGGCGCGC
GCCGCCCATC TTGCCGGCAT TCATTATTTC GATCTGACCG AAGACGTTCC GACCACCAAT
TTCATCATCG AGCTGTCGAA GACAGCCCGC GGCCTGATGG CGCCGCAATG CGGCCTGGCG
CCGGGTTTCG TCGGCATCAT CGGTGCAAGC CTGGCCGACG GCTTCGATCG CTGCCGGTCG
ATCCGCATGC GCGTCGGCGC CCTGCCGCAG CATCCGACCG GACTGCTCGG CTACGCCTTC
AACTGGTCGC CCGAGGGCGT CGTCAACGAA TATCTGAACG ACTGCGAGGT CATCGAGGGC
GGTGTGCGCA AGCTTGTCTC GCCGATGGAA TGGCACGAGA CCGTCTATGT CGGCGGCGTC
AAGCTCGAAG CCTTCACGAC GTCCGGCGGC CTTGGCACCA TGTGTGACAC CATGCTCGGC
AAGATCGACA ATCTCGATTA CAAGACCATG CGTTATCCCG GCCATATGGA GCTGATGAAT
TTCTTCTTCC ACGAGCTGTT GATGCGCGAC AAGCGCAAGC TCGCCGGCGA GATCCTGACC
AATGCCAAGC CGCCGGTTGA AGACGATGTT GTCTATGTCC ATGTCGCCGC CGAAGGCACC
GAGAATGGCA GCCTGCGCCG CAAGGAATTC GTGCGCGCCT ATTACCCGAT CGAGATTGCC
GGCGCGCGCC GCACGGCGAT CGCCTGGACG ACGTCAGCCT CCGTCGTCGC CGTCATCGAG
ATGGTCCGCG ACGGCCTGCT GCCGACGACC GGCTTCCTGC ACCAGGAGCA TATTCCGCTG
GAGATGTTTT TGAAGACGCC GACCGGCAGC CTCTTCAAGG CGGGTGCGAC CAGCCACGGC
TAA
 
Protein sequence
MSFEKIAVLG LGKVGRLAAT LLHEGGFEVI GVDAQLPLSD VPFKCRIGDI SDPQVIGELL 
SNVEAVLSCL PYHLNIELAR AAHLAGIHYF DLTEDVPTTN FIIELSKTAR GLMAPQCGLA
PGFVGIIGAS LADGFDRCRS IRMRVGALPQ HPTGLLGYAF NWSPEGVVNE YLNDCEVIEG
GVRKLVSPME WHETVYVGGV KLEAFTTSGG LGTMCDTMLG KIDNLDYKTM RYPGHMELMN
FFFHELLMRD KRKLAGEILT NAKPPVEDDV VYVHVAAEGT ENGSLRRKEF VRAYYPIEIA
GARRTAIAWT TSASVVAVIE MVRDGLLPTT GFLHQEHIPL EMFLKTPTGS LFKAGATSHG