Gene Rleg_5216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5216 
Symbol 
ID8007111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp627560 
End bp628708 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content64% 
IMG OID644822125 
Productoxidoreductase domain protein 
Protein accessionYP_002973385 
Protein GI241113550 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.647839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.501993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGA CCAATGGTAT GAAACTCAGG ATCGGCATTG TCGGATGCGG CAACATTTCG 
CTCGCCTATA TGCGCAACGC GCCGCTGTTT CGCGGCGTCG AAATCATCGC CTGTGCAGAC
CTCAACGCAG ACGCCGCCAA GCGCCGCGCA GCGGAGTTCG ATCTGCGCGC GGCTGACGTC
GACAGCCTCA TCGACGACAG GAACATCGAC CTCATCCTCA ATCTGACGAT CCCGGCTGCG
CATTTTGACG TTTCGATGCG GGCGCTGTCT GCAGGCAAGC ATGTCTTCAC GGAGAAGCCG
CTCGGTGTCA CGGCCGCCGA AGGACGCCGG TTGGTGGATG CCGCCGCCGT AAAGGGCCTC
ATGCTCGGCT CGGCGCCAGA CACTTTTCTG GGGGCGGCCG GACGCCATGC CCGGCGGCAG
ATGGAAGCCG GCGCCATCGG CAAGCCGGTG ACCGGGACAG CCTTCATGAT GGGGCGCGGC
ATGGAGCACT GGCATCCGGA TCCCGGCTTT TATTACCAGG CCGGCGCCGG CCCGGTCATG
GATATGGGGC CTTATTATCT GACGATGATG GTCAATCTGA TGGGGCCTAT CCGCCGTGTG
CAGGCCGTCG CCACAAGCGG GCAGGACGAG CGGCTCATCA CGGCGGAGGG CCCGAAGCAG
GGCACCACGT TCAAGGTCGG CACGCCGACC AGCGTGCTGT CGTTGCTGGA ATTCGATTGC
GGTGCCAAGG TCACCTTCGG CGCCTCCTGG GACGTCTTTC GCCACTCCAA TCACCCCATC
GAACTCCACG GGACCGAAGG CTCGCTGCGC CTGCCTGACC CCGACAATTT CGGCGGCTCC
GTTGCGCTCT CCAGTCGCGG CGCGCCCTGG CAGGAAACGG ATACGTCAGG CAAACTCTTC
GGCGCCGTCA ACTGGCCGAT CGCAGCGCCT GATCGTGCCA ACTACCGCAT GCTTGGTCTT
GCCGATCTCG CACGCGCAAT CATTGAGGGC CGTGCGCCGC GTGCTTCGGG CGATCTCGCT
CTCCATGTGC TCGAAGTCAT GGAAGCGATC CTGCGTGCCG GTGAAGCCGG TGTCGCGCAG
ACCATTCCGG GTATTGTCGC GCAGCCAAAA GAATTGCGGG AAGACGAAGC GAGGAGTTTG
CTGGCATGA
 
Protein sequence
MAQTNGMKLR IGIVGCGNIS LAYMRNAPLF RGVEIIACAD LNADAAKRRA AEFDLRAADV 
DSLIDDRNID LILNLTIPAA HFDVSMRALS AGKHVFTEKP LGVTAAEGRR LVDAAAVKGL
MLGSAPDTFL GAAGRHARRQ MEAGAIGKPV TGTAFMMGRG MEHWHPDPGF YYQAGAGPVM
DMGPYYLTMM VNLMGPIRRV QAVATSGQDE RLITAEGPKQ GTTFKVGTPT SVLSLLEFDC
GAKVTFGASW DVFRHSNHPI ELHGTEGSLR LPDPDNFGGS VALSSRGAPW QETDTSGKLF
GAVNWPIAAP DRANYRMLGL ADLARAIIEG RAPRASGDLA LHVLEVMEAI LRAGEAGVAQ
TIPGIVAQPK ELREDEARSL LA