Gene Rleg_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1047 
Symbol 
ID8012176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1022234 
End bp1023400 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content62% 
IMG OID644823630 
Productoxidoreductase domain protein 
Protein accessionYP_002974881 
Protein GI241203785 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.51636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACGA ATAACAGGCA AAAGCCGATC CGTTGGGGCA TGGTCGGCGG CGGCAGAGGC 
AGCCAGATCG GCTATATTCA TCGCTCGGCA GCGCATCGCG ACGACGTCTT TGCCCTTGCG
GCGGGTGCCT TCGATATCGA TCCGGAACGC GGCCGCGCGT TCGGCGTCGA TCTCGGGCTC
GACGAAGCCA GGAGCTACCG GGACTATGCG GCGATGTTTG CGACTGAGGC CCGGCGGGCC
GACGGCATCG AGGCCGTGTC GATCGCGACA CCCAACAACA CCCATTTCGC GATCTGCAAG
GCAGCACTCG AACATGATCT GCATGTGATC TGCGAAAAGC CACTCTGCTT CACGACCGCC
GAAGCCGAAG AGCTGAAGAC GCTTTCCCAG GCGCGCGGCC GGATCGTCGG CGTGACCTAT
GGTTATGCCG GCCATCAGAT AATCGAGCAG GCGCGCGCCA TGGTCAGGAA TGGCGATCTC
GGCGAAATCC GCATCGTCAA CCTGCAATTC GCCCATGGTT TCCATAGTGC TGCGGTGGAG
GAGCAAAACC CCTCGACGCG GTGGCGCGTC GATCCGAAAT TCGCGGGGCC CAGCTATGTC
CTCGGCGATG TCGGGACCCA CCCGCTTTAT ATCGCCAAGG TCATCCTGCC GCACCTCAAG
ATCAGACGTC TCCTCTGCAC CCGCCAGAGC TTCGTCAAAA GCCGAGCGCC GCTCGAAGAC
AATGCGGTGA CCCTGATGGA ATATGATAAT GGCGCGATCG CCACCATCTG GTCGAGCGCC
GTCAATGCCG GCTCCATGCA CGGCCAGAAG ATCCGCATCG TCGGCTCGAA GGCCAGTATC
GAATGGTGGG ACGAGCGGCC GAACCAGCTC TCCTACGAGA TCCAGGGCGA ACCGGCCCGT
ATTCTCGAGC GTGGCATGGA CTATCTCTAT CCCGAGGCCA GGATCGACGA CCGGATCGGC
GGCGGCCACC CGGAAGGTCT CTTCGAAGCC TGGGCGAACC TCTATCGCCG CTTCGGCTTC
GCCATCAACA GAGAGCGCGG CCTCGCCCCT GCGGGGATCG AAGAGCTGGT CTTTCCCGAT
GTCGATGCCG GACTGGAAGG GGTGCGCTGG GTGGAAAACT GCGTGCGTTC CGCCGATGCC
GGCGGGATCT GGCTGGATTA TCGCTAG
 
Protein sequence
MMTNNRQKPI RWGMVGGGRG SQIGYIHRSA AHRDDVFALA AGAFDIDPER GRAFGVDLGL 
DEARSYRDYA AMFATEARRA DGIEAVSIAT PNNTHFAICK AALEHDLHVI CEKPLCFTTA
EAEELKTLSQ ARGRIVGVTY GYAGHQIIEQ ARAMVRNGDL GEIRIVNLQF AHGFHSAAVE
EQNPSTRWRV DPKFAGPSYV LGDVGTHPLY IAKVILPHLK IRRLLCTRQS FVKSRAPLED
NAVTLMEYDN GAIATIWSSA VNAGSMHGQK IRIVGSKASI EWWDERPNQL SYEIQGEPAR
ILERGMDYLY PEARIDDRIG GGHPEGLFEA WANLYRRFGF AINRERGLAP AGIEELVFPD
VDAGLEGVRW VENCVRSADA GGIWLDYR