Gene Rleg2_1683 details

Gene Information

Locus tag: Rleg2_1683
Symbol:
ID: 6980420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1712477
End bp: 1713577
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 63%
IMG OID: 643396407
Product: oxidoreductase domain protein
Protein accession: YP_002281197
Protein GI: 209549280
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
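
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1712477
end_bp = 1713577
gene_length_bp = 1101
protein_length_aa = 366

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that does not appear in the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1713577 - 1712477 + 1 = 1101, and 1101 / 3 - 1 = 366.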


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAGAG TCGGTATCGG CATCATCGGA TGCGGCAATA TTTCGGGCGC CTATCTCACG 
GCGATGGCAT CCTTTCCCAT TCTCGACATT CGCGGCGTCG CCGATCTCAA TCGGGAGCTG
GCCGAGGCGA AAGCTAAGGA ATTCAACGTT CCCGCCCGAT CGATCGAAGA GCTTTTCGCC
GATCCGAAGG TCGAGATCAT CGTCAATCTG ACGATCCCGA AAGCCCATGT GGCGGTCGCG
CTGCAGGCGC TCGAGGCCGG CAAACATACC TATTCGGAAA AGCCCCTCGG GATTAATTTC
GCGGAAGGGA AAAAATTGGC CGACGCCGCC AAGGCGAGGA ATCTGCGCAT CGGCGCGGCG
CCCGACACCT TCCTCGGCGG CGGCCACCAG ACGGCCCGCG CGCTGATCGA CCAGGGTGTC
ATCGGCCAGC CGGTCGGCGG CTCGGCGAGC TTCATGTGCC CCGGCCATGA ACGCTGGCAT
CCGAATCCGG CTTTCTTCTA CGAGGTCGGC GGGGGGCCGA TGCTCGACAT GGGTCCCTAT
TACATCACCG ATCTCGTCAA TCTTCTCGGG CCAGTTGCCG AGGTTGCGGG CTTTGCGACG
ACGCCGCGGA CCGAACGGCT GATTACCAGC GAGCCGCGCA ATGGCGAGCG GATCCCCGTC
CATGTCCCCA CCCATGTCGC CGGCATGATG CGTTTTGAGA ACGGCGCCGT CGTCCAGATC
GCCATGAGCT TCGATGTCGC CGGCCACAAA CATGTGCCGC TCGAAGTCTA CGGCACCGAG
GGCACGTTGA TCGTGCCCGA TCCCAACAAG TTCGCCGGCC CGGTGGAATA TCTGAAGAAG
GGCGGTCAGT TTGAGGACCA GCCGCTCACC GCACCCTATG CCGACGGCAA CTACCGCTCG
CTCGGCGTCG CCGACATGGC GCATGCGATC CGCTCCAACC GTCCGCATCG GGCCAACGGC
GATCTGGCGC TGCATGTGCT CGAGGTCATG GAAGCGTTTC ACACTGCGTC TGCGACAGGC
CGGACGGTGG CGATATCAAC GGCGGTGGAG CGCCCTGCCC CCTTGTCCCA ATCGATCGTC
GACGGACGGC TGGCGAAATA A
 
Protein sequence
MERVGIGIIG CGNISGAYLT AMASFPILDI RGVADLNREL AEAKAKEFNV PARSIEELFA 
DPKVEIIVNL TIPKAHVAVA LQALEAGKHT YSEKPLGINF AEGKKLADAA KARNLRIGAA
PDTFLGGGHQ TARALIDQGV IGQPVGGSAS FMCPGHERWH PNPAFFYEVG GGPMLDMGPY
YITDLVNLLG PVAEVAGFAT TPRTERLITS EPRNGERIPV HVPTHVAGMM RFENGAVVQI
AMSFDVAGHK HVPLEVYGTE GTLIVPDPNK FAGPVEYLKK GGQFEDQPLT APYADGNYRS
LGVADMAHAI RSNRPHRANG DLALHVLEVM EAFHTASATG RTVAISTAVE RPAPLSQSIV
DGRLAK
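
Both sequences above are printed in space-separated blocks of 10 residues, so they need to be cleaned before any computation. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `strip_blocks` and `gc_fraction` are hypothetical and not part of the record or of any library. The sample is only the first line of the gene sequence, so its GC value need not match the 63% reported for the whole gene.

```python
def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (hypothetical helper)."""
    seq = strip_blocks(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here only as a short sample.
sample = "ATGGAGAGAG TCGGTATCGG CATCATCGGA TGCGGCAATA TTTCGGGCGC CTATCTCACG"

print("bases in sample:", len(strip_blocks(sample)))
print("GC percent of sample:", round(100 * gc_fraction(sample)))
```

To check the full record, paste the complete gene sequence into `sample` and compare `len(strip_blocks(sample))` with the listed gene length of 1101 bp.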