Gene Rleg2_4018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4018 
Symbol 
ID6982788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4188302 
End bp4189399 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content62% 
IMG OID643398747 
Productoxidoreductase domain protein 
Protein accessionYP_002283506 
Protein GI209551589 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTCA CGGGAAACAG TGACATGAGC ATCAGAACGG TCGCGATCGT CGGCTGCGGT 
ATCGGCCGCT CCCACATTGT CGAGGGTTAT CTGCCGCATG CCGACAAGTT CAAGGTCGTG
GCGATCTGCG ACCTGAATGA GCAGCGCATG GCGTCAGTCG GCGACGAGTT CGGCATCGAG
CGGCGCACCA CCTCCTTTGC GGAGTTGCTC GCCGACGAAA CGATCGACAT CATCGATATC
TGCACTCCGC CCGGCATCCA TCTGGAACAG GTGGTGGCTG CCCTCGCTGC CGGCAAACAT
GTCGTTTGCG AAAAGCCGCT GACCGGCTCG CTTGCCGCCG TCGATACGAT CATGGCAGCG
GAGAAAGCCG CCAAAGGCGT GCTGATGCCG ATCTTCCAGT ATCGCTACGG CGACGGCATC
CAGAAGGCCA AGCGGATTAT CGACGCCGGC ATTGCCGGCA AGGCCTACAC GGCTTCGGTC
GAAACCTTCT GGCTGCGCAA GCCCGAATAT TACGCCGTGC CCTGGCGCGG CAAATGGGCG
ACGGAACTCG GCGGCGTGCT CGTCACCCAT GCGCTGCATC TGCACGACAT GATGATGCAT
CTGATGGGGC CGGCGGCAAG GGTCTTCGGC CGTGTCGCCA CCCGCGTCAA CGATATCGAG
GTCGAGGATT GCGCCTCCGC CAGCCTGCTG ATGGAAAGCG GCGCCTTCGT CTCGCTGTCC
TGCACGCTGG GTTCGCAGGA ACAGTTGAGC CGGCTGAGGC TGCACTTCGA GAATGTTACC
TTCGAAAGCA GCCATGAGCC CTATACGCCA GGTAAGGATC CTTGGAAGAT CATCGCCGCC
AATGACGACG TGCAGGCAAA GATCGACCGG GTGATCAGCG ACTGGCAGCC GGTCGCGCCG
CGTTTCACTA CCCAGATGGG CCAGTTTCAC GCCTTCCTCA GCGGCCATGG GCCGCTGCCG
GTGACGACGG TGGATGCACG CCGCGCGCTG GAACTCGTCA CCGCCATCTA CCAGTCTTCC
GACAGCGGCG CCGAAGTGCC GCTGCCGGTC GGCCCGGACA GTCCGAAATA CGTCGATTGG
CGTGCAAGAA CGAAGTAA
 
Protein sequence
MAVTGNSDMS IRTVAIVGCG IGRSHIVEGY LPHADKFKVV AICDLNEQRM ASVGDEFGIE 
RRTTSFAELL ADETIDIIDI CTPPGIHLEQ VVAALAAGKH VVCEKPLTGS LAAVDTIMAA
EKAAKGVLMP IFQYRYGDGI QKAKRIIDAG IAGKAYTASV ETFWLRKPEY YAVPWRGKWA
TELGGVLVTH ALHLHDMMMH LMGPAARVFG RVATRVNDIE VEDCASASLL MESGAFVSLS
CTLGSQEQLS RLRLHFENVT FESSHEPYTP GKDPWKIIAA NDDVQAKIDR VISDWQPVAP
RFTTQMGQFH AFLSGHGPLP VTTVDARRAL ELVTAIYQSS DSGAEVPLPV GPDSPKYVDW
RARTK