Gene Rleg_4339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4339 
Symbol 
ID8015917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4461754 
End bp4463031 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content59% 
IMG OID644826915 
Productoxidoreductase domain protein 
Protein accessionYP_002978118 
Protein GI241207022 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.222851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000148603 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGAAAC GCCGTTTTGC CTTGATCGGT ACGGGAAACC GCGGCACCAC CATGTGGGGC 
AAGGATCTGC TTGCCGGCTG GCGCGAGCAT GTCGACCTGA CCGCCATTGT CGAGAAGAAT
TCGCTGCGCG GCGAGCGCGC CCGCAACATG ATCGGCAGTA ATGCGCCGCT CTACGAAAAC
ATCGATTCGA TGCTTGCTGA GCAGAAGCCG GATCTCGTCA TCGTCTGTAC GCCCGACCAT
ACGCATGACG ATATCGTCGT GCGGGCGCTG GAGTCCGGCA TCGACGTCAT CACCGAAAAG
CCAATGACGA CCTCGGTCGA GAGGATTCGC CGGATTCTGG ATGCCGAAAA GCGCACTGGT
CGCCGGGTCG ACGTGTCCTT CAACTATCGC TATGCGCCGA CGGCCGCGAA GATCAAGGAA
TTGCTGAATG CCGGCGAGAT CGGCCGAGTC ACCTCAGTCG ATTTTCATTG GTATTTGAAC
ACTAAGCACG GCGCCGACTA CTTCCGCCGC TGGCATGCCT ATACGGAAAA TTCCGGCAGT
CTGTTCGTCC ACAAGGCAAC GCATCATTTC GATCTGCTGA ACTGGTACCT CGACAGCGAT
CCCGATGCCG TCACCTCTTT CGCCGACCTG CAGAATTACG GCCGCAAGGG CCCGTTCCGC
GGCCCGCGCT GCAAGCTCTG TCCGCACGCG CATGAATGCG ACTATTATCT CGATCTCGAG
GCCGATCCCT TCCTTGATTC ACTCTACGAG GATCCCTCGA AGATCGACGG CTACTTCCGT
GACGGCTGCG TCTTCCGCGA GGACATCGAC ATTCCCGATA CGATGGTGGT GTCGCTCCGT
TACCGCAACA ATGTCCACGT CTCCTATTCG CTGAACACCT TCCAGCCGAT CGAAGGCCAT
CACCTTGCCT TCAACGGCAC CAAAGGGCGG ATCGAGCTTC GCCAGTATGA AGCCCAGCCC
TGGGAAGAGC CGAAGCAGGA CACGATCCTG CTCATCCGCA ATTTCCCTGA TGGTAAGGAG
GCAGTGGAGC GCATCGTTGT TCCGCATTTC ACCGGCGGCC ATTACGGCGG CGACGACCGG
ATGCGTAACA TGATCTTCAA GCCCGATATG GAAGACAAGC TTGCGCAACG CGCCGGCACG
CGGGCGGGCG CCATGTCGGT GCTTTGCGGC ATTGCGGCGC TGGAGAGCTC GCGCACCGGC
AAGGTGGTCA ACATTGCCGA TCTCATGCCC GAACTTGCCA ATGACGGTTC GCCAAATTCG
CTGAGGACGT CGCGCTGA
 
Protein sequence
MEKRRFALIG TGNRGTTMWG KDLLAGWREH VDLTAIVEKN SLRGERARNM IGSNAPLYEN 
IDSMLAEQKP DLVIVCTPDH THDDIVVRAL ESGIDVITEK PMTTSVERIR RILDAEKRTG
RRVDVSFNYR YAPTAAKIKE LLNAGEIGRV TSVDFHWYLN TKHGADYFRR WHAYTENSGS
LFVHKATHHF DLLNWYLDSD PDAVTSFADL QNYGRKGPFR GPRCKLCPHA HECDYYLDLE
ADPFLDSLYE DPSKIDGYFR DGCVFREDID IPDTMVVSLR YRNNVHVSYS LNTFQPIEGH
HLAFNGTKGR IELRQYEAQP WEEPKQDTIL LIRNFPDGKE AVERIVVPHF TGGHYGGDDR
MRNMIFKPDM EDKLAQRAGT RAGAMSVLCG IAALESSRTG KVVNIADLMP ELANDGSPNS
LRTSR