Gene Rleg_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2023 
Symbol 
ID8013056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2013885 
End bp2015840 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content66% 
IMG OID644824610 
ProductDEAD/DEAH box helicase domain protein 
Protein accessionYP_002975841 
Protein GI241204745 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.995582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAT TTAACGGCGT CGTTCCGGCG ATCGCCAAGG CGTTGCAGAA GCGTGGTTAC 
GCCGAACTCA CCCCGGTGCA GAAGGCGATG CTCGATCCGG CGCTTGCCGC TTCCGACGCG
CTGGTCTCGG CCCAGACCGG CTCCGGCAAA ACCGTCGCCT TCGGCCTGGC GCTGGCGCCG
ACGCTGCTCG AAGGCAGGGA GCGCTTCGGC AATGCCGCCG CGCCGCTCGC CCTCGTCATC
GCCCCGACGC GCGAGCTTGC CCTGCAGGTG AAGCGGGAGC TCGAATGGCT CTTTGAGATG
ACCGGCGCGG TGATCGTCAG CTGCGTCGGC GGCATGGACA TCCGTAGCGA GCGTCGCTCG
CTCGAACGAG GCGCCCATAT CGTCGTCGGC ACGCCCGGTC GGCTTTGCGA CCATATCCGC
CGCAGGGCGC TCGACATGTC GGAACTGAAG GCCGCCGTGC TCGACGAGGC CGACGAGATG
CTCGATCTCG GCTTCCGCGA AGACCTTGAA TTCATCCTCG ATGCGGCGCC GGACGAACGC
CGGACGCTGA TGTTTTCGGC AACGGTGCCG GCGGCGATCG CCAAGCTTGC CAAGAGCTAC
CAGCGCGATG CCGTGCGCAT CAGCACCGCG GCCGAGGAAA AGCAGCATAT CGACATCGAA
TATCGCGCGC TGATGGTTGC CCCGAGCGAC CGCGAGAACG CGATCATCAA CGTGCTGCGG
TATTACGAGG CGACCAATGC CATCGTGTTC TGCTCGACGC GCGCCGCCGT CAACCATCTG
ACCGCACGCT TCAACAACCG CAATTTCGCC GTCGTGGCGC TGTCCGGCGA GTTGACGCAG
AACGAGCGCA CTCACGCGCT GCAGGCGATG CGCGACGGGC GCGCCCGTGT CTGCATCGCG
ACCGACGTTG CCGCCCGCGG CATCGACCTG CCGGGGCTCG ATCTCGTCAT CCATGCCGAC
CTGCCGACCA ATCCGGACAC GCTGCTGCAC CGCAGCGGCC GAACCGGCCG GGCCGGGCGC
AAGGGCATCA GCGCCATGAT CGTGCCGCTG AATGCACGGC GCAAGGCAGA GCGCCTGCTC
GAGAATGCCG GCATTTCGGC TGCCTGGGCG CGACCGCCTT CGGCCGAGGA AGTGAGCGAG
CGCGACGACG AGCGGCTGCT CGCCGATCCG ATCTTCAACG AAACGCCGCA GGAGGAAGAG
CAGGGGCTGG TACAACAGCT TCTCGCCAGC CACGGCGCCG AAAAGCTCGC CGCAGCCTTC
CTGCGTCTTT ACCGCACCAA TCATTCGGCG CCGGAGGACC TCATCGAGGT TACCGTGCAG
GACGACCGCA ACCGCAAGCG CCGCGACAAT GCCGAGCCGT ACGAGCCGGC GCAAAAGGGA
CCGCGCGAAG ATTTCGGCGC CAGCGTCTGG TTTTCCGTCT CGGTCGGGCG CAAACAGAAT
GCCGAGCCGC GCTGGCTGAT CCCGATGCTC TGCCGCAACG GTAATGTGAC GAAGCGCGAA
ATCGGCGCGA TCAAGATGCA GCCCGAGGAA ACTTTTGTGG AGATCGCTGC GGCGAGCGCC
GAAAGCTTCC TGGCGGCGAT CGGTCCGAAC AAGGCGCTGG AACGCGGCAT CCGCGTGACC
AGGCTTTCCG GCACGCCGGA TTTCAGCCGG GCGCCTGCGC CAAAACCCTA TGCCGGCAAG
CCATCTCGCG ACGAGCGGCC GGACGACACA TTCCGCGGCG AGCGGCCGAA GAACAAGTTC
GGCAAGGGTC CGGGCGGCGG ATATGCCGCG GCCGATAATA GCGGCGGGGA CAAACGGGAT
AGCAAGCCCT GGAGCAAGAA GCCGGGCAAG CCCGCGTTCG ATGGCCCGAA GTCCGACAAG
CCGAAATACG AAGGCCCAAG ATCCGACGCC CCAAGATATG AGGGTAAGGG CGGCGCCGGA
CCAAAAGCTA AGTTCTCGAA GAAGAAGCCC GGCTGA
 
Protein sequence
MTEFNGVVPA IAKALQKRGY AELTPVQKAM LDPALAASDA LVSAQTGSGK TVAFGLALAP 
TLLEGRERFG NAAAPLALVI APTRELALQV KRELEWLFEM TGAVIVSCVG GMDIRSERRS
LERGAHIVVG TPGRLCDHIR RRALDMSELK AAVLDEADEM LDLGFREDLE FILDAAPDER
RTLMFSATVP AAIAKLAKSY QRDAVRISTA AEEKQHIDIE YRALMVAPSD RENAIINVLR
YYEATNAIVF CSTRAAVNHL TARFNNRNFA VVALSGELTQ NERTHALQAM RDGRARVCIA
TDVAARGIDL PGLDLVIHAD LPTNPDTLLH RSGRTGRAGR KGISAMIVPL NARRKAERLL
ENAGISAAWA RPPSAEEVSE RDDERLLADP IFNETPQEEE QGLVQQLLAS HGAEKLAAAF
LRLYRTNHSA PEDLIEVTVQ DDRNRKRRDN AEPYEPAQKG PREDFGASVW FSVSVGRKQN
AEPRWLIPML CRNGNVTKRE IGAIKMQPEE TFVEIAAASA ESFLAAIGPN KALERGIRVT
RLSGTPDFSR APAPKPYAGK PSRDERPDDT FRGERPKNKF GKGPGGGYAA ADNSGGDKRD
SKPWSKKPGK PAFDGPKSDK PKYEGPRSDA PRYEGKGGAG PKAKFSKKKP G