Gene Rleg_6301 details


Gene Information

Locus tag: Rleg_6301
Symbol:
ID: 8017031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012854
Strand:
Start bp: 16554
End bp: 17579
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 56%
IMG OID: 644828097
Product: DGQHR domain protein
Protein accession: YP_002979297
Protein GI: 241554084
COG category:
COG ID:
TIGRFAM ID: [TIGR03187] DGQHR domain
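
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions match the reported 1026 bp and 341 aa but are not stated in the record. The function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length(16554, 17579)
print(gene_len, protein_length(gene_len))  # prints "1026 341"
```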


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.483477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAT CAATGTGGCT CAACTGCTCG ACTGGCGTCT CGGTCGACAG GCCGGTTCTC 
CTAGGATTTG CACCGGCAAA ACTCCTGCAT CGATACAGCT TTGCCGACGT GTTGAACGAA
GACACAGGTC TTGGCTATCA GCGTCGCTTC AACTCGCAGC ACAGCCAAGA TTTCCGTCGG
TACATTCGAC AGACCGGCGC CTCGACGATC CCCCTCACTT TGAATCTGCG GCCTGACGAA
AAGGGTTGGA AAGTCGAGAA TGTCGGACCT GGACAGGCTC GGTTAGAGAT CGAGCTGGAC
GCCGGCAAGA TCATGGCGCA GGTCGATTGC CAACATAGGC TCGGTTGTCT TGAGGATCTC
GACATCCAGC TGCCGTTTAT GTGTTACGTG GGGCTCAGTC TCAAAGAAGA GATGGAAGTC
TTCAGTACCA TTAACAGCAA GGCAAAAGGC CTGAGCAACA GTCTGCTGGA CTTTCATGAT
GCACACCTGG CTGGAGACCT GGCGAAAGAT CGCCCGGAAA TCTTTATCGC TCTTCATCTG
AACAATGACC CGGATTCGCC TTGGTGCCGA CAGCTTGATC TCGGCGGAGA GAGCACTTCC
GGGATGACCC GGCGTGCGTC GCTTCGGACG ATGCAAAAGG CCATAAAGCG ATTTCTCAAC
TCCACCCGGT CGCTCAAGAC GCGCTCACCG GAAACCGTCA CGCAGATCGT CATGTCCTTT
TGGCGTGCAG TTGCCGAGGT GCTTCCTGCC CAGTGGAGCA CGCCGCGCAA GCACATCCTT
ACCAAGGGTG TTGGCGTATA CGCGTTAATG GACATCGCTG CCGATCTTTA CAACGAGGCC
GAGGATGGGG CCAAGCTGGA CCGTGGCTAT TTCGTCAATC GCCTCGCTGA CTTTGCCTAT
GATATCGACT GGTCAACGAC CGGCCGCCTG AAAGGACTTG GCGGCGAGGG TGGGGTCAAC
GAGGCCGTCG AATATATCCG CGAAACCCGC AAGCGCTCTC ATTTGAAAGT TGTCAGCAAT
GGCTAA
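
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. A minimal sketch of that cleanup and of a GC-fraction calculation follows; the helper names are hypothetical, and only the first line of the sequence is used as sample input. Pasting the full sequence into the same functions would allow a comparison with the reported GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCGGAAT CAATGTGGCT CAACTGCTCG ACTGGCGTCT CGGTCGACAG GCCGGTTCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```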
 
Protein sequence
MAESMWLNCS TGVSVDRPVL LGFAPAKLLH RYSFADVLNE DTGLGYQRRF NSQHSQDFRR 
YIRQTGASTI PLTLNLRPDE KGWKVENVGP GQARLEIELD AGKIMAQVDC QHRLGCLEDL
DIQLPFMCYV GLSLKEEMEV FSTINSKAKG LSNSLLDFHD AHLAGDLAKD RPEIFIALHL
NNDPDSPWCR QLDLGGESTS GMTRRASLRT MQKAIKRFLN STRSLKTRSP ETVTQIVMSF
WRAVAEVLPA QWSTPRKHIL TKGVGVYALM DIAADLYNEA EDGAKLDRGY FVNRLADFAY
DIDWSTTGRL KGLGGEGGVN EAVEYIRETR KRSHLKVVSN G
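
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, and this gene starts with ATG). The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGCGGAAT CAATGTGGCT CAACTGCTCG"))  # prints "MAESMWLNCS"
```

The output matches the first ten residues of the protein sequence listed above.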