Gene Rleg_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1237 
Symbol 
ID8012343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1211497 
End bp1213701 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content63% 
IMG OID644823818 
ProductNitrate reductase 
Protein accessionYP_002975068 
Protein GI241203972 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0895339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000112023 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATTCGC GTCATGGGGA ATTTTTGTGG CTTTTGGCCG TGCAAGCCTT GGAATATACA 
AATGCCATGA ACATTGCGAC CCCCATCCAC GCCAAATCAG AAAAGCCGGA CAATAGGGTG
GGCCACACGG CCTGTCCGCA TGACTGTCCC TCCACCTGCG CACTGGAGGT CGAGATATCG
GAGGATGGCC GCATCGGCCG CGTGCGCGGC GCCAATGACC ATTCCTACAC GTCAGGCGTC
ATCTGCGCCA AGGTCGCCCG TTATGCCGAG CGGCTCTACC ATCCCGACCG CCTGATGCAT
CCGTTGCGCC GCGCCGGCGC CAAGGGGGCA GGGCAGTGGC AGCAGATTTC TTGGGACGAT
GCGCTGGATG AGATCGCCGA AGCCTTTGTG AAAGCCGAGG CAAGGGACGG CAGCGAGGCG
GTCTGGCCCT ATTTCTACGC CGGTACGATG GGCTGGGTGC AGCGCGATTC CATCGATCGC
CTCCGTCATG CCAAGCGTTA CTCCGGTTTC TTCTCTTCGA TCTGCACCAA CCCGGCCTGG
ACCGGCTTCA CCATGGCGAC CGGCACGCTG CGCGGTCCCG ATCCACGCGA GATGGGCCGC
ACCGATTGCG TCGTCATCTG GGGCACCAAC GCGGTGTCGA CGCAGGTCAA TGTGATGACC
CACGCCATCA AGTCGCGCAA GGAGCGCGGC GCGAAGATCG TCGTCATCGA CATCTACGAC
AATCCGACGA TGAAGCAGGC CGACATGGCG CTGATCGTCA GGCCGGGTAC CGACGCCGCG
CTCGCCTGCG CCGTCATGCA CATCGCCTTC CGCGACGGTT ACGCCGACCG CGATTACATG
GCGAGATACG CCGATGATCC CGCCGGTCTC GAAGCGCATC TGAAAACCAA GACGCCGCAA
TGGGCCGCTG CTATCACCGG CCTTTCGATC GAGGAGATCG AAGCCTTCGC CAGCCTCGTC
GGCACGACGA AGAAGACCTT CTTCCGCCTG GGTTACGGCT TCACCCGCCA GCGCAACGGC
GCGGTCGCCA TGCATGCGGC CGCCTCGGTC GCCACCGTTC TCGGCTCCTG GCAATATGAG
GGCGGCGGCG CCTTCCATTC GAACAGCGAT ATCTTCCGCA TGAACAGCGC CGAACTGACC
GGCCGGTCGA TGAAGGATGC CGATATCCGC ATGCTCGACC AGTCGCAGAT CGGCCGCGTG
CTGACCGGCG ATGCCGTGGC GCTGCGCCAT CGCGGCCCGG TGACGGCTAT GCTGATCCAG
AACACCAATC CCGCCAACAT CGCCCCCGAG CAGCGCCTCG TCAGACGTGG CTTTGCCCGT
GAGGACCTCT TCGTTGCCGT CCACGAGCAG TTCCTGACCG AAACGGCCGA GATCGCCAAT
ATCGTCATTC CGGCAACGAT GTTCGTCGAA CATGACGACA TCTACCGGGC CGGCGGCCAG
AACCATATCC TGCTGGGACC GAAGCTGGTC GAGCCACCGC CCACCGTGCG CACCAATCTC
TTCGTCATCG AGGAACTGGC CAAACGCCTC GGCGTCGCCG ATCGCCCAGG CTTCGGCTTT
ACCGCCCGCG AGATGGTCGA CCGCATCCTC GAATCGAGCG GCCTGCCGGA TTACGATCAT
TTCCTCGAAC ACAAATGGTT CGATCGCCAG CCCGCTTTCG AGGAAGCGCA TTATCTGAAC
GGCTTTGCCC ATCCGGACGG CAAGTTCCAC TTTCGCCCGG ACTGGATCAA TCAGCCGGCG
CCGAACAAAC CGCCGGCGGC AATCGGCGCG CTCGGTCCGC ACGCAGCGCT TCCAGACTTC
CCCGATCAGG TCGATGTCAT CGAAGTCGCC GATCCCGAGC ATCCCTTCCG CCTCGCCACG
TCGCCGGCGC GCAACTTCCT GAATTCGAGC TTTTCGGAAA CCAAGACCTC CCGGCAGAAA
GAAGGCCGCC CTGAAGTGAT GATCAATCCG GCCGACGCCG AAGCCAACGG CATCACCCAT
GGCGATCTCG TCCGCATCGG TAACAGCCGC GGCGATCTGC GCATCCACGC CCGCATCACC
ACTGAAGTGA AATCAGGCGT GCTGATTGCC GAGGGCCTTT GGCCGAACAA GGCGCATGTC
GACGGCGAGG GCATCAACGT CTTGACCGGC GCCGACCCCG TCGCGCCTTA TGGCGGTGCG
GCCGTGCACG ACAACAAGGT CTGGCTTCGC AGGGACGCAG CATGA
 
Protein sequence
MNSRHGEFLW LLAVQALEYT NAMNIATPIH AKSEKPDNRV GHTACPHDCP STCALEVEIS 
EDGRIGRVRG ANDHSYTSGV ICAKVARYAE RLYHPDRLMH PLRRAGAKGA GQWQQISWDD
ALDEIAEAFV KAEARDGSEA VWPYFYAGTM GWVQRDSIDR LRHAKRYSGF FSSICTNPAW
TGFTMATGTL RGPDPREMGR TDCVVIWGTN AVSTQVNVMT HAIKSRKERG AKIVVIDIYD
NPTMKQADMA LIVRPGTDAA LACAVMHIAF RDGYADRDYM ARYADDPAGL EAHLKTKTPQ
WAAAITGLSI EEIEAFASLV GTTKKTFFRL GYGFTRQRNG AVAMHAAASV ATVLGSWQYE
GGGAFHSNSD IFRMNSAELT GRSMKDADIR MLDQSQIGRV LTGDAVALRH RGPVTAMLIQ
NTNPANIAPE QRLVRRGFAR EDLFVAVHEQ FLTETAEIAN IVIPATMFVE HDDIYRAGGQ
NHILLGPKLV EPPPTVRTNL FVIEELAKRL GVADRPGFGF TAREMVDRIL ESSGLPDYDH
FLEHKWFDRQ PAFEEAHYLN GFAHPDGKFH FRPDWINQPA PNKPPAAIGA LGPHAALPDF
PDQVDVIEVA DPEHPFRLAT SPARNFLNSS FSETKTSRQK EGRPEVMINP ADAEANGITH
GDLVRIGNSR GDLRIHARIT TEVKSGVLIA EGLWPNKAHV DGEGINVLTG ADPVAPYGGA
AVHDNKVWLR RDAA