Gene Rleg_3925 details

Gene Information

Locus tag: Rleg_3925
Symbol:
ID: 8014741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3998450
End bp: 4001329
Gene Length: 2880 bp
Protein Length: 959 aa
Translation table: 11
GC content: 63%
IMG OID: 644826494
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_002977705
Protein GI: 241206609
COG category: [R] General function prediction only
COG ID: [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type
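
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `span_length` and `expected_protein_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS is complete, unspliced, and ends in a single stop codon.

```python
# Values copied from the gene record.
START_BP = 3998450
END_BP = 4001329
GENE_LENGTH_BP = 2880
PROTEIN_LENGTH_AA = 959


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2880
print(expected_protein_length(GENE_LENGTH_BP))  # 959
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.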


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.287134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCTCA TCCATGAAAT CGACTACGGC ACTCCTGCTT CGAAATCCGA GGTCATGGTG 
AAGCTCACCA TCGACGGACA GCAGATCAGC GTGCCGGAGG GCACCTCGAT CATGCGCGCC
TCGATGGAGG CCGGCATCAA GGTGCCGAAG CTCTGCGCCA CCGATATGGT CGACGCTTTC
GGCTCCTGCC GGCTCTGCCT CGTCGAGATC GAGGGCCGCA ACGGAACACC CTCCTCCTGC
ACGACGCCGG TGGCGGCAAA CATGGTGGTG CACACGCAGA CGGGGCGGTT GAAGGATATC
CGCCGCGGCG TGATGGAACT CTATATTTCC GACCATCCGC TCGACTGTCT CACCTGCGCG
GCTAACGGTG ATTGCGAATT GCAGGACATG GCGGGCGCCG TCGGCCTGCG CGACGTACGT
TACGGCTATG AGGGCGACAA CCACGTCAAG GCGCGCAGCA ATGGCGACAT CAATCTGAAA
TGGATGCCGA AGGACGAGTC CAATCCCTAT TTCACCTATG ATCCCTCGAA ATGCATCGTC
TGTTCGCGCT GCGTGCGCGC CTGCGAGGAA GTGCAGGGCA CCTTCGCGCT GACGATCGAG
GGCCGCGGCT TCGGCTCGCG CGTTTCGCCC GGCATGCACG AGCATTTCAT CGATTCCGAA
TGCGTCTCCT GCGGTGCCTG CGTCCAGGCC TGCCCGACGG CAACGCTGAC GGAGAAATCG
GTGATTCAGA TCGGCCAGCC GGAGCATTCG GCTGTGACGA CCTGCGCCTA CTGCGGCGTC
GGCTGTTCCT TCAAGGCGGA GATGCGCGGC GAGGAACTGG TGCGCATGGT GCCGTGGAAG
GACGGCCAAG CCAATCGCGG CCATTCCTGC GTCAAGGGAC GCTTCGCCTA CGGCTATTCC
ACCCACAAGG ACCGCATCCT CAATCCGATG ATCCGCGAAA AGGTCAGCGA TCCCTGGCGG
GAAGTGAGCT GGGACGAGGC CTTCGCGCAT GTGGCGCTGG AGTTCCGCCG CATCCAGTAT
CAATACGGCC GCGAGGCAAT TGGCGGCATC ACCTCGTCGC GCTGCACCAA TGAGGAAACG
TATCTGGTGC AGAAGCTGAT CCGCGCCGGC TTCGGCAACA ACAATGTCGA CACCTGCGCC
CGCGTCTGCC ATTCGCCGAC CGGTTACGGC CTCGGCCAGA CCTTCGGCAC GTCGGCGGGC
ACACAGGATT TCGACAGCGT CGAGCATTCC GACGTCGTCA TCGTCATCGG CGCCAATCCA
ACCGATGGGC ATCCGGTGTT CGGCTCGCGG CTGAAGAAGC GGCTGCGCCA GGGCGCCAAG
CTCATCGTCA TCGATCCGCG CCGCACCGAT ATCGTCCGCT CGCCGCATAT CGAGGCCTCC
TATCACCTGC CGCTGAAGCC CGGCACCAAT GTCGCGGTCA TGACGGCGCT GGCGCATGTG
ATCGTCACCG AAGGGCTCTT TGACGAGGCG TTCATCCGCG AGCGCTGCGA CTGGTCGGAG
TTCGAGGACT GGGCCGCCTT CGTCGCCGAA CCGCAGCACA GTCCCGAAGA GACCGAGATC
TTCACCGGCG TGCCGGCGGC GGATCTGCGC GACGCGGCAA GGCTCTATGC CAAGGGCGGC
AATGGCGCAA TCTATTATGG CCTCGGCGTC ACCGAACACA GCCAGGGCTC GACCACGGTC
ATCGCGATCG CCAACCTGGC GATGGCGACC GGCAATATCG GCCGTCCCGG CGTCGGCGTG
AACCCGCTGC GCGGCCAGAA CAATGTGCAG GGCTCCTGCG ACATGGGCTC GTTCCCGCAC
GAATTGCCGG GCTACCGGCA CATTTCCGAC GATGCGACGC GCGATATCTT CGAAAAGCTC
TGGGGCGTGA AGCTCAACAA CGAGCCGGGC CTGCGTATTC CGAACATGCT GGATGCAGCA
GTCGACGGCT CGTTCAAGGG CATCTACATC CAGGGCGAAG ACATCCTCCA GTCCGATCCC
GATACCAAAC ATGTCGCGGC CGGGCTTGCG GCGATGGAAT GCGTCGTCGT GCAGGATCTG
TTCCTCAACG AGACCGCCAA TTACGCCCAT GTCTTTCTAC CGGGCTCGAC CTTCCTCGAG
AAGGACGGCA CCTTCACCAA TGCCGAGCGC CGCATCAATC GTGTGCGCAA GGTGATGTCG
CCGCGCAACG GCTATGGCGA CTGGGAAGTG ACGCAGAAGC TTGCCCAGGC GATGGGGCTC
GACTGGAATT ACACCCATCC GTCGGAGATC ATGGACGAGA TCGCCGCGAC GACGCCGAGT
TTCGCGATGG TCTCCTACGA CTATCTCGAC AAAATGGGCT CGGTGCAGTG GCCCTGCAAC
GAGAAGACCC CGCTCGGCTC GCCGATCATG CATGTCAATG GCTTCGTGCG CGGCAAGGGC
AAGTTCATCC GCACCGAATA TGTGGCGACC GACGAGCGCA CCGGTCCGCG CTTCCCGCTG
CTGCTGACAA CCGGCCGCAC CCTCAGCCAG TACAATGTCG GGGCGCAGAC ACGGCGAACC
GAGAATGTCG TATGGCATGC GGAAGACCGG CTGGAAATCC ATTCGCACGA TGCCGAGCAG
CGCGGCGTTC GCGACGGCGA CTGGGTGAAG CTCGGCAGCC GCTCCGGCGA CACGACGCTC
AGGGCGCTGA TCACCGATCG CGTCGCGCCG GGTGTCGTCT ACACGACCTT CCATCATCCC
ACGACGCAGG CGAACGTCAT CACCACCGAT TTCTCCGACT GGGCGACGAA CTGCCCGGAA
TACAAGGTGA CGGCGGTGCA GGTCTCGCCC TCCAACGGGC CGAGCGAATG GCAACTCGAA
TATGACGAGC AGGCGCGTCA ATCGCGCCGC ATCGCCGGCA AGCTCGAGGC AGCGGAGTGA
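
The gene sequence above is displayed in space-separated blocks of 10 bases, so the grouping has to be stripped before any computation such as GC content. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample input is only the first display line of the sequence, so its GC fraction need not match the whole-gene figure in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping; uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring display whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First display line of the gene sequence, used only as sample input.
first_line = "ATGTCTCTCA TCCATGAAAT CGACTACGGC ACTCCTGCTT CGAAATCCGA GGTCATGGTG"

print(len(clean_sequence(first_line)))   # 60 (six blocks of 10 bases)
print(f"{gc_fraction(first_line):.0%}")  # GC percentage of this line only
```

Passing the full gene sequence text to `gc_fraction` would give a value to compare against the GC content field.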
 
Protein sequence
MSLIHEIDYG TPASKSEVMV KLTIDGQQIS VPEGTSIMRA SMEAGIKVPK LCATDMVDAF 
GSCRLCLVEI EGRNGTPSSC TTPVAANMVV HTQTGRLKDI RRGVMELYIS DHPLDCLTCA
ANGDCELQDM AGAVGLRDVR YGYEGDNHVK ARSNGDINLK WMPKDESNPY FTYDPSKCIV
CSRCVRACEE VQGTFALTIE GRGFGSRVSP GMHEHFIDSE CVSCGACVQA CPTATLTEKS
VIQIGQPEHS AVTTCAYCGV GCSFKAEMRG EELVRMVPWK DGQANRGHSC VKGRFAYGYS
THKDRILNPM IREKVSDPWR EVSWDEAFAH VALEFRRIQY QYGREAIGGI TSSRCTNEET
YLVQKLIRAG FGNNNVDTCA RVCHSPTGYG LGQTFGTSAG TQDFDSVEHS DVVIVIGANP
TDGHPVFGSR LKKRLRQGAK LIVIDPRRTD IVRSPHIEAS YHLPLKPGTN VAVMTALAHV
IVTEGLFDEA FIRERCDWSE FEDWAAFVAE PQHSPEETEI FTGVPAADLR DAARLYAKGG
NGAIYYGLGV TEHSQGSTTV IAIANLAMAT GNIGRPGVGV NPLRGQNNVQ GSCDMGSFPH
ELPGYRHISD DATRDIFEKL WGVKLNNEPG LRIPNMLDAA VDGSFKGIYI QGEDILQSDP
DTKHVAAGLA AMECVVVQDL FLNETANYAH VFLPGSTFLE KDGTFTNAER RINRVRKVMS
PRNGYGDWEV TQKLAQAMGL DWNYTHPSEI MDEIAATTPS FAMVSYDYLD KMGSVQWPCN
EKTPLGSPIM HVNGFVRGKG KFIRTEYVAT DERTGPRFPL LLTTGRTLSQ YNVGAQTRRT
ENVVWHAEDR LEIHSHDAEQ RGVRDGDWVK LGSRSGDTTL RALITDRVAP GVVYTTFHHP
TTQANVITTD FSDWATNCPE YKVTAVQVSP SNGPSEWQLE YDEQARQSRR IAGKLEAAE