Gene Information |
Locus tag | Rleg_3925 |
Symbol | |
ID | 8014741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3998450 |
End bp | 4001329 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826494 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002977705 |
Protein GI | 241206609 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
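The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 3998450, 4001329
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2880
print(protein_length(length_bp))  # 959
```

Under these assumptions the computed values agree with the Gene Length (2880 bp) and Protein Length (959 aa) fields. The strand does not affect the length calculation.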
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.287134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTCA TCCATGAAAT CGACTACGGC ACTCCTGCTT CGAAATCCGA GGTCATGGTG AAGCTCACCA TCGACGGACA GCAGATCAGC GTGCCGGAGG GCACCTCGAT CATGCGCGCC TCGATGGAGG CCGGCATCAA GGTGCCGAAG CTCTGCGCCA CCGATATGGT CGACGCTTTC GGCTCCTGCC GGCTCTGCCT CGTCGAGATC GAGGGCCGCA ACGGAACACC CTCCTCCTGC ACGACGCCGG TGGCGGCAAA CATGGTGGTG CACACGCAGA CGGGGCGGTT GAAGGATATC CGCCGCGGCG TGATGGAACT CTATATTTCC GACCATCCGC TCGACTGTCT CACCTGCGCG GCTAACGGTG ATTGCGAATT GCAGGACATG GCGGGCGCCG TCGGCCTGCG CGACGTACGT TACGGCTATG AGGGCGACAA CCACGTCAAG GCGCGCAGCA ATGGCGACAT CAATCTGAAA TGGATGCCGA AGGACGAGTC CAATCCCTAT TTCACCTATG ATCCCTCGAA ATGCATCGTC TGTTCGCGCT GCGTGCGCGC CTGCGAGGAA GTGCAGGGCA CCTTCGCGCT GACGATCGAG GGCCGCGGCT TCGGCTCGCG CGTTTCGCCC GGCATGCACG AGCATTTCAT CGATTCCGAA TGCGTCTCCT GCGGTGCCTG CGTCCAGGCC TGCCCGACGG CAACGCTGAC GGAGAAATCG GTGATTCAGA TCGGCCAGCC GGAGCATTCG GCTGTGACGA CCTGCGCCTA CTGCGGCGTC GGCTGTTCCT TCAAGGCGGA GATGCGCGGC GAGGAACTGG TGCGCATGGT GCCGTGGAAG GACGGCCAAG CCAATCGCGG CCATTCCTGC GTCAAGGGAC GCTTCGCCTA CGGCTATTCC ACCCACAAGG ACCGCATCCT CAATCCGATG ATCCGCGAAA AGGTCAGCGA TCCCTGGCGG GAAGTGAGCT GGGACGAGGC CTTCGCGCAT GTGGCGCTGG AGTTCCGCCG CATCCAGTAT CAATACGGCC GCGAGGCAAT TGGCGGCATC ACCTCGTCGC GCTGCACCAA TGAGGAAACG TATCTGGTGC AGAAGCTGAT CCGCGCCGGC TTCGGCAACA ACAATGTCGA CACCTGCGCC CGCGTCTGCC ATTCGCCGAC CGGTTACGGC CTCGGCCAGA CCTTCGGCAC GTCGGCGGGC ACACAGGATT TCGACAGCGT CGAGCATTCC GACGTCGTCA TCGTCATCGG CGCCAATCCA ACCGATGGGC ATCCGGTGTT CGGCTCGCGG CTGAAGAAGC GGCTGCGCCA GGGCGCCAAG CTCATCGTCA TCGATCCGCG CCGCACCGAT ATCGTCCGCT CGCCGCATAT CGAGGCCTCC TATCACCTGC CGCTGAAGCC CGGCACCAAT GTCGCGGTCA TGACGGCGCT GGCGCATGTG ATCGTCACCG AAGGGCTCTT TGACGAGGCG TTCATCCGCG AGCGCTGCGA CTGGTCGGAG TTCGAGGACT GGGCCGCCTT CGTCGCCGAA CCGCAGCACA GTCCCGAAGA GACCGAGATC TTCACCGGCG TGCCGGCGGC GGATCTGCGC GACGCGGCAA GGCTCTATGC CAAGGGCGGC AATGGCGCAA TCTATTATGG CCTCGGCGTC ACCGAACACA GCCAGGGCTC GACCACGGTC ATCGCGATCG CCAACCTGGC GATGGCGACC GGCAATATCG GCCGTCCCGG CGTCGGCGTG AACCCGCTGC GCGGCCAGAA CAATGTGCAG GGCTCCTGCG ACATGGGCTC GTTCCCGCAC 
GAATTGCCGG GCTACCGGCA CATTTCCGAC GATGCGACGC GCGATATCTT CGAAAAGCTC TGGGGCGTGA AGCTCAACAA CGAGCCGGGC CTGCGTATTC CGAACATGCT GGATGCAGCA GTCGACGGCT CGTTCAAGGG CATCTACATC CAGGGCGAAG ACATCCTCCA GTCCGATCCC GATACCAAAC ATGTCGCGGC CGGGCTTGCG GCGATGGAAT GCGTCGTCGT GCAGGATCTG TTCCTCAACG AGACCGCCAA TTACGCCCAT GTCTTTCTAC CGGGCTCGAC CTTCCTCGAG AAGGACGGCA CCTTCACCAA TGCCGAGCGC CGCATCAATC GTGTGCGCAA GGTGATGTCG CCGCGCAACG GCTATGGCGA CTGGGAAGTG ACGCAGAAGC TTGCCCAGGC GATGGGGCTC GACTGGAATT ACACCCATCC GTCGGAGATC ATGGACGAGA TCGCCGCGAC GACGCCGAGT TTCGCGATGG TCTCCTACGA CTATCTCGAC AAAATGGGCT CGGTGCAGTG GCCCTGCAAC GAGAAGACCC CGCTCGGCTC GCCGATCATG CATGTCAATG GCTTCGTGCG CGGCAAGGGC AAGTTCATCC GCACCGAATA TGTGGCGACC GACGAGCGCA CCGGTCCGCG CTTCCCGCTG CTGCTGACAA CCGGCCGCAC CCTCAGCCAG TACAATGTCG GGGCGCAGAC ACGGCGAACC GAGAATGTCG TATGGCATGC GGAAGACCGG CTGGAAATCC ATTCGCACGA TGCCGAGCAG CGCGGCGTTC GCGACGGCGA CTGGGTGAAG CTCGGCAGCC GCTCCGGCGA CACGACGCTC AGGGCGCTGA TCACCGATCG CGTCGCGCCG GGTGTCGTCT ACACGACCTT CCATCATCCC ACGACGCAGG CGAACGTCAT CACCACCGAT TTCTCCGACT GGGCGACGAA CTGCCCGGAA TACAAGGTGA CGGCGGTGCA GGTCTCGCCC TCCAACGGGC CGAGCGAATG GCAACTCGAA TATGACGAGC AGGCGCGTCA ATCGCGCCGC ATCGCCGGCA AGCTCGAGGC AGCGGAGTGA
|
Protein sequence | MSLIHEIDYG TPASKSEVMV KLTIDGQQIS VPEGTSIMRA SMEAGIKVPK LCATDMVDAF GSCRLCLVEI EGRNGTPSSC TTPVAANMVV HTQTGRLKDI RRGVMELYIS DHPLDCLTCA ANGDCELQDM AGAVGLRDVR YGYEGDNHVK ARSNGDINLK WMPKDESNPY FTYDPSKCIV CSRCVRACEE VQGTFALTIE GRGFGSRVSP GMHEHFIDSE CVSCGACVQA CPTATLTEKS VIQIGQPEHS AVTTCAYCGV GCSFKAEMRG EELVRMVPWK DGQANRGHSC VKGRFAYGYS THKDRILNPM IREKVSDPWR EVSWDEAFAH VALEFRRIQY QYGREAIGGI TSSRCTNEET YLVQKLIRAG FGNNNVDTCA RVCHSPTGYG LGQTFGTSAG TQDFDSVEHS DVVIVIGANP TDGHPVFGSR LKKRLRQGAK LIVIDPRRTD IVRSPHIEAS YHLPLKPGTN VAVMTALAHV IVTEGLFDEA FIRERCDWSE FEDWAAFVAE PQHSPEETEI FTGVPAADLR DAARLYAKGG NGAIYYGLGV TEHSQGSTTV IAIANLAMAT GNIGRPGVGV NPLRGQNNVQ GSCDMGSFPH ELPGYRHISD DATRDIFEKL WGVKLNNEPG LRIPNMLDAA VDGSFKGIYI QGEDILQSDP DTKHVAAGLA AMECVVVQDL FLNETANYAH VFLPGSTFLE KDGTFTNAER RINRVRKVMS PRNGYGDWEV TQKLAQAMGL DWNYTHPSEI MDEIAATTPS FAMVSYDYLD KMGSVQWPCN EKTPLGSPIM HVNGFVRGKG KFIRTEYVAT DERTGPRFPL LLTTGRTLSQ YNVGAQTRRT ENVVWHAEDR LEIHSHDAEQ RGVRDGDWVK LGSRSGDTTL RALITDRVAP GVVYTTFHHP TTQANVITTD FSDWATNCPE YKVTAVQVSP SNGPSEWQLE YDEQARQSRR IAGKLEAAE
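The GC content field reported earlier (63%) can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in space-separated blocks of 10 bases as displayed here. `gc_percent` is a hypothetical helper name, and the database may round or compute the value differently.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Only the first 10-base block of the gene sequence, for illustration.
first_block = "ATGTCTCTCA"
print(gc_percent(first_block))  # 40.0
```

Passing the complete gene sequence instead of the first block gives the value to compare against the reported field.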