Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3636 |
Symbol | |
ID | 6982398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3765562 |
End bp | 3768441 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398360 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002283127 |
Protein GI | 209551210 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.234983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTCA TCCATGAAAT CGACTACGGC ACTCCCGCTT CGACATCCGA GGTGATGGTG ACGCTCACCA TTGATGGGCA GCAGATCAGC GTGCCGGAGG GCACCTCGAT CATGCGCGCT TCGATGGAAG CCGGCATCGA GGTGCCGAAG CTCTGCGCCA CCGACATGAT CGATGCCTTC GGTTCCTGCC GGCTCTGTCT CGTCGAGATC GAGGGCCGCA ACGGTACGCC TGCCTCCTGC ACGACGCCTG TGGCGGCCAA CATGGTGGTG CACACGCAGA CGGGACGGCT GAAGGATATT CGCCGCGGCG TGATGGAGCT TTATATTTCC GACCACCCGC TCGACTGTCT GACCTGTGCG GCCAATGGTG ATTGCGAATT GCAGGACATG GCGGGCGCCG TCGGCTTGCG CGACGTGCGC TATGGCTATG ACGGCGACAA TCATGTCAGG GCGCGCAACA ATGGCGAGAT CAATCTGAAA TGGACGCCGA AGGACGAGTC CAATCCCTAT TTCACCTTCG ATCCTTCCAA ATGCATCGTC TGTTCGCGCT GTGTGCGCGC CTGCGAGGAG GTGCAGGGCA CTTTCGCGCT GACGATCGAG GGGCGCGGTT TCGGCTCCAA GGTTTCCTCG GGTGCACATG AAGCCTTCAT CGATTCCGAA TGTGTCTCCT GCGGGGCCTG CGTCCAGGCC TGCCCGACGG CGACGCTGAC GGAAAAGTCG GTGATCGAGA TCGGCCAGCC TGAACATTCG GCCATCACCA CCTGCGCCTA TTGCGGCGTC GGCTGTTCCT TCAAGGCGGA AATGCGCGGC GAGGAGCTGG TGCGCATGGT GCCGTGGAAG GACGGCCAGG CCAATCGCGG CCATTCCTGC GTCAAGGGCC GCTTCGCTTA CGGCTATTCC ACCCATAAGG ACCGGATTCT CAATCCGATG ATCCGCGAAA AGATCAGCGA TCCCTGGCGG GAGGTAAGCT GGGACGAGGC CTTCGCGCAT GTAGCGTCGG AGTTTCGCCG CATCCAGTAT CAATATGGTC GCGACGCGGT CGGCGGCATC ACCTCGTCGC GCTGCACCAA TGAGGAAACC TACCTGGTGC AGAAGCTGGT CCGGGCCGGC TTCGGTAACA ACAATGTCGA TACCTGCGCC CGCGTCTGCC ATTCGCCGAC CGGCTACGGC CTCGGCCAGA CATTCGGCAC GTCGGCCGGC ACGCAGAATT TCGACAGCGT CGAGCAGTCC GATGTCGTCG TCATCATCGG CGCCAACCCG ACCGATGGGC ATCCGGTATT CGGCTCGCGG CTGAAGAAGC GGCTGCGCCA GGGCGCCAAG CTCATCGTCA TCGATCCGCG CCGCACCGAT ATCGTCCGGT CGCCGCATGT CGAGGCCTCC TATCACCTGC CGCTGAAGCC CGGCACCAAT GTCGCCGTCA TGACTGCGCT GGCGCATGTG ATCGTCACCG AAGGGCTCTA TGACGAGGCG TTCATCCGCG AGCGCTGCGA TTGGTCGGAA TTCGAGGATT GGGCCGCCTT CGTCGCCGAA CCGGCGCACA GCCCCGAACA GACAGAGATC TTCACCGGTG TGCCGGCGGC GGATCTGCGC GGCGCGGCAA GGCTCTATGC CAAGGGCGGC AACGGCGCGA TCTATTACGG CCTCGGCGTC ACCGAACACA GCCAGGGTTC GACCACGGTC ATCGCGATCG CCAATCTGGC GATGGCGACC GGCAATATCG GCCGTCCCGG TGTCGGCGTG AACCCGTTGC GCGGCCAGAA CAATGTGCAG GGCTCCTGCG ACATGGGCTC CTTCCCGCAC GAACTGCCGG GCTACCGGCA CATTTCCGAC GATGCGACGC GGGATATCTT CGAAAAGCTC TGGGGCGTGA AGATCAATAA CGAGCCGGGC CTGCGCATTC CGAACATGCT GGACGCGGCG GTCGACGGCT CCTTCAAGGG CCTCTACGTC CAAGGCGAGG ATATTCTCCA GTCCGATCCC GATACGAAAC ATGTCGCGGC CGGGCTTGCG GCGATGGAAT GCGTCGTCGT CCAGGACCTG TTCCTCAACG AGACCGCCAA TTACGCTCAT GTCTTCCTGC CGGGCTCGAC CTTCCTCGAG AAGGACGGCA CCTTCACCAA TGCCGAGCGC CGCATCAACC GGGTGCGCAA GGTGATGACG CCGCGCAACG GCTATGGCGA CTGGGAGGTG ACCCAGAAGC TTGCTCAGGC GATGGGGCTC GACTGGAATT ACGCCCATCC GTCGGAGATC ATGGACGAAA TTGCCGCTAC GACGCCGAGT TTCGCCCTGG TGTCCTACGA TTACCTCGAG AAGATGGGCT CGGTGCAGTG GCCCTGCAAC GAGAAGAACC CGCTCGGCTC GCCGATCATG CATGTCAATG GCTTCGTGCG CGGCAAGGGC AAGTTCATCC GCACCGAATA TGTGGCCACC GACGAGCGCA CCGGCCCACG CTTCCCGCTG CTGCTCACCA CCGGCCGCAT CCTCAGCCAG TACAATGTCG GGGCGCAGAC GCGGCGGACC GAGAATGTCG TCTGGCATGC GGAAGACCGG CTGGAAATCC ATCCGCATGA TGCCGAGCAG CGCGGCATTC GCGATGGCGA TTGGGTGAAG CTCGTCAGCC GCTCCGGCGA CACCACGCTG AGATCGCTGA TTACCGATCG TGTTGCACCG GGCGTCGTTT ATACGACCTT CCATCATCCC AATACGCAGG CGAACGTCAT CACCACCGAC TTTTCCGACT GGGCGACCAA TTGCCCTGAG TATAAGGTGA CGGCGGTGCA GGTTTCGCCC TCCAACGGGC CGAGCGACTG GCAGATGGAA TATGACGAGC AGGCACGGCA ATCGCGCCGC ATCGCGGGCA AGCTCGAGGC AGCGGAGTGA
|
Protein sequence | MSLIHEIDYG TPASTSEVMV TLTIDGQQIS VPEGTSIMRA SMEAGIEVPK LCATDMIDAF GSCRLCLVEI EGRNGTPASC TTPVAANMVV HTQTGRLKDI RRGVMELYIS DHPLDCLTCA ANGDCELQDM AGAVGLRDVR YGYDGDNHVR ARNNGEINLK WTPKDESNPY FTFDPSKCIV CSRCVRACEE VQGTFALTIE GRGFGSKVSS GAHEAFIDSE CVSCGACVQA CPTATLTEKS VIEIGQPEHS AITTCAYCGV GCSFKAEMRG EELVRMVPWK DGQANRGHSC VKGRFAYGYS THKDRILNPM IREKISDPWR EVSWDEAFAH VASEFRRIQY QYGRDAVGGI TSSRCTNEET YLVQKLVRAG FGNNNVDTCA RVCHSPTGYG LGQTFGTSAG TQNFDSVEQS DVVVIIGANP TDGHPVFGSR LKKRLRQGAK LIVIDPRRTD IVRSPHVEAS YHLPLKPGTN VAVMTALAHV IVTEGLYDEA FIRERCDWSE FEDWAAFVAE PAHSPEQTEI FTGVPAADLR GAARLYAKGG NGAIYYGLGV TEHSQGSTTV IAIANLAMAT GNIGRPGVGV NPLRGQNNVQ GSCDMGSFPH ELPGYRHISD DATRDIFEKL WGVKINNEPG LRIPNMLDAA VDGSFKGLYV QGEDILQSDP DTKHVAAGLA AMECVVVQDL FLNETANYAH VFLPGSTFLE KDGTFTNAER RINRVRKVMT PRNGYGDWEV TQKLAQAMGL DWNYAHPSEI MDEIAATTPS FALVSYDYLE KMGSVQWPCN EKNPLGSPIM HVNGFVRGKG KFIRTEYVAT DERTGPRFPL LLTTGRILSQ YNVGAQTRRT ENVVWHAEDR LEIHPHDAEQ RGIRDGDWVK LVSRSGDTTL RSLITDRVAP GVVYTTFHHP NTQANVITTD FSDWATNCPE YKVTAVQVSP SNGPSDWQME YDEQARQSRR IAGKLEAAE
|
| |