Gene Information |
Locus tag | Smed_2897 |
Symbol | |
ID | 5323774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3038395 |
End bp | 3041274 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640791849 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001328562 |
Protein GI | 150398095 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
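The coordinate and length fields above are consistent with one another, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3038395
end_bp = 3041274

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2880
print(protein_length)  # 959
```

Both results match the Gene Length (2880 bp) and Protein Length (959 aa) fields listed in the record.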
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.30865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTCTCA TTCATGAAAT CGACTACGGC ACTCCTGCTT CCAAATCCGA AAAGCTGGTG ACGCTGACGA TCGACGGACG CGAGATCACC GTGCCGGAAG GCACCTCGAT CATGCGCGCG GCAATGGAAG CGGGCATCGA GGTGCCGAAG CTTTGCGCTT CCGACATGAT GGATGCCTTC GGCTCCTGCC GGCTCTGTCT CGTCGAAATC GATGGGCGCG CTGGGATGCC GGCCTCCTGC ACGACGCCGG TTTCGGCAGG CATCAGCGTT TCGACGCAGA CACAGCGGCT GAAGGATGTC CGCCGCGGCG TCATGGAACT CTATATTTCC GACCATCCGC TCGACTGCCT TACCTGCGCA GCCAACGGCG ATTGCGAGCT GCAGGATATG GCTGGCGCCG TGGGCCTGCG CGATGTGCGC TACGGCTATG ACGGCGAGAA CCATGTGACG GCGCGCAACA ATGGCGAAAT CAACGCCAAA TGGATGCCGA AGGACGAATC GAACCCCTAT TTCACCTATG ATCCGGCGAA GTGCATCGTC TGCTCGCGCT GCGTGCGGGC GTGCGAGGAA GTGCAGGGAA CTTTCGCGCT GACGATCAGC GGCCGTGGCT TCGATTCACG CGTATCGGCC GGCATGAACG AGGACTTCGT CTCCTCCGAA TGCGTTTCCT GCGGCGCCTG CGTTCAGGCC TGCCCGACCG CGACGCTGAC GGAAAAATCG GTGATAGAAA TCGGCCAGCC CGAGCATTCT GTCGTCACCA CCTGCGCCTA TTGCGGCGTC GGCTGCTCCT TCAAGGCGGA GATGCGCGGC GAGGAGCTGG TGCGCATGGT GCCATGGAAG GACGGGCAGG CGAACCGCGG CCATTCCTGC GTAAAGGGCC GCTTTGCCTA TGGTTATTCC AACCACAAGG ACCGCATTCT CAATCCGATG ATCCGCGAGA AGGTCACCGA TGCCTGGCGC GAGGTCACCT GGGAGGAAGC TTTCGCCCAT GTCGCCTCCG AGTTCCGCCG GATCCAGTAC CAGTACGGCC GTGATTCCGT CGGCGGCATC ACGTCTTCGC GCTGCACCAA TGAGGAGACC TTTCTGGTGC AGAAGCTGGT GCGCGCCGGT TTCGGCAACA ACAATGTCGA CACCTGCGCC CGCGTCTGCC ATTCGCCGAC CGGCTACGGC CTCAACCAGA CCTTCGGCAC CTCCGCCGGC ACGCAGGATT TCGACAGCGT GGAGCACACG GATGTTGCGG TCATCATCGG TGCCAACCCG ACCGACGGTC ATCCGGTCTT CGCCTCGCGG CTGAAGAAGC GGCTGCGCCA GGGCGCCAAG CTGATCGTCA TCGATCCGCG CCGGATCGAC CTCGTCCGCT CGGCCCATGT CGAAGCGTCC TACCATCTGC CGCTGAAGCC AGGTACCAAC GTCGCCATCC TGACCGCGCT GGCGCATGTC ATCGTCACCG AGGGTATTGG TAACGAAGCC TTCATCCGCG AGCGCTGCGA CTGGTCGGAA TTCGAGGACT GGGCAGCCTT CGTCGCCGAG CCGCATCACA GCCCGGAAGC GACCGCGGCC TATACCGGCG TTCGGGCCGA TCTGGTGCGC GGAGCCGCGC GGCTTTACGC GACCGGCGGC AATGGCGCGA TCTATTACGG CCTTGGCGTC ACCGAGCACA GCCAGGGCTC GACGACAGTG ATGGCGATCG CGAACCTCGC CATGCTCACC GGCAACATCG GCCGGCCGGG CGTTGGCGTC AATCCGCTGC GCGGCCAGAA CAACGTCCAG GGCTCCTGCG ACATGGGCTC CTTCCCACAT 
GAGCTTCCCG GCTACCGGCA TATTTCGGAC GATGCAACGC GCGAGATCTT CGAAAAGCTC TGGGGCGTGA AGCTCAACCA CGAGCCGGGA CTGCGCATTC CCAACATGCT CGACGCCGCC GTCGAGGGCA CCTTCAAGGG CCTGTACGTC CAGGGGGAGG ACATTCTGCA GTCGGACCCC GACACGAAGC ACGTCGCGGC CGGCCTTGCC GCCATGGAGT GCGTCGTGGT GCACGACCTC TTCCTCAACG AGACGGCGAA CTACGCTCAT GTCTTCCTGC CGGGCTCGAC CTTCTTCGAA AAGGACGGAA CCTTCACCAA TGCCGAGCGC CGCATCAACC GCGTCCGGCG CGTCATGCGG CCGAAGAACG GCTATGCCGA TTGGGAGGTG ACGCAGAAGA TGGCGCAAGC CATGGGGCTT GCCTGGAATT ACCGTCATCC GTCCGAGATC ATGGACGAGA TCGCCGCTAC GACGCCGAGC TTTGCCATGG TCTCCTACGA CTATCTGGAC AAGATGGGCT CGGTGCAGTG GCCGTGCAAC GAAAAGGCGC CGCTCGGCTC GCCGATCATG CATGTGGACG GTTTCGTACG CGGCAAGGGC AAGTTCATCC GCACCGAATA TGTGGCGACC GACGAGAGAA CCGGCCCCCG CTTCCCGCTG CTTCTGACGA CCGGCCGTAT TCTCAGCCAG TACAATGTCG GTGCCCAGAC GCGGCGCACG GAGAACGTCG TCTGGCATGC GGAAGACCGG CTGGAAATCC ATCCGCACGA CGCCGAGCAG CGCGGCATTC GCGACGGCGA CTGGGTCAGG CTCGCCAGCC GCTCGGGCGA CACGACGCTC CGCGCGCTGA TCACCGACCG AGTCGCACCT GGCGTCGTCT ATACGACCTT CCATCACCCC TCGACGCAGG CGAACGTGAT CACCACCGAC TTTACCGACT GGGCGACCAA CTGCCCGGAA TACAAGGTGA CGGCGGTGCA GGTCTCGCCG TCGAACGGCC CCTCCGACTG GCAGCGTGAT TATGATGAGC AGGCGCGCCA GTCGCGTCGC ATCGCCGGCA AGCTGGAGGC GGCGGAATAG
Protein sequence | MSLIHEIDYG TPASKSEKLV TLTIDGREIT VPEGTSIMRA AMEAGIEVPK LCASDMMDAF GSCRLCLVEI DGRAGMPASC TTPVSAGISV STQTQRLKDV RRGVMELYIS DHPLDCLTCA ANGDCELQDM AGAVGLRDVR YGYDGENHVT ARNNGEINAK WMPKDESNPY FTYDPAKCIV CSRCVRACEE VQGTFALTIS GRGFDSRVSA GMNEDFVSSE CVSCGACVQA CPTATLTEKS VIEIGQPEHS VVTTCAYCGV GCSFKAEMRG EELVRMVPWK DGQANRGHSC VKGRFAYGYS NHKDRILNPM IREKVTDAWR EVTWEEAFAH VASEFRRIQY QYGRDSVGGI TSSRCTNEET FLVQKLVRAG FGNNNVDTCA RVCHSPTGYG LNQTFGTSAG TQDFDSVEHT DVAVIIGANP TDGHPVFASR LKKRLRQGAK LIVIDPRRID LVRSAHVEAS YHLPLKPGTN VAILTALAHV IVTEGIGNEA FIRERCDWSE FEDWAAFVAE PHHSPEATAA YTGVRADLVR GAARLYATGG NGAIYYGLGV TEHSQGSTTV MAIANLAMLT GNIGRPGVGV NPLRGQNNVQ GSCDMGSFPH ELPGYRHISD DATREIFEKL WGVKLNHEPG LRIPNMLDAA VEGTFKGLYV QGEDILQSDP DTKHVAAGLA AMECVVVHDL FLNETANYAH VFLPGSTFFE KDGTFTNAER RINRVRRVMR PKNGYADWEV TQKMAQAMGL AWNYRHPSEI MDEIAATTPS FAMVSYDYLD KMGSVQWPCN EKAPLGSPIM HVDGFVRGKG KFIRTEYVAT DERTGPRFPL LLTTGRILSQ YNVGAQTRRT ENVVWHAEDR LEIHPHDAEQ RGIRDGDWVR LASRSGDTTL RALITDRVAP GVVYTTFHHP STQANVITTD FTDWATNCPE YKVTAVQVSP SNGPSDWQRD YDEQARQSRR IAGKLEAAE
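As an illustrative check of how the gene sequence relates to the protein sequence, the sketch below translates the first 30 bases and the last 9 bases of the gene sequence. It assumes the sequence is shown 5' to 3' in the coding orientation, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for all sense and stop codons. The function name `translate` and the way the codon table is built are assumptions for illustration, not the method used to produce this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs from the
# standard code only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    return "".join(
        CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3)
    )


# First three 10-base blocks and last nine bases of the gene sequence above.
first_30 = "ATGTCTCTCA TTCATGAAAT CGACTACGGC"
last_9 = "GCGGAATAG"

print(translate(first_30))  # MSLIHEIDYG
print(translate(last_9))    # AE*
```

The first output matches the first block of the protein sequence, and the second matches its final two residues followed by the stop codon.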