Gene Information |
Locus tag | Saro_0735 |
Symbol | |
ID | 3918559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 777116 |
End bp | 779974 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443467 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_496016 |
Protein GI | 87198759 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
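The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch in Python, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes a single terminal stop codon. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, not counting one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(777116, 779974)
print(length)                  # 2859, matching the Gene Length row
print(protein_length(length))  # 952, matching the Protein Length row
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows.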
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTATG AACGCCAGGC CGATTTCGGC ACACCGGAAG TTCGCAGCGA GACCGAGGTC ACCTGCTTCA TTGACGGACG CGAGGTCACT GTCCCGGCGG GCACGACGGT CATGCGCGCG GCGGCGCTGA CCGGGGGCTC GATCCCCAAG CTCTGCGCGA CGGACAATCT CAAGCAATTC GGCTCGTGCC GCCTTTGCCT GGTCGAGATC GACGGGATGC GCGGGACGCC GGCAAGCTGC ACCACGCCGG TGGCATCGGG CATGAAGGTC CACACCCAGA CCCCGCGCCT CGAAAAGCTG CGGCGCGGGG TGATGGAGCT GTACATCTCC GACCACCCGC TCGACTGCCT GACCTGCAGC GCCAACAACG ATTGCGAACT GCAGGACCAG GCGGCAGCGG TGGGCTTGCG CGACGTGCGC TTCGGCTATG AGGGCGAGAA CCACCTTGGC GCCGCGACCG ACACCTCCAA CCCGTATTTC GACTTCGACC CTTCCAAGTG CATCGTGTGT TCGCGCTGCG TGCGCGCCTG CGACGAGGTG CAGGGGACAT TCGCGCTGAC GATCGACGGG CGCGGCTTTG CATCGAAGGT CTCGGCAGGG CTTGGTTCGG ACGACTATCT CTCGTCCGAA TGTGTAAGCT GCGGCGCCTG CGTCCAGGCT TGCCCGACTG CCACGTTGCA GGAAAAGGCG GTCAAGGAAA TCGGGAAGCC CGAGCGCGCG GTGGTCACGA CTTGCGCCTA TTGCGGCGTG GGCTGCACTT TCCGCGCCGA GATGCGCGGT GAGCAGCTCG TGCGCATGGT GCCGTGGAAG GACGGCAAGG CCAATCGCGG GCATTCCTGC GTCAAGGGTC GTTTCGCCTG GGGCTATGCC CAGCACCAGG AGCGCGTGCT TTCGCCGATG ATCCGCGAGT CCATCGACCA GCCCTGGCGC GAAGTTTCGT GGGAAGAGGC GCTTTCCCAT ACCGCGCGCG AGATCAACCG CATCCGCGAG ACCTACGGGC GCAACGCCCT GGGCGGGATA ACGTCGAGCC GCTGCACCAA CGAGGAGACG TTCCTCGTCC AGAAGCTGAT CCGCGCCGGT TTCGGCACCA ACAACGTCGA TACCTGCGCG CGCGTTTGCC ATTCGCCCAC CGGCTATGGC CTGAAGACGA CCTTCGGCAC ATCTGCGGGG ACGCAGGACT TCGACAGCGT CGAGGACACC GACGTGATCC TTGTGATCGG CGCGAACCCG ACCGACGGGC ATCCGGTCTT CGCCAGCCGC ATGAAGCGGC GGCTGCGCCA GGGTGCGAAG CTGATCGTGA TCGATCCGCG CCGGACCGAT CTGGTGCGTT CGCCGCATGT CGAGGCGGAC TTCCACCTGC CGCTGCGCCC GGGCACCAAT GTCGCCGTGC TGACCGCGAT GGCCCACGTT ATCGTGACCG AGGGACTTGC CGACGAGGCC TTCATCCGCG AGCGTTGCGA CTGGGACGAA TACCAGCATT GGGCCGAGTT CGTCTCGGCG GCACGCAACA GCCCCGAGTT CCTTTCGCCG GTGATCGGTG TAGGCGCCGA CGCCATTCGT GGCGCGGCGC GGCTTTATGC GACCGGCGGC AACGGTGCGA TCTACTATGG GCTGGGCGTG ACCGAGCACA GTCAGGGTTC CTCGACCGTC ATGGCCATCG CCAACCTGGC CATGGCGACC GGCAACATCG GTCGGCGCGG CGTGGGCGTG AACCCGTTGC GCGGTCAGAA CAACGTGCAG GGCGCCTGTG ACATGGGGTC GTTCCCGCAC GAGCTTTCGG GCTATCGCCA TGTCTCGGAC 
GCCGAGACGC GCGCCCTGTT CGAGGCAGAA TGGGGCGTGC CGATCGACCC CGAACCGGGC CTGCGCATCC CCAACATGCT CGATGCGGCG ACCGACGGGG TCTTCAAGGC GATCTACATC CAGGGCGAGG ACATTCTCCA GTCTGACCCC GACACCCGCC ACGTCGCGGC CGGCCTTGCC GCGATGGAAT GCGTGATCGT CCACGACTTG TTCCTGAACG AGACCGCCAA CTACGCGCAC GTCTTCCTGC CGGGCTCCAC CTTCCTCGAA AAGAACGGCA CGTTCACGAA CGCGGAGCGC CGTATCCAGC CGGTGCGCAA GGTGATGGAG CCGTTGAACG GGTATGAAGA CTGGCAGGTG ACGCAGGAGC TGGCCCGCGC GATTGGGCTT GACTGGAACT ATACCCACCC GTCCGAGATC ATGGACGAGA TCGCACGGCT GACGCCCACT TTCGCGGGCG TGAACTACAA GCGGCTCGAT GCCGAAGGCT CCCTGCAATG GCCCGTCAAT GACAAGGCGC CCGATGGCAG TCCGATCATG CACATCGACG GCTTCGTGCG CGGGCGCGGC AAGTTCGTGG TCACCGACTA CGTGCCGACC GATGAACGCA CCGGCCCGCG CTTCCCGTTG TTGCTGACCA CGGGGCGCAT CCTGTCGCAG TACAACGTCG GAGCGCAGAC CCGGCGCACC GCGAACACGG TGTGGCATCC CGAGGACGTG CTGGAAATGC ACCCGACCGA TGCCGAGAAC CGTGGCGTGA AAACCGGTGA CTGGGTGCGC CTTGCCAGCC GCTCGGGCGA GACGACCTTG CGTGCGCTGG TGACCGACCG CGTGGCACCC GGTGTGGTCT ATACGACGTT CCACCACCCT GCGACGCAGG CCAATGTCGT GACGACCGAC TATTCCGACT GGGCCACCAA TTGCCCGGAA TACAAGGTGA CCGCAGTGCA GGTCACGCCC AGCAACGGGC CGAGCGACTG GCAGGAGGAC TACGAGGCGC AAGCAGCCCG TTCGCGCCGC ATCGCGGGCC ACGAGGGCGC GATGGAGCCC GCCGAATGA
|
Protein sequence | MGYERQADFG TPEVRSETEV TCFIDGREVT VPAGTTVMRA AALTGGSIPK LCATDNLKQF GSCRLCLVEI DGMRGTPASC TTPVASGMKV HTQTPRLEKL RRGVMELYIS DHPLDCLTCS ANNDCELQDQ AAAVGLRDVR FGYEGENHLG AATDTSNPYF DFDPSKCIVC SRCVRACDEV QGTFALTIDG RGFASKVSAG LGSDDYLSSE CVSCGACVQA CPTATLQEKA VKEIGKPERA VVTTCAYCGV GCTFRAEMRG EQLVRMVPWK DGKANRGHSC VKGRFAWGYA QHQERVLSPM IRESIDQPWR EVSWEEALSH TAREINRIRE TYGRNALGGI TSSRCTNEET FLVQKLIRAG FGTNNVDTCA RVCHSPTGYG LKTTFGTSAG TQDFDSVEDT DVILVIGANP TDGHPVFASR MKRRLRQGAK LIVIDPRRTD LVRSPHVEAD FHLPLRPGTN VAVLTAMAHV IVTEGLADEA FIRERCDWDE YQHWAEFVSA ARNSPEFLSP VIGVGADAIR GAARLYATGG NGAIYYGLGV TEHSQGSSTV MAIANLAMAT GNIGRRGVGV NPLRGQNNVQ GACDMGSFPH ELSGYRHVSD AETRALFEAE WGVPIDPEPG LRIPNMLDAA TDGVFKAIYI QGEDILQSDP DTRHVAAGLA AMECVIVHDL FLNETANYAH VFLPGSTFLE KNGTFTNAER RIQPVRKVME PLNGYEDWQV TQELARAIGL DWNYTHPSEI MDEIARLTPT FAGVNYKRLD AEGSLQWPVN DKAPDGSPIM HIDGFVRGRG KFVVTDYVPT DERTGPRFPL LLTTGRILSQ YNVGAQTRRT ANTVWHPEDV LEMHPTDAEN RGVKTGDWVR LASRSGETTL RALVTDRVAP GVVYTTFHHP ATQANVVTTD YSDWATNCPE YKVTAVQVTP SNGPSDWQED YEAQAARSRR IAGHEGAMEP AE
|
| |
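The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any length or composition check. The sketch below, in Python, shows one way to do that and to compute a GC fraction for comparison with the GC content row. The helper names are hypothetical, and the sample string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used here only as a small sample.
sample = "ATGGGTTATG AACGCCAGGC"
seq = clean_sequence(sample)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Applied to the full gene sequence, `len(clean_sequence(...))` would be expected to match the Gene Length row, and the GC fraction could be compared with the reported 66%. Neither result is asserted here.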