Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_4072 |
Symbol | |
ID | 7974394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 4313774 |
End bp | 4316647 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644794658 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002945951 |
Protein GI | 239817041 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCC CGTCCAAGCT CGCGGAGCTG ATGCCGCAAA CCATCGAATT CACCCTCGAC GGCCAAGCCA TCCAGGCCTT CGACGGCGAA ACCATCTACA AGGCGGCCGA GCGCCACGGC GTCGAGATTC CCCACCTGTG CTTCAAGGAC GGCTACCGCG CCGACGGCAA CTGCCGCGCC TGCGTGGTCG AGGTGAAGGG CGAGCGCACG CTCGCGCCCA GCTGCTGCCG CAACGTGACG GCCGGCATGG AAGTGAAGGC CACCAGCGAG CGCGCGCTCA AGAGCCAGAA GATGGTGGTC GAGATGCTGC TGTCCGACAT GCCCGACCAG GGCTACAAAT GGATCGGCGA CGACGCCACC CAGCAGCACG GCGAGCTCAG CGCCTGGGCG AAGAAGCTCG ACATCGCGGT GCGGCCCGAA CTCAAGGCGC TGCGCCGCGA GCAGCCCAAG GCCGACATCT CGCACCCCGC GATGGCCGTC AACCTCGATG CCTGCATCCA GTGCAACCGC TGCGTGCGCG CCTGCCGCGA AGAGCAGGTC AACGACGTCA TCGGCTACGC GCTGCGCGGC GGCGACAGCA AGATCGTGTT CGACCTGGAC GACCCGATGG GCGACAGCAC CTGCGTGGCC TGCGGCGAAT GCGTGCAGGC CTGCCCGACC GGCGCGCTCA TGCCCAAGAG CCACATCGGC TCGCAAGCGG TCGACCGCAA GGTCGATTCG GTGTGCCCGT TCTGCGGCGT GGGCTGCCTC GTCACCTACA ACGTCAAGGA CGAGAAGATC GTCAGCGTCG ACGGCCGCGA CGGCCCGGCC AACCACAACC GCCTGTGCGT GAAGGGCCGC TTCGGCTTCG ACTACGCGCA CCATCCGCAG CGCCTGACCA AGCCGCTGAT CCGAAAGGCC GGCATGCCGA AGGATTTCGG CGATGCACCG CGGCCGGACG ACTGGAGCGA GTATTTCCGC GAAGCCACCT GGGAAGAGGC GCTCGCGCTC ACCACCGGCA AGCTCTCGGG CCTGCGCGAC AGCCACGGGC CCAAGTCGCT CGCGGGCTTC GGCTCGGCCA AGGGCAGCAA CGAAGAGGCC TACCTGTTCC AGAAGCTCGT GCGCACGGGC TTCGGCAGCA ACAACATCGA CCACTGCACG CGGCTGTGCC ACGCCTCCAG CGTGGCCGCG CTGCTCGAAG GTGTGGGTTC GGGCGCGGTG AGCAACCAGG TCAACGACGT GGAGCATGCG GGGCTGATCT TCGTCATCGG CTCCAACCCC ACGGCCAACC ACCCGGTGGC CGCCACCTGG ATGAAAAACG CCGCCCAGCG CGGTGCCAAG ATCGTGCTGG CCGACCCGCG CCGCACCGAC ATCAGCCGCC ATGCCTGGCG CACGCTGCAG TTCAAGGCCG ACACCGACGT GGCCATGCTG AACGCGCTGA TCCATGCCGT GATCGACGAA GGCCTGGTCG ATCAGGAGTT CGTGCGCACG CGCGCCAGCA ACTACGAGGC GCTGCGCGAG AACGTCAAGG GCTACAGCCC CGAGGCGATG GCGCCGATCT GCGGCGTGCC GGCCGAAACG CTGCGCGAAG TGGCGCGCGC CTTTGCCACG GCCAAGGGCG CGATGATCCT CTGGGGCATG GGCGTGAGCC AGCACGTGCA CGGCACCGAC AACGCGCGCT GCCTCATTGC GCTGGCCACC GTCACCGGCC AGATCGGCAA GCCGGGCTCG GGCCTGCATC CGCTGCGCGG CCAGAACAAT GTGCAGGGCG CGAGCGACGC CGGCCTGATC CCGATGATGT TCCCCAACTA CCAGCGCGTC GACAATCCGG CGGTGCATGC GTGGTTCGAG GACTTCTGGG GCACGCCGCT CGATGCGACG CCGGGCTACA CCGTGGTCGA GATCATGCAC AAGGCGCTGG CGCCCGACAC CGATCCGCAC AAGGTGCGCG GCATGTACAT CATGGGCGAG AACCCGGCCA TGAGCGACCC CGACCTGAAC CATGCACGCC ATGCGCTCGC GAGCCTGGAG CACCTGGTGG TGCAGGACAT CTTCATGACC GAGACCGCGT GGCTCGCCGA CGTGGTGCTG CCCGCGAGCG CCTGGCCCGA GAAGACCGGC ACGGTCAGCA ACACCGACCG CATGGTGCAG CTGGGCAAGC GCGCGCTCAA CCCGCCGGGC GACGCGCGGC CCGATCTCTG GATCATCCAG CAGATCGCCA GCGGCATGGG CCTCGGGTGG AACTACGAGG GCGAGGAGTC CGGCGTGGCC GCGGTCTACG AGGAAATGCG CCAGGCCATG CATGCGGTGA TCAGCGGCAT CAGCTGGGAG CGGCTGCAGC GCGATTCGAG CGTGACCTAC CCTTGCCTCA GCGAGGAAGA TCCGGGCCAG CCCACGGTGT TCATCGACGA CTTCCCCACG GCCGACGGCC GCGTGAGGCT GGTGCCGGCC GACATCATTC CGGCGGACGA GCGGCCCGAT GCCGAGTACC CCTTCGTGCT CATCACCGGC CGCCAGCTCG AGCACTGGCA CACCGGCAGC ATGACGCGCC GCGCCACGGT GCTCGATGCG CTCGAGCCCA TGGCCACGGC CTCGATGAAC CAGGCCGACC TGCTGAAGCT CGGCCTCGAG GCCGGCGACG TGATCACCGT GCAGTCGCGC CGCGGCGAGG TGGCCATCCA CGTGCGGCGC GACGACGGCA CGCCCAGCGG CGCGGTGTTC GTTCCCTTCG CCTACTACGA GGCGGCGGCC AACCTGATGA CCAATGCCGC GCTCGATCCC ATGGGCAAGA TCCCGGAGTT CAAGTACTGC GCGGTGCGCA TTGCGCGCGG CGGCCAGCCG ATGGCGGCGG CCGGCTACGG CACCGGCTCC GGCGTGCTCG CGGCGGTGGA CTGA
|
Protein sequence | MNAPSKLAEL MPQTIEFTLD GQAIQAFDGE TIYKAAERHG VEIPHLCFKD GYRADGNCRA CVVEVKGERT LAPSCCRNVT AGMEVKATSE RALKSQKMVV EMLLSDMPDQ GYKWIGDDAT QQHGELSAWA KKLDIAVRPE LKALRREQPK ADISHPAMAV NLDACIQCNR CVRACREEQV NDVIGYALRG GDSKIVFDLD DPMGDSTCVA CGECVQACPT GALMPKSHIG SQAVDRKVDS VCPFCGVGCL VTYNVKDEKI VSVDGRDGPA NHNRLCVKGR FGFDYAHHPQ RLTKPLIRKA GMPKDFGDAP RPDDWSEYFR EATWEEALAL TTGKLSGLRD SHGPKSLAGF GSAKGSNEEA YLFQKLVRTG FGSNNIDHCT RLCHASSVAA LLEGVGSGAV SNQVNDVEHA GLIFVIGSNP TANHPVAATW MKNAAQRGAK IVLADPRRTD ISRHAWRTLQ FKADTDVAML NALIHAVIDE GLVDQEFVRT RASNYEALRE NVKGYSPEAM APICGVPAET LREVARAFAT AKGAMILWGM GVSQHVHGTD NARCLIALAT VTGQIGKPGS GLHPLRGQNN VQGASDAGLI PMMFPNYQRV DNPAVHAWFE DFWGTPLDAT PGYTVVEIMH KALAPDTDPH KVRGMYIMGE NPAMSDPDLN HARHALASLE HLVVQDIFMT ETAWLADVVL PASAWPEKTG TVSNTDRMVQ LGKRALNPPG DARPDLWIIQ QIASGMGLGW NYEGEESGVA AVYEEMRQAM HAVISGISWE RLQRDSSVTY PCLSEEDPGQ PTVFIDDFPT ADGRVRLVPA DIIPADERPD AEYPFVLITG RQLEHWHTGS MTRRATVLDA LEPMATASMN QADLLKLGLE AGDVITVQSR RGEVAIHVRR DDGTPSGAVF VPFAYYEAAA NLMTNAALDP MGKIPEFKYC AVRIARGGQP MAAAGYGTGS GVLAAVD
|
| |