Gene Information |
Locus tag | BamMC406_0907 |
Symbol | |
ID | 6176044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010551 |
Strand | - |
Start bp | 997647 |
End bp | 1000598 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641680657 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001807614 |
Protein GI | 172059962 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
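
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record for BamMC406_0907.
length_bp = gene_length(997647, 1000598)
print(length_bp)                  # 2952
print(protein_length(length_bp))  # 983
```

Both results agree with the "Gene Length" and "Protein Length" rows. The strand field does not affect this calculation, since the length depends only on the span of the coordinates.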
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.979482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00925571 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCCCTCG ACACGAACAA CGTCCGCCAA GGCGGCTGCG GCTCAGGCCA ATGCGCGTGC AAGAGCGCCG CCCAGGCGCG CGCCTTCGAT CCGTTCGACG ACACCGACTA CGGCACGCCG CAACGGCATG CCGACACCGA CGTCACGCTC GAAATCGACG GCCAGCCGGT GACGGTGCCG GCCGGCACGT CGGTGATGCG CGCCGCGATC GAAGCCGGCG TCAACGTGCC CAAGCTCTGC GCGACCGATT CGCTCGAACC GTTCGGCTCG TGCCGTCTGT GCCTCGTCGA AATCGAAGGC CGGCGCGGCT ATCCGGCATC GTGCACGACG CCCGCCGAAG CCGGGATGAA GGTGCGCACG CAGTCGGACC GGCTGCAGTC GCTGCGCCGC AACGTGATGG AGCTGTACAT CTCCGACCAC CCGCTCGACT GCCTCACCTG CCCCGCCAAC GGCGACTGCG AGCTGCAGGA CATGGCGGGC GTCGTCGGCT TGCGCGAAGT GCGCTACGGC TTCGACGGCG CGAATCATCT GCGCGAGCAG AAGGACGAAT CGAATCCGTA CTTCGCGTAC GATCCGTCGA AGTGCATCGT CTGCAACCGC TGCGTGCGCG CGTGCGAGGA AACCCAAGGC ACGTTCGCGC TGACGATCGC CGCGCGCGGC TTCGAATCGC GCGTCGCCGC CGGCGAAAGT GAATCGTTCA TGGCGTCGGA ATGCGTATCG TGCGGCGCGT GCGTTGCCGC CTGCCCGACC GCCACGCTGC AGGAAAAATC CGTCGTCCGG CTCGGGCAGG CCGAGCACTC GGTCGTCACG ACCTGCGCGT ACTGCGGCGT CGGCTGCTCG TTCAAGGCCG AGATGAAGGG CACGCAGGTC GTGCGCATGA CGCCGCACAA GAACGGTCTC GCGAACGAAG GCCACGCGTG CGTGAAGGGG CGCTTCGCGT GGGGCTACGC GACCCACAAG GACCGCATCA CGAAGCCGAT GATCCGCGAG AAGATCACCG ACCCGTGGCG CGAAGTCAGC TGGGACGAGG CGCTCACCTA CGCGGCGACG CAATTCCGCA AGCTGCAGCA GAAGTACGGC CGCGATTCGA TCGGCGGCAT CACGTCGTCG CGCTGCACGA ACGAGGAAAC CTACCTCGTG CAGAAGCTCG TGCGCGCCGC ATTCGGCAAC AACAACGTCG ACACCTGTGC GCGCGTGTGC CATTCGCCGA CCGGCTATGG CCTGAAGGCG ACGCTCGGCG AATCGGCCGG CACGCAGACC TTCGCATCGG TCGACCACGC CGATGTGATC GTCGTGATGG GTGCGAACCC GACCGACGGC CACCCGGTGT TCGGCTCGCG CCTGAAGCGG CGCATCCGCG AAGGCGCGAA GCTGATCGTG ATCGATCCGC GCCGCATCGA CGTGGTCGAC GGCCCGCACG TGAAGGCCGC GCACCACCTG CAGCTGCGCC CCGGCACCAA CGTCGCGATC GTCAATGCGC TCGCGCACGT GATCGTCACC GAAGGGCTCG TCGCCGACGC GTTCGTCGCC GAGCGCTGCG AAGCGCGCGC ATTCGAGCAA TGGCGCGACT TCGTGTCGCG TGCCGAGAAC TCGCCCGAAG CGACCGCCGA TGCGACGGGC GTGCCGGCCG AAGTGGTGCG CGAAGCCGCG CGGCTCTATG CGACCGGCGG CAACGCCGCG ATCTACTACG GCCTCGGCGT GACCGAACAC GCGCAAGGCT CGACGACGGT GATGGGCATC GCGAACCTCG CGATGGCGAC CGGCAACATC GGCCGCGAAG GCGTCGGCGT CAATCCGCTG 
CGCGGCCAGA ACAACGTGCA GGGCTCGTGC GACATGGGCT CGTTCCCGCA TGAACTGCCA GGCTACCGCC ACATCGGCGA CGACAGCGTG CGCGCGCTGT TCGAACAGGC ATGGTCGGCC ACGCTGCAAC CGGAACCGGG GCTGCGCATC CCGAACATGT TCGACGCGGC GCTCGACGGC AGCTTCAAGG GGCTCTACTG CCAGGGCGAG GACATCGTCC AGTCGGACCC GAACACGCAG CACGTCGCCG CCGCGCTGTC GTCGATGGAA TGCGTCGTCG TGCAGGACAT CTTCCTCAAC GAAACCGCGA AATACGCGCA CGTGCTGCTG CCGGGCTCGA CGTTCCTCGA GAAGGATGGC ACGTTCACGA ACGCGGAACG CCGCATCTCG CGCGTGCGCA AGGTGATGCC GCCGCTCGCG GGCTACGCGG ACTGGGAAGT GACGCTGCTG CTGTCGCAGG CGCTCGGCTA TGACATGCAT TACACGCATC CGTCGGAAAT CATGGACGAG ATCGCACGGC TCACGCCGAC CTTCTCGGGC GTGTCGTACG CGAAGCTCGA CGCGCTCGGC AGTATCCAGT GGCCGTGCAA CGAACAGGCA CCGGACGGCA CGCCGACGAT GCACATCGAC GCGTTCGTGC GCGGCAAGGG CCGCTTTGTG ATCACGCAGT ACATTCCGAC GCCGGAAAAG GTCACGCAGC GCTATCCGCT GATCCTGACC ACGGGCCGCA TCCTGTCGCA ATACAACGTC GGTGCGCAGA CGCGCCGGAC CGAGAACGTC CAGTGGCACG AAGAGGACCG CCTCGAGATC CACCCGCACG ACGCGGAGGA TCGCGGGATC CGGACCGGCG ACTGGGTCGG CATCGAATCG CGGGCGGGCC AGACGGTGTT GCGCGCGCTC GTGTCCGATC GCATGCAGCC GGGCGTCGTG TACACGACGT TCCACTTTCC CGAATCGGGC GCGAACGTGA TCACGACGGA CAGCTCCGAC TGGGCGACGA ACTGCCCGGA ATACAAGGTC ACGGCCGTGC AGGTGATGCC GGTCGCGCAG CCGTCCGACT GGCAGCGTGC ATACGCGCGC TTCAACTCGG AGCAGCTCGA CCTGCTCGAG CGTCGCGCAG CCGCGCCGGC CACCGTGACG ACGGGCAAGT GA
|
Protein sequence | MSLDTNNVRQ GGCGSGQCAC KSAAQARAFD PFDDTDYGTP QRHADTDVTL EIDGQPVTVP AGTSVMRAAI EAGVNVPKLC ATDSLEPFGS CRLCLVEIEG RRGYPASCTT PAEAGMKVRT QSDRLQSLRR NVMELYISDH PLDCLTCPAN GDCELQDMAG VVGLREVRYG FDGANHLREQ KDESNPYFAY DPSKCIVCNR CVRACEETQG TFALTIAARG FESRVAAGES ESFMASECVS CGACVAACPT ATLQEKSVVR LGQAEHSVVT TCAYCGVGCS FKAEMKGTQV VRMTPHKNGL ANEGHACVKG RFAWGYATHK DRITKPMIRE KITDPWREVS WDEALTYAAT QFRKLQQKYG RDSIGGITSS RCTNEETYLV QKLVRAAFGN NNVDTCARVC HSPTGYGLKA TLGESAGTQT FASVDHADVI VVMGANPTDG HPVFGSRLKR RIREGAKLIV IDPRRIDVVD GPHVKAAHHL QLRPGTNVAI VNALAHVIVT EGLVADAFVA ERCEARAFEQ WRDFVSRAEN SPEATADATG VPAEVVREAA RLYATGGNAA IYYGLGVTEH AQGSTTVMGI ANLAMATGNI GREGVGVNPL RGQNNVQGSC DMGSFPHELP GYRHIGDDSV RALFEQAWSA TLQPEPGLRI PNMFDAALDG SFKGLYCQGE DIVQSDPNTQ HVAAALSSME CVVVQDIFLN ETAKYAHVLL PGSTFLEKDG TFTNAERRIS RVRKVMPPLA GYADWEVTLL LSQALGYDMH YTHPSEIMDE IARLTPTFSG VSYAKLDALG SIQWPCNEQA PDGTPTMHID AFVRGKGRFV ITQYIPTPEK VTQRYPLILT TGRILSQYNV GAQTRRTENV QWHEEDRLEI HPHDAEDRGI RTGDWVGIES RAGQTVLRAL VSDRMQPGVV YTTFHFPESG ANVITTDSSD WATNCPEYKV TAVQVMPVAQ PSDWQRAYAR FNSEQLDLLE RRAAAPATVT TGK
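
As a rough illustration of how the two sequence rows relate, the sketch below parses the space-separated 10-character blocks, computes a GC fraction, and translates codons. The amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons), so a single 64-codon lookup is used. Only the first 30 bases of the gene sequence are used here to keep the example short. The helper names are hypothetical. The sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation already and no reverse complement is applied.

```python
from itertools import product

# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three display blocks of the gene sequence from the record.
prefix = clean("ATGTCCCTCG ACACGAACAA CGTCCGCCAA")
print(translate(prefix))  # MSLDTNNVRQ
```

The translated prefix matches the first block of the protein sequence row. Applying `gc_fraction` to the full cleaned gene sequence should give a value close to the 67% reported in the "GC content" row.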