Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcenmc03_0990 |
Symbol | |
ID | 6122670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010508 |
Strand | - |
Start bp | 1094362 |
End bp | 1097313 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641637560 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001764289 |
Protein GI | 170732342 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.656443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTCG ACACGAACAA CGTCCGCCAA GGCGGCTGCG GCTCGGGCCA ATGCGCGTGC AAGAGCGCCG CCCAGGCGCG CGCATTCGAT CCGTTCGACG ACACCGACTA CGGTACGCCG CAACGGCATG CCGACACCGA CGTCACGCTC GAAATCGACG GCCAGCCGGT CACGGTGCCG GCCGGCACGT CGGTGATGCG CGCGGCGATC GAAGCCGGCG TCAATGTCCC GAAGCTCTGC GCGACCGATT CGCTCGAACC GTTCGGCTCG TGCCGGCTGT GCCTCGTCGA GATCGAGGGC CGACGCGGCT ATCCGGCGTC GTGCACGACG CCCGCCGAAG CCGGCATGAA GGTGCGCACG CAGTCGGACC GGCTGCAGTC GCTGCGCCGC AACGTGATGG AGCTGTACAT CTCCGACCAT CCGCTCGACT GCCTCACCTG CCCGGCCAAC GGCGACTGCG AGCTGCAGGA CATGGCCGGC GTGGTCGGCC TGCGCGAAGT GCGCTACGGC TTCGACGGCG CGAATCACCT GAAGGACACG AAGGACGAAT CGAACCCGTA CTTCACGTAC GACCCGTCGA AGTGCATCGT CTGCAACCGC TGCGTGCGCG CGTGCGAGGA AACGCAGGGC ACGTTCGCGC TGACGATCGC CGCGCGCGGC TTCGAATCGC GCGTCGCCGC CGGCGAAAGC GAATCGTTCA TGGCGTCGGA ATGCGTGTCG TGCGGCGCGT GCGTCGCCGC CTGCCCGACG GCCACGCTGC AGGAAAAATC CGTCGTGCAG CTCGGGCAGG CCGAGCACTC GGTCGTCACG ACCTGCGCAT ACTGCGGCGT CGGCTGCTCG TTCAAGGCCG AGATGAAGGG CACGCAGGTC GTGCGCATGA CGCCGCACAA GAACGGGCTC GCGAACGAAG GCCACGCGTG CGTGAAGGGA CGCTTCGCGT GGGGCTACGC GACGCACAAG GACCGCATCA CGAAGCCGAT GATCCGCGAG AAGATCACCG ACCCGTGGCG CGAAGTCAGC TGGGAAGAGG CGCTCACCTA CGCGGCGACG CAGTTCCGCA AGCTGCAGCA GAAGTACGGC CGCGATTCGA TCGGCGGCAT CACGTCGTCG CGCTGCACGA ACGAGGAAAC CTATCTCGTA CAGAAGCTCG TGCGCGCCGC GTTCGGCAAC AACAACGTCG ATACCTGCGC ACGCGTGTGC CACTCGCCGA CCGGCTATGG TCTGAAGACG ACGCTCGGCG AATCGGCCGG CACGCAGACG TTCGCCTCGA TCGGCCAGGC CGACGTGATC GTCGTGATGG GCGCGAACCC GACCGACGGC CATCCGGTGT TCGGCTCGCG GCTGAAGCGG CGCATACGCG AAGGCGCGAA GCTGATCGTG ATCGATCCGC GCCGCATCGA CGTCGTCGAC GGCCCGCACG TGAAGGCCAC GCATCATCTG CAGTTGCGCC CGGGCACCAA CGTCGCGATC GTCAATGCGC TGGCGCACGT GATCGTCACC GAAGGGCTGG TGGCCGATGC ATTCGTCGCC GAGCGCTGCG ACACGCGTGC GTTCGAGCAA TGGCGCGACT TCGTCGCGCA GGCCGACAAT TCGCCCGAGG CGACCGCCGA CGTGACGGGC GTGCCGGCCG AACTGGTGCG CGAGGCCGCG CGCCTCTACG CGACGGGCGG CAACGCGGCG ATCTATTACG GCCTGGGCGT GACCGAACAC GCACAGGGCT CGACCACGGT GATGGGCATC GCGAACCTTG CGATGGCGAC CGGCAACGTC GGCCGCGAAG GCGTCGGCGT CAATCCGCTG CGTGGCCAGA ACAACGTGCA GGGTTCGTGC GACATGGGTT CGTTCCCGCA CGAACTGCCC GGCTACCGGC ACATCAGCGA CACGGTCGTG CGCACGCAAT TCGAAGAAGC CTGGTCGGCC ACGCTCCAGC CGGAGCCCGG CCTGCGCATC CCGAACATGT TCGACGCGGC GCTCGACGGC AGCTTCAAGG GGCTCTACTG CCAGGGCGAG GACATCGTCC AGTCGGACCC GAACACGCAG CACGTCGCGG CCGCGCTGTC GGCGATGGAA TGCATCGTCG TGCAGGACAT CTTCCTGAAC GAAACCGCGA AATACGCGCA CGTGCTGCTG CCGGGCTCGA CGTTCCTCGA GAAGGACGGC ACGTTCACGA ACGCGGAGCG CCGCATCTCG CGCGTGCGCA AGGTGATGCC GCCGCTCGCG GGCTACGCGG ACTGGGAAGT GACGCTGCTG CTGTCGCAGG CGCTCGGCTA CGACATGCAC TACACGCATC CGTCGGAAAT CATGGACGAG ATCGCGCGGC TCACGCCGAC CTTCTCGGGC GTGTCGTACG CGAAGCTCGA CGCGCTCGGC AGCATCCAGT GGCCGTGCAA CGAACACGCG CCGGAAGGCA CGCCGACCAT GCACATCGAC ACGTTCGTGC GCGGCAAGGG CCGGTTCGTG ATCACCAAGT TCATTCCGAC GCCGGAGAAA GTCACGCAGC GCTATCCGCT GATCCTGACG ACGGGCCGCA TCCTGTCGCA ATACAACGTC GGCGCGCAGA CGCGCCGGAC CGAGAACGTC CGGTGGCACG AAGAGGATCG CCTCGAAATC CATCCGCACG ACGCGGAAGA TCGCGGAATC CGGACGGGCG ACTGGGTCGG CATCGAGTCG CGTGCCGGTC AGACAGTGTT GCGCGCGCTG GTGTCCGAGC GCATGCAGCC GGGCGTCGTC TACACGACGT TCCACTTCCC CGAATCGGGT GCGAACGTGA TCACGACGGA AAGCTCCGAC TGGGCGACGA ACTGCCCGGA ATACAAGGTG ACGGCCGTTC AGGTGATGCC GGTTGCGCAA CCGTCCGACT GGCAACAAGC GTATGCGCGC TTCAACTCGG AGCAGCTCGG TCTGCTCGAG CGCCGCGCCG CCGAGCCGGC CGCCGTGACG ACGGGCAAGT GA
|
Protein sequence | MSLDTNNVRQ GGCGSGQCAC KSAAQARAFD PFDDTDYGTP QRHADTDVTL EIDGQPVTVP AGTSVMRAAI EAGVNVPKLC ATDSLEPFGS CRLCLVEIEG RRGYPASCTT PAEAGMKVRT QSDRLQSLRR NVMELYISDH PLDCLTCPAN GDCELQDMAG VVGLREVRYG FDGANHLKDT KDESNPYFTY DPSKCIVCNR CVRACEETQG TFALTIAARG FESRVAAGES ESFMASECVS CGACVAACPT ATLQEKSVVQ LGQAEHSVVT TCAYCGVGCS FKAEMKGTQV VRMTPHKNGL ANEGHACVKG RFAWGYATHK DRITKPMIRE KITDPWREVS WEEALTYAAT QFRKLQQKYG RDSIGGITSS RCTNEETYLV QKLVRAAFGN NNVDTCARVC HSPTGYGLKT TLGESAGTQT FASIGQADVI VVMGANPTDG HPVFGSRLKR RIREGAKLIV IDPRRIDVVD GPHVKATHHL QLRPGTNVAI VNALAHVIVT EGLVADAFVA ERCDTRAFEQ WRDFVAQADN SPEATADVTG VPAELVREAA RLYATGGNAA IYYGLGVTEH AQGSTTVMGI ANLAMATGNV GREGVGVNPL RGQNNVQGSC DMGSFPHELP GYRHISDTVV RTQFEEAWSA TLQPEPGLRI PNMFDAALDG SFKGLYCQGE DIVQSDPNTQ HVAAALSAME CIVVQDIFLN ETAKYAHVLL PGSTFLEKDG TFTNAERRIS RVRKVMPPLA GYADWEVTLL LSQALGYDMH YTHPSEIMDE IARLTPTFSG VSYAKLDALG SIQWPCNEHA PEGTPTMHID TFVRGKGRFV ITKFIPTPEK VTQRYPLILT TGRILSQYNV GAQTRRTENV RWHEEDRLEI HPHDAEDRGI RTGDWVGIES RAGQTVLRAL VSERMQPGVV YTTFHFPESG ANVITTESSD WATNCPEYKV TAVQVMPVAQ PSDWQQAYAR FNSEQLGLLE RRAAEPAAVT TGK
|
| |