Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A4144 |
Symbol | |
ID | 3749336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 1075985 |
End bp | 1078936 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637762428 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_368385 |
Protein GI | 78065616 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.110494 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTCG ACACGAACAA CGTCCGCCAA GGCGGCTGCG GCTCGGGCCA ATGCGCGTGC AAGAGCGCCG CGCAGGCGCG TGCCGGCAAC CCGTTCGACG ATACCGATTA CGGCACGCCC CAGCGGCATG CCGATACCGA TGTCACGCTC GAAATCGACG GCCAACCGGT CACGGTGCCG GCCGGCACGT CGGTGATGCG CGCGGCGATC GAAGCCGGCG TGAACGTCCC GAAGCTCTGC GCGACCGATT CGCTCGAACC GTTCGGCTCG TGCCGGCTGT GCCTCGTCGA GATCGAAGGC CGGCGCGGTT ATCCGGCGTC GTGCACAACA CCTGCCGAAG CGGGCATGAA GGTGCGCACG CAGTCGGACC GGCTGCAGTC GCTGCGTCGC AACGTGATGG AGCTGTACAT CTCCGACCAT CCGCTCGACT GCCTCACCTG CCCGGCCAAC GGCGACTGCG AGCTGCAGGA CATGGCGGGC GTCGTCGGGC TGCGCGAGGT GCGGTACGGC TTCGACGGCG CAAATCATCT GCGCGACAAA AAGGACGAGT CGAATCCGTA CTTCACGTAC GACCCGTCGA AGTGCATCGT CTGCAACCGC TGCGTGCGCG CCTGCGAGGA AACGCAGGGC ACGTTCGCGC TGACGATCGC CGCACGCGGC TTCGAATCGC GCGTCGCCGC AGGCGAAAGC GAATCGTTCA TGGCATCGGA ATGCGTGTCG TGCGGCGCAT GCGTTGCCGC ATGTCCGACG GCCACGCTGC AGGAAAAATC CGTCGTGCAA CTCGGGCAGG CCGAACACTC GGTCGTCACG ACCTGCGCGT ATTGCGGCGT GGGCTGCTCG TTCAAGGCGG AGATGAAGGG CACGCAGGTC GTGCGCATGA CGCCGCACAA GAACGGCCTC GCGAACGAGG GCCACGCGTG CGTGAAGGGC CGCTTCGCGT GGGGCTATGC GACGCACAAA GACCGCATCA CGAAGCCGAT GATCCGCGAG AAGATCACCG ACCCGTGGCG CGAAGTCAGC TGGGACGAAG CGCTCACCTA CGCGGCCACG CAATTCCGCA AGCTGCAGGA CAAGTACGGC CGCGATTCCA TCGGCGGCAT CACGTCGTCG CGCTGCACGA ACGAGGAAAC CTACCTCGTG CAGAAGCTGG TGCGCGCCGC GTTCGGCAAC AACAACGTCG ACACCTGCGC ACGCGTGTGC CACTCGCCGA CGGGCTATGG CCTCAAGACG ACGCTCGGCG AATCGGCCGG CACGCAGACA TTCGCGTCGG TCGGCCAGGC CGACGTGATC GTCGTGATGG GTGCGAACCC GACCGACGGC CACCCGGTGT TCGGCTCACG GCTGAAACGG CGCGTACGTG AAGGCGCAAA ACTGATCGTG ATCGACCCGC GCCGCATCGA CGTCGTCGAC GGCCCGCACG TGAAAGCCAC CCACCACCTC CAGTTGCGCC CCGGCACCAA CGTCGCGATC GTCAACGCGC TCGCGCACGT GATCGTCACC GAAGGGCTCG TCGCCGACGC ATTCGTCGCC GAGCGCTGCG AGACGCGCGC ATTCGAGCAA TGGCGCGACT TCGTGTCGCG TGCCGACAAC TCGCCCGAGG CGACCGCCGG CGTGACGGGC GTGCCGGCCG AGTCGGTACG CGAAGCCGCG CGCCTCTACG CGACGGGCGG CAATGCCGCG ATCTACTACG GGCTGGGCGT GACCGAACAC GCGCAGGGCT CGACGACGGT GATGGGCATC GCGAACCTCG CGATGGCGAC CGGCAACATT GGCCGCGAAG GCGTCGGCGT CAATCCGCTG CGCGGCCAGA ACAACGTGCA GGGCTCGTGC GACATGGGCT CGTTCCCGCA CGAACTGCCC GGCTACCGCC ACATCGGCGA CGAGGTCGTG CGCGCGCAGT TCGAAGCTGC ATGGTCGGCG AAACTGCAAC CGGAACCGGG GCTGCGCATC CCGAACATGT TCGATGCGGC GCTCGACGGC AGCTTCAAGG GGCTCTACTG CCAGGGCGAG GACATCGTCC AGTCGGACCC GAATACGCAG CACGTCGCGG CCGCGCTGTC GGAAATGGAA TGCATCGTCG TGCAGGACAT CTTCCTGAAC GAGACCGCGA AATACGCGCA CGTGCTGCTG CCCGGCTCGT CGTTCCTCGA GAAGGACGGC ACGTTCACGA ACGCGGAACG CCGCATCTCA CGCGTGCGCA AGGTGATGCC GCCGCTCGCG GGCTACGCGG ACTGGGAAGT CACGCTGATG CTGTCGCGTG CGCTCGGCTA CGAGATGGAC TATGCGCATC CGTCGGAAAT CATGGACGAG ATCGCGCGAC TCACGCCGAC CTTCTCGGGT GTGTCGTACG CGAAGCTCGA CACGCTCGGC AGCATTCAGT GGCCGTGCAA CGAGCACGCG CCGGAAGGCA CGCCGACGAT GCATATCGAC GCATTCGTGC GCGGCAAGGG CAAGTTCGTG ATCACGCAGT TCATCGCGTC GCCGGAGAAG GTCACGCAAC GCTATCCGCT GATCCTGACG ACGGGTCGCA TCCTGTCTCA GTACAACGTC GGCGCGCAGA CCCGCCGCAC CGAGAACGTG CAGTGGCACG AAGAGGATCG CCTCGAGATT CATCCGCACG ACGCGCAGGA TCGCGGTATC CGCAGCGGCG ACTGGGTGGG CATCGAATCG CGTGCGGGGC AGACGGTGTT GCGCGCGCTC GTGACCGAAC GCATGCAGCC GGGCGTCGTC TATACGACGT TCCACTTCCC CGAATCGGGC GCGAACGTGA TCACGACGGA CAGCTCGGAC TGGGCGACGA ATTGCCCCGA GTACAAGGTG ACGGCCGTGC AGGTGCTGCC CGTCGCGCAG CCGTCCGACT GGCAGCAAGC GTATGCGCGC TTCAACGCGG AGCAGCTCGA CCTGCTCGAG CGCCGCGCCG CCGCGACCGC CACCGTGACG ACAGGCAAGT GA
|
Protein sequence | MSLDTNNVRQ GGCGSGQCAC KSAAQARAGN PFDDTDYGTP QRHADTDVTL EIDGQPVTVP AGTSVMRAAI EAGVNVPKLC ATDSLEPFGS CRLCLVEIEG RRGYPASCTT PAEAGMKVRT QSDRLQSLRR NVMELYISDH PLDCLTCPAN GDCELQDMAG VVGLREVRYG FDGANHLRDK KDESNPYFTY DPSKCIVCNR CVRACEETQG TFALTIAARG FESRVAAGES ESFMASECVS CGACVAACPT ATLQEKSVVQ LGQAEHSVVT TCAYCGVGCS FKAEMKGTQV VRMTPHKNGL ANEGHACVKG RFAWGYATHK DRITKPMIRE KITDPWREVS WDEALTYAAT QFRKLQDKYG RDSIGGITSS RCTNEETYLV QKLVRAAFGN NNVDTCARVC HSPTGYGLKT TLGESAGTQT FASVGQADVI VVMGANPTDG HPVFGSRLKR RVREGAKLIV IDPRRIDVVD GPHVKATHHL QLRPGTNVAI VNALAHVIVT EGLVADAFVA ERCETRAFEQ WRDFVSRADN SPEATAGVTG VPAESVREAA RLYATGGNAA IYYGLGVTEH AQGSTTVMGI ANLAMATGNI GREGVGVNPL RGQNNVQGSC DMGSFPHELP GYRHIGDEVV RAQFEAAWSA KLQPEPGLRI PNMFDAALDG SFKGLYCQGE DIVQSDPNTQ HVAAALSEME CIVVQDIFLN ETAKYAHVLL PGSSFLEKDG TFTNAERRIS RVRKVMPPLA GYADWEVTLM LSRALGYEMD YAHPSEIMDE IARLTPTFSG VSYAKLDTLG SIQWPCNEHA PEGTPTMHID AFVRGKGKFV ITQFIASPEK VTQRYPLILT TGRILSQYNV GAQTRRTENV QWHEEDRLEI HPHDAQDRGI RSGDWVGIES RAGQTVLRAL VTERMQPGVV YTTFHFPESG ANVITTDSSD WATNCPEYKV TAVQVLPVAQ PSDWQQAYAR FNAEQLDLLE RRAAATATVT TGK
|
| |