Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1622 |
Symbol | |
ID | 3847458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 1827042 |
End bp | 1829996 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637841292 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_442160 |
Protein GI | 83718922 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.437101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTCTT CCGCATTCGA TTCGTCCCGG CAAGGCTGCG GCTCCGGCCA GTGCGCGTGC AAGAGCGCCG CGCACGCGCG TCGCGCCGAT CCGTTCGACG ACACCGACTA CGGCACCCCG CAGCGCCACG CCGACGCCGA CGTCACGCTC GACATCGACG GCCGCGAGGT CACGGTGCCG GCCGGCACGT CGGTGATGCG CGCGGCGATC GAGGCCGGCA TCAACGTGCC GAAGCTCTGC GCGACCGATT CGCTCGAGCC GTTCGGCTCA TGCCGCCTGT GCCTCGTCGA GATCGAGGGC CGGCGCGGCT ACCCGGCGTC GTGCACGACG CCCGTCGAGG CCGGCATGAA GGTGCGCACG CAAAGCGACC GGCTGCAGGA TCTGCGCCGC AACGTGATGG AGCTCTACAT CTCCGATCAT CCGCTCGACT GCCTCACATG CGCGGCGAAC GGCGATTGCG AGCTTCAGGA CATGGCAGGC GCGGTCGGGC TGCGCGAGGT GCGCTACGGC TTCGACGGCA AGAACCATCT GAGCGACGCG AAGGACGAAT CGAACCCTTA CTTCAGCTAC GACCCGGCGA AGTGCATCGT CTGCAATCGC TGCGTGCGCG CGTGCGAGGA GACGCAGGGC ACGTTCGCGC TGACGATCGC CGCGCGCGGC TTCGACTCGC GCGTCGCGGC GAGCGCGGGC GACGCGTTCA TGGATTCGGA GTGCGTGTCG TGCGGCGCAT GCGTCGCCGC GTGCCCGACC GCGACGCTCA TCGAAAAGAG CGTCGCGCGG CTCGGCCAGC CCGAGCACGA GGTCGTCACG ACCTGCGCGT ACTGCGGCGT CGGCTGTTCG CTGAAGGCAG AGATGAAAGG CGAGCAGGTC GTGCGGATGA CGCCGCACAA GAACGGCCAG GCGAACGAAG GCCATGCGTG CGTGAAAGGC CGCTTTGCGT GGGGCTATGC AACGCACAAG GATCGCATCA CGAAGCCGAT GATCCGCGAG AAGATCACCG ATCCGTGGCG CGAAGTGAGC TGGGACGAAG CGATCGGCCA TGCGGCCGCG CAGTTCCGCC GCATCCAGGA CAAGCACGGC CGCGATTCGA TCGGCGGCAT CACGTCGTCG CGCTGCACGA ACGAAGAGAC GTACCTCGTG CAGAAGCTCG TGCGCGCGGC GTTCGGCAAC AACAACGTCG ATACCTGCGC GCGCGTCTGC CATTCGCCGA CGGGCTACGG CCTGAAGGTC ACGCTCGGCG AATCGGCGGG CACGCAGACG TTCGCGTCGG TCGGCTCGGC CGACGTGATC GTCGTGATCG GCGCGAATCC GACCGACGGC CACCCGGTGT TCGGCTCGCG CCTGAAGCGC CGCGTGCGCG AGGGCGCGAA ACTGATCGTC GCCGATCCGC GCCGGATCGA TCTCGTCGAC GGCCCGCACG TGAAGGCCGT CCATCATCTG CAACTGCGCC CCGGCACGAA CGTCGCGCTC GTCAACGCGC TCGCGCACGT GATCGTCACC GAGGGCCTCG TCGACGAGGC GTTCGTCGCC GAACGCTGCG AGCCGCACGC GTTCGACGTG TGGCGCGCGT TCGCCGCGCG GCCCGAGAAC TCGCCCGAGG CGACGGCGGA CATCACCGGC GTGCCGGCCG ACGCGGTGCG CGCCGCCGCG CGCCTGTACG CGACGGGCGG GCGCGCGGCG ATCTTCTACG GGCTCGGCGT CACCGAGCAC GCGCAGGGCT CGACGATGGT GATGGGCATC GCGAACCTCG CGATGGCGAC GGGCAACCTC GGCATCGAGG GCGCGGGCGT GAACCCGCTG CGCGGGCAGA ACAACGTGCA GGGCTCGTGC GACATGGGCT CGTTCCCGCA CGAACTGCCC GGCTACCGGC ATATCGGCGA TGCGGCCGTG CGCGCGCGCT TCGACGAGGC ATGGGCGACG ACGCTGCAGC CGGAGCCCGG CCTGCGCATC CCGAACATGT TCGACGCGGC GCTCGACGGC AGCTTCAAGG GCCTCTACTG CCAGGGCGAG GACATCGTCC AGTCGGACCC GAACACGCAG CACGTCGCGG CGGCGCTATC GTCGCTCGAA TGCCTCGTCG TGCAGGACAT CTTCCTGAAC GAAACCGCGA AGTACGCGCA CGTGTTCCTG CCCGGCGCGA CCTTCCTCGA GAAGGACGGC ACGTTCACGA ACGCCGAGCG CCGGATCTCG CGCGTGCGCC GCGCGATGAG GCCGCTTTCC GGCTACGCGG ACTGGGAGGT GACGCTGATG CTGTCGCGCG CGCTCGGCTA CGACATGCAT TACGCGCATC CGTCCGAGAT CATGGACGAG ATCGCGCGTC TCACGCCGAC GTTCGCGGGC GTGTCGTACG CGCTGCTCGA CGCGCTGGGC AGCGTCCAGT GGCCGTGCAA CGATGCGGCG CCGGAAGGCA CGCCGACGAT GCACGTCGAT CACTTCGTGC GCGGCAAGGG CAAGTTCATG ATCACGCAGT ACATCGCGTC GCCGGAGAAG GTCACGCCGC GCTATCCGCT CATCCTGACG ACGGGCCGAA TCCTGTCGCA GTACAACGTC GGCGCGCAGA CGCGCCGCAC GGAAAACGTG CGCTGGCACG ACGAGGACCG GCTCGAGATC CATCCGCACG ACGCGAGCGA TCGCGGCATC AAGACGGGCG ACTGGGTCGG TGTCGAATCA CGCGCCGGGC AGACGGTGCT GCGCGCGCTC GTCACCGAGC GGATGCAGCC GGGCGTCGTC TACACGACGT TCCACTTCCC GGAATCCGGC GCGAACGTGA TCACGACCGA CAGCTCGGAC TGGGCGACCA ACTGCCCCGA GTACAAGGTG ACGGCCGTGC AGGTCGCGCC CGTCGCGCAA CCGTCCGAGT GGCAGCGCGC GTACACGCGC TTTCGCGCGG AGCAGCTCGC GCTGCTCGAG CAGCGCACCG CCGCGAACGC GCCGGCAACC GCAACGGGCA AGTGA
|
Protein sequence | MTSSAFDSSR QGCGSGQCAC KSAAHARRAD PFDDTDYGTP QRHADADVTL DIDGREVTVP AGTSVMRAAI EAGINVPKLC ATDSLEPFGS CRLCLVEIEG RRGYPASCTT PVEAGMKVRT QSDRLQDLRR NVMELYISDH PLDCLTCAAN GDCELQDMAG AVGLREVRYG FDGKNHLSDA KDESNPYFSY DPAKCIVCNR CVRACEETQG TFALTIAARG FDSRVAASAG DAFMDSECVS CGACVAACPT ATLIEKSVAR LGQPEHEVVT TCAYCGVGCS LKAEMKGEQV VRMTPHKNGQ ANEGHACVKG RFAWGYATHK DRITKPMIRE KITDPWREVS WDEAIGHAAA QFRRIQDKHG RDSIGGITSS RCTNEETYLV QKLVRAAFGN NNVDTCARVC HSPTGYGLKV TLGESAGTQT FASVGSADVI VVIGANPTDG HPVFGSRLKR RVREGAKLIV ADPRRIDLVD GPHVKAVHHL QLRPGTNVAL VNALAHVIVT EGLVDEAFVA ERCEPHAFDV WRAFAARPEN SPEATADITG VPADAVRAAA RLYATGGRAA IFYGLGVTEH AQGSTMVMGI ANLAMATGNL GIEGAGVNPL RGQNNVQGSC DMGSFPHELP GYRHIGDAAV RARFDEAWAT TLQPEPGLRI PNMFDAALDG SFKGLYCQGE DIVQSDPNTQ HVAAALSSLE CLVVQDIFLN ETAKYAHVFL PGATFLEKDG TFTNAERRIS RVRRAMRPLS GYADWEVTLM LSRALGYDMH YAHPSEIMDE IARLTPTFAG VSYALLDALG SVQWPCNDAA PEGTPTMHVD HFVRGKGKFM ITQYIASPEK VTPRYPLILT TGRILSQYNV GAQTRRTENV RWHDEDRLEI HPHDASDRGI KTGDWVGVES RAGQTVLRAL VTERMQPGVV YTTFHFPESG ANVITTDSSD WATNCPEYKV TAVQVAPVAQ PSEWQRAYTR FRAEQLALLE QRTAANAPAT ATGK
|
| |