Gene Bcep18194_A4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4144 
Symbol 
ID3749336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1075985 
End bp1078936 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content66% 
IMG OID637762428 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_368385 
Protein GI78065616 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.110494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCG ACACGAACAA CGTCCGCCAA GGCGGCTGCG GCTCGGGCCA ATGCGCGTGC 
AAGAGCGCCG CGCAGGCGCG TGCCGGCAAC CCGTTCGACG ATACCGATTA CGGCACGCCC
CAGCGGCATG CCGATACCGA TGTCACGCTC GAAATCGACG GCCAACCGGT CACGGTGCCG
GCCGGCACGT CGGTGATGCG CGCGGCGATC GAAGCCGGCG TGAACGTCCC GAAGCTCTGC
GCGACCGATT CGCTCGAACC GTTCGGCTCG TGCCGGCTGT GCCTCGTCGA GATCGAAGGC
CGGCGCGGTT ATCCGGCGTC GTGCACAACA CCTGCCGAAG CGGGCATGAA GGTGCGCACG
CAGTCGGACC GGCTGCAGTC GCTGCGTCGC AACGTGATGG AGCTGTACAT CTCCGACCAT
CCGCTCGACT GCCTCACCTG CCCGGCCAAC GGCGACTGCG AGCTGCAGGA CATGGCGGGC
GTCGTCGGGC TGCGCGAGGT GCGGTACGGC TTCGACGGCG CAAATCATCT GCGCGACAAA
AAGGACGAGT CGAATCCGTA CTTCACGTAC GACCCGTCGA AGTGCATCGT CTGCAACCGC
TGCGTGCGCG CCTGCGAGGA AACGCAGGGC ACGTTCGCGC TGACGATCGC CGCACGCGGC
TTCGAATCGC GCGTCGCCGC AGGCGAAAGC GAATCGTTCA TGGCATCGGA ATGCGTGTCG
TGCGGCGCAT GCGTTGCCGC ATGTCCGACG GCCACGCTGC AGGAAAAATC CGTCGTGCAA
CTCGGGCAGG CCGAACACTC GGTCGTCACG ACCTGCGCGT ATTGCGGCGT GGGCTGCTCG
TTCAAGGCGG AGATGAAGGG CACGCAGGTC GTGCGCATGA CGCCGCACAA GAACGGCCTC
GCGAACGAGG GCCACGCGTG CGTGAAGGGC CGCTTCGCGT GGGGCTATGC GACGCACAAA
GACCGCATCA CGAAGCCGAT GATCCGCGAG AAGATCACCG ACCCGTGGCG CGAAGTCAGC
TGGGACGAAG CGCTCACCTA CGCGGCCACG CAATTCCGCA AGCTGCAGGA CAAGTACGGC
CGCGATTCCA TCGGCGGCAT CACGTCGTCG CGCTGCACGA ACGAGGAAAC CTACCTCGTG
CAGAAGCTGG TGCGCGCCGC GTTCGGCAAC AACAACGTCG ACACCTGCGC ACGCGTGTGC
CACTCGCCGA CGGGCTATGG CCTCAAGACG ACGCTCGGCG AATCGGCCGG CACGCAGACA
TTCGCGTCGG TCGGCCAGGC CGACGTGATC GTCGTGATGG GTGCGAACCC GACCGACGGC
CACCCGGTGT TCGGCTCACG GCTGAAACGG CGCGTACGTG AAGGCGCAAA ACTGATCGTG
ATCGACCCGC GCCGCATCGA CGTCGTCGAC GGCCCGCACG TGAAAGCCAC CCACCACCTC
CAGTTGCGCC CCGGCACCAA CGTCGCGATC GTCAACGCGC TCGCGCACGT GATCGTCACC
GAAGGGCTCG TCGCCGACGC ATTCGTCGCC GAGCGCTGCG AGACGCGCGC ATTCGAGCAA
TGGCGCGACT TCGTGTCGCG TGCCGACAAC TCGCCCGAGG CGACCGCCGG CGTGACGGGC
GTGCCGGCCG AGTCGGTACG CGAAGCCGCG CGCCTCTACG CGACGGGCGG CAATGCCGCG
ATCTACTACG GGCTGGGCGT GACCGAACAC GCGCAGGGCT CGACGACGGT GATGGGCATC
GCGAACCTCG CGATGGCGAC CGGCAACATT GGCCGCGAAG GCGTCGGCGT CAATCCGCTG
CGCGGCCAGA ACAACGTGCA GGGCTCGTGC GACATGGGCT CGTTCCCGCA CGAACTGCCC
GGCTACCGCC ACATCGGCGA CGAGGTCGTG CGCGCGCAGT TCGAAGCTGC ATGGTCGGCG
AAACTGCAAC CGGAACCGGG GCTGCGCATC CCGAACATGT TCGATGCGGC GCTCGACGGC
AGCTTCAAGG GGCTCTACTG CCAGGGCGAG GACATCGTCC AGTCGGACCC GAATACGCAG
CACGTCGCGG CCGCGCTGTC GGAAATGGAA TGCATCGTCG TGCAGGACAT CTTCCTGAAC
GAGACCGCGA AATACGCGCA CGTGCTGCTG CCCGGCTCGT CGTTCCTCGA GAAGGACGGC
ACGTTCACGA ACGCGGAACG CCGCATCTCA CGCGTGCGCA AGGTGATGCC GCCGCTCGCG
GGCTACGCGG ACTGGGAAGT CACGCTGATG CTGTCGCGTG CGCTCGGCTA CGAGATGGAC
TATGCGCATC CGTCGGAAAT CATGGACGAG ATCGCGCGAC TCACGCCGAC CTTCTCGGGT
GTGTCGTACG CGAAGCTCGA CACGCTCGGC AGCATTCAGT GGCCGTGCAA CGAGCACGCG
CCGGAAGGCA CGCCGACGAT GCATATCGAC GCATTCGTGC GCGGCAAGGG CAAGTTCGTG
ATCACGCAGT TCATCGCGTC GCCGGAGAAG GTCACGCAAC GCTATCCGCT GATCCTGACG
ACGGGTCGCA TCCTGTCTCA GTACAACGTC GGCGCGCAGA CCCGCCGCAC CGAGAACGTG
CAGTGGCACG AAGAGGATCG CCTCGAGATT CATCCGCACG ACGCGCAGGA TCGCGGTATC
CGCAGCGGCG ACTGGGTGGG CATCGAATCG CGTGCGGGGC AGACGGTGTT GCGCGCGCTC
GTGACCGAAC GCATGCAGCC GGGCGTCGTC TATACGACGT TCCACTTCCC CGAATCGGGC
GCGAACGTGA TCACGACGGA CAGCTCGGAC TGGGCGACGA ATTGCCCCGA GTACAAGGTG
ACGGCCGTGC AGGTGCTGCC CGTCGCGCAG CCGTCCGACT GGCAGCAAGC GTATGCGCGC
TTCAACGCGG AGCAGCTCGA CCTGCTCGAG CGCCGCGCCG CCGCGACCGC CACCGTGACG
ACAGGCAAGT GA
 
Protein sequence
MSLDTNNVRQ GGCGSGQCAC KSAAQARAGN PFDDTDYGTP QRHADTDVTL EIDGQPVTVP 
AGTSVMRAAI EAGVNVPKLC ATDSLEPFGS CRLCLVEIEG RRGYPASCTT PAEAGMKVRT
QSDRLQSLRR NVMELYISDH PLDCLTCPAN GDCELQDMAG VVGLREVRYG FDGANHLRDK
KDESNPYFTY DPSKCIVCNR CVRACEETQG TFALTIAARG FESRVAAGES ESFMASECVS
CGACVAACPT ATLQEKSVVQ LGQAEHSVVT TCAYCGVGCS FKAEMKGTQV VRMTPHKNGL
ANEGHACVKG RFAWGYATHK DRITKPMIRE KITDPWREVS WDEALTYAAT QFRKLQDKYG
RDSIGGITSS RCTNEETYLV QKLVRAAFGN NNVDTCARVC HSPTGYGLKT TLGESAGTQT
FASVGQADVI VVMGANPTDG HPVFGSRLKR RVREGAKLIV IDPRRIDVVD GPHVKATHHL
QLRPGTNVAI VNALAHVIVT EGLVADAFVA ERCETRAFEQ WRDFVSRADN SPEATAGVTG
VPAESVREAA RLYATGGNAA IYYGLGVTEH AQGSTTVMGI ANLAMATGNI GREGVGVNPL
RGQNNVQGSC DMGSFPHELP GYRHIGDEVV RAQFEAAWSA KLQPEPGLRI PNMFDAALDG
SFKGLYCQGE DIVQSDPNTQ HVAAALSEME CIVVQDIFLN ETAKYAHVLL PGSSFLEKDG
TFTNAERRIS RVRKVMPPLA GYADWEVTLM LSRALGYEMD YAHPSEIMDE IARLTPTFSG
VSYAKLDTLG SIQWPCNEHA PEGTPTMHID AFVRGKGKFV ITQFIASPEK VTQRYPLILT
TGRILSQYNV GAQTRRTENV QWHEEDRLEI HPHDAQDRGI RSGDWVGIES RAGQTVLRAL
VTERMQPGVV YTTFHFPESG ANVITTDSSD WATNCPEYKV TAVQVLPVAQ PSDWQQAYAR
FNAEQLDLLE RRAAATATVT TGK