Gene Bcep18194_A4369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4369 
Symbol 
ID3749568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1323038 
End bp1324708 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content69% 
IMG OID637762658 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_368609 
Protein GI78065840 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.535677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCG ATTCGTACGA CTACGTGATC GTCGGCGCCG GCTCGGCCGG TTGCGCACTC 
GCCTACCGGC TCGGCGAGGA TCCGAACGTC CGCATCCTCG TGATCGAGGC CGGCGAACAG
GATCGCTCGC CGTACATCAA GGTGCCGCTG ACGTGGGGCC AGATTCTGAA GAACCGGCTG
TTCGACTGGG GCTATTTCAC CGAGCCGGAA GCCGGCATGG ACGGTCGCCG GATCGAGTGC
GCGCGCGGCA AGGTGGTGGG CGGCTCGTCG TCGATCAACG GCATGGCCTA CGCACGCGGC
GCGCGGGAAG ACTACGAAGG CTGGGCCGAC GAGTTCGGCC TGACCGACTG GTCCTACGAC
GCGGTGCTGC CGTACTTCAA GCGCTCCGAA TCGTGGGAGC GCGGCGAATC GGCGTTGCGC
GGCGGTCGCG GCCCGCTGAC CGTGATCAAG CTCGACTATC GCGACCCGCT GGTCGGCGGC
TTTCTCGACG CGACGCGTGC GTGCGGCTAT CCGGAAAACG ACGACTACAA CGGCGCATCC
GTCGAAGGCT TCGGGCCGAT GCAGGCCACC ATCCGCAACG GCCTGCGCTG CAGCGCCGCG
GTCGCCTATC TGCGCCCGGC GCTCGCGCGC GGCAACGTCA CGCTGGTGAC CGGGGCCCTC
GCGAAACGGA TCGTGCTCGA TACCGACAGC GGTACGCCGC GCGCCATCGC GATCGAGTAT
CGCCGTGGCG AGTCCGACTA CCGCGCCGAT GCACGCCGCG AAGTCATCCT CTGCGGCGGC
GTGATCAATT CGCCGCAGCT GCTGATGCTC TCCGGCATCG GCGCGGCCGA CAGCCTGCGC
ACGCACGGCA TCGCGTCGAA AGTCGAATTG CCCGGCGTGG GCGCCAACCT GCATGACCAC
ATCGTGTTCG ACCTGCGCTG GAGCCGCAAG GAACCGGGGC CGCTGCACCG GATGATGCGT
GCCGACCGTA TCGCGTTCGA CGTCGCGCGC ACGCTGGCAG GCGGCAACGG CTTCTCGAGT
GCGATCCCCG CCGCCGCGCT CGGGCTGGTC CGCAGCCAGC CTCACCTGCC GCTGCCGGAC
GTGCAGCTGA TCCTCGCGGC CGGCGCGATG AACGCCGCGC CGTACTTCGA GCCGTTCAAG
CACGCGTATG CCGATTCGTT CGCGATCAAG GGCATTTTCC TCACGCCCGA AAGCCGCGGC
CGCGTGTCGC TGAAGTCGGC CGATCCGGCC CAGCACGCGC GGATCGAGCA AAACTTCCTC
GCCACCGAGC ACGATCGCGT CGCAGCGCGC GAGATGTTCC GGCGCATGCG GGAAATCGGC
GCGCAGGCCG GGCTGCGCCC GTTCATCGAC GCAGAGATCG CACCGGGCCC GCAGGTCCAG
AGCGACGCAG ACGTCGACGC CTTCATTCGT CGTGTCGCGA TCACCCTTCA TCACCCGGTC
GGCACGTGCC GCATGGGTCG CGACGACGAT CCGGCGGCCG TGGTCGATAC GCAGATGCGC
GTGCGCGGCG TCGCCGGATT GCGGGTGGTC GACGGCTCGT CGATCCCGCG CATCATCCGC
GGGCCGACCA ACGCGCTGAT CATGACGATG GCCGAACGCG CAGCCGATTT CATGACCGGG
AAAGCCATGC CCGTGCCGCA GGCGCAGGTT CGCGCCACCG CACCGATGTA G
 
Protein sequence
MKTDSYDYVI VGAGSAGCAL AYRLGEDPNV RILVIEAGEQ DRSPYIKVPL TWGQILKNRL 
FDWGYFTEPE AGMDGRRIEC ARGKVVGGSS SINGMAYARG AREDYEGWAD EFGLTDWSYD
AVLPYFKRSE SWERGESALR GGRGPLTVIK LDYRDPLVGG FLDATRACGY PENDDYNGAS
VEGFGPMQAT IRNGLRCSAA VAYLRPALAR GNVTLVTGAL AKRIVLDTDS GTPRAIAIEY
RRGESDYRAD ARREVILCGG VINSPQLLML SGIGAADSLR THGIASKVEL PGVGANLHDH
IVFDLRWSRK EPGPLHRMMR ADRIAFDVAR TLAGGNGFSS AIPAAALGLV RSQPHLPLPD
VQLILAAGAM NAAPYFEPFK HAYADSFAIK GIFLTPESRG RVSLKSADPA QHARIEQNFL
ATEHDRVAAR EMFRRMREIG AQAGLRPFID AEIAPGPQVQ SDADVDAFIR RVAITLHHPV
GTCRMGRDDD PAAVVDTQMR VRGVAGLRVV DGSSIPRIIR GPTNALIMTM AERAADFMTG
KAMPVPQAQV RATAPM