Gene BTH_II2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II2121 
Symbol 
ID3845828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2606460 
End bp2607452 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content71% 
IMG OID637839422 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_440309 
Protein GI83718255 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACGG TCGCTCAGAT TCGTTTCGAT TCCATTGCCC GCTGGATGCC GGTCGCGCTG 
TCCGAGCAGG TGAGCGGCAG GGCGGCGCTT GCCGTCATCT GCATGGAGCA GCCGCTCGTG
CTGTTTCGCG ACGCGTCGGG CGCCGTATGC GCGATGGAGG ATCGTTGCGC GCATCGCCGA
GCGCCGCTAT CGCTCGGGCG CGTCACGCCC GACGGCCGGC TGCAGTGCGC GTATCACGGC
TGGACCTACG ACGGCGCGAC GGGCGCCTGC GTGGCGATTC CGAATCTGTC GGCGAGCGAG
CGCGTGCCCG CGCACTATGC CGCGCATGCG TACAAGACGC TCGAACGCGA CGGCTTCATA
TGGGCCTGCG CGCGCGATGC ACCGCCACCC GCCGAGGCGA TCGCTCGCGA CGCCCGCAGC
GCCCGGCGAT TCGCGGGCTC GGTGACGGTC GCCATCGCGC GCGACGAATA CGTCGCCGCA
TTGGCCGACG GGCCGCATCT GACGATGCGC ATCGCCGGCC TGTACATCAC GGATTACGTG
ATCGCGGACG CGACGCCGCA CGACGGCGAC ATCGCGACGG AACGCGGCGT CACGTGGCTG
GCGCACATCG TCGACAGGCA CTTCGGCGTG CGTCATCCGT GGACGCTGCG CGTCACGTCG
CCGCGAGACG GTGTCCTCGC GTCGGTCGAA CTCGCATCGC GCGACGGCGC GACGGCGCTC
TGGGCGTCGA TCGCGATCAC GCCGGCGGCG CGCGGCGCGA CGAACGTACT GTGGCGCGGC
GGCGTCGCGG CCGACGCGAG CGGCTTCGGC GCAAAACTGT TTCGGACGTG GGCGCGCCTG
CACGCCGTGC CGTTCGCGAT GCTCGCGCAC GTCGACGGCC GCGCGCTATC GACGCTCGAC
GCGCTCTATT CGCGGGCATG GCGCGGCCCG ATCCCGGAGG GCATCGCCCA CACGCGGCCG
ATGCCGGCCG ACTATCGCAC AAGGAGCCGA TGA
 
Protein sequence
MNTVAQIRFD SIARWMPVAL SEQVSGRAAL AVICMEQPLV LFRDASGAVC AMEDRCAHRR 
APLSLGRVTP DGRLQCAYHG WTYDGATGAC VAIPNLSASE RVPAHYAAHA YKTLERDGFI
WACARDAPPP AEAIARDARS ARRFAGSVTV AIARDEYVAA LADGPHLTMR IAGLYITDYV
IADATPHDGD IATERGVTWL AHIVDRHFGV RHPWTLRVTS PRDGVLASVE LASRDGATAL
WASIAITPAA RGATNVLWRG GVAADASGFG AKLFRTWARL HAVPFAMLAH VDGRALSTLD
ALYSRAWRGP IPEGIAHTRP MPADYRTRSR