Gene BTH_II0452 details


Gene Information

Locus tag: BTH_II0452
Symbol:
ID: 3845293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007650
Strand:
Start bp: 539936
End bp: 541078
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 66%
IMG OID: 637837757
Product: sulfotransferase domain-containing protein
Protein accession: YP_438652
Protein GI: 83716768
COG category:
COG ID:
TIGRFAM ID:
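
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but the arithmetic agrees with the stated gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 539936
end_bp = 541078
gene_length_bp = 1143
protein_length_aa = 380


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))  # 1143
print(coding_codons(gene_length_bp))  # 380
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.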


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.279842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCATG CAGCAACCAG CGGCGCGACC GCCGCCGCCC GTCCCCGTCC CGTGCTGATG 
ATTCCGCTGA GGCGCTGCGG CAGTCACGCG CTGCGGCTGC GGCTGAATCT CAATTCGCAG
TTCTATTCGC CGTATCCGCT GCATGTCGTC GACTTCATGC CGCTGTTGCC GCTGTACGGC
GATCTCGCCG ACGACCGCGC GTACTTCCGG CTCGTCGCCG ACGTCGTCGG CCTGCAGGCG
GCGAGCATGG TCAAGTGGCC CGGCGTAGCG TTCGATCCGG TCGAGATTTT CGACGCGGTC
CGGCACGCGC CGCGCAGCGT TCACCGCATC GTCTGGGAGC TGCTGTTGCG CGCGGGCGAG
CACGAGGGCG CGCGCGTCGT GATGGACAAG TCGCTCGACA GCGTGCACTA CGCCGACGAA
CTGATGGCAT TGTTTCCGGA CATGCTGTTT CTGAACGTCG TGCGCGATCC GCGCGCGCAG
GTTGCGTCGA TGAACCGCGC GATCATTCAT GATTTCGACA CGCTGCTCAA CGCGCGCACG
TGGGTCGCCG CGCATCGCGC GGCCGACGCC GTGATCGCGC GCCATCCGCA GCGCGTGCTG
ACGATCCGCT ATGAAGACTT CCTGTCGGAT CAGGCGGGCA CGCTGCAGCG CATATGCGCA
TTCTTCGGCA TCGATTTCCT GCCGAGGATG CTCGACGTCG CGAATTCGCA CGAGGCGCTG
CGCATCTCGC GCATGTCCGC GTTGTGGGCG TCGAACTGTT TCGCGCCGAT CGCGGCGAAC
GCGGACAAGT TCAAGCAGCA ACTGTCGATC GCCGAAATCG CGACGATCGA AACGCTCACG
CATGAATACA TGCAACGCTA CGGCTATCAG CGGATGACCG ACGCGAGCGC GCCGCCCGAT
GCATTCGCGG CCGCCGCCGC GCGCCGCCGC TCCGATGCGC GGCGGCGGCA CGCGTGGCGC
GAGCTCGAGC AATCGAATTT TCGCGATTTC GTGCTGCGCC GGCATCGCGC CGATTATCTG
GAGGCGGTGC GCGCCCGATT GCAGCGGCAC GCGGGCACGC ACCTGGATTC GGACGGCAAT
TCGCACGCCG GCGCACCCGG GCGGCTCGAT ACGCTGACCG CGGCATTCGA CGTAACCGAC
TGA
 
Protein sequence
MTHAATSGAT AAARPRPVLM IPLRRCGSHA LRLRLNLNSQ FYSPYPLHVV DFMPLLPLYG 
DLADDRAYFR LVADVVGLQA ASMVKWPGVA FDPVEIFDAV RHAPRSVHRI VWELLLRAGE
HEGARVVMDK SLDSVHYADE LMALFPDMLF LNVVRDPRAQ VASMNRAIIH DFDTLLNART
WVAAHRAADA VIARHPQRVL TIRYEDFLSD QAGTLQRICA FFGIDFLPRM LDVANSHEAL
RISRMSALWA SNCFAPIAAN ADKFKQQLSI AEIATIETLT HEYMQRYGYQ RMTDASAPPD
AFAAAAARRR SDARRRHAWR ELEQSNFRDF VLRRHRADYL EAVRARLQRH AGTHLDSDGN
SHAGAPGRLD TLTAAFDVTD
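
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons, so a standard codon table is enough here. This is a minimal sketch, not the pipeline that produced the record: the helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first 60 bases of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGCATG CAGCAACCAG CGGCGCGACC GCCGCCGCCC GTCCCCGTCC CGTGCTGATG"
print(translate(first_line))  # MTHAATSGATAAARPRPVLM
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.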