Gene BTH_I2844 details


Gene Information

Locus tag: BTH_I2844
Symbol:
ID: 3848206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007651
Strand:
Start bp: 3261445
End bp: 3263391
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 65%
IMG OID: 637842512
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_443356
Protein GI: 83719533
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
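
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3261445, 3263391)
print(length)                  # 1947
print(protein_length(length))  # 648
```

Under these assumptions the computed values agree with the listed gene length (1947 bp) and protein length (648 aa).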


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCA ACCCCAAGTT CCTGTCGGCC GACGCCCGCG TCGACGCCGC CGCCGTCGCA 
CCACTGCCGA ATTCGCGCAA GGTCTACGTG ACGGGCTCGC AACCCGACAT CCGCGTGCCG
ATGCGTGAAA TCACGCAAGC CGATACGCCG ACGAGCTTCG GCGGCGAGAA GAATCCGCCG
ATCTACGTCT ACGACACATC GGGCCCGTAC ACGGACCCGG ACGCGAAGAT CGACATCCGC
GCGGGCCTGC CCGCGCTGCG CCAGCGCTGG ATCGACGCGC GCGGCGACAC CGAGACGCTC
GCGGGCCTCA CGAGCGAATA CGGCCGCGAG CGCGCGGCCG ACCCGGCGAC CGCCGAGCTG
CGCTTCCCCG ATCTGCACCG TCATCCGCGC CGCGCCAAAG CCGGCAGGAA CGTCACGCAG
ATGCACTACG CGCGCCAGGG CATCATCACG CCGGAAATGG AATTCATCGC GATCCGCGAG
AACCAGCGCC GCGCCGAGTA TCTGGAAAGC CTGAAGGCGA GCGGCCCGAA CGGCGCGAAG
CTCGCCGCGA TGATGGGCCG CCAGCACGCG GGCCAGGCGT TCGGCGCCGC CGCGTTCGGC
GCAAACGAGG CCGGCACGAA TATGCTGACC GAGATCACGC CGGAATTCGT GCGCTCGGAA
GTGGCGTGCG GCCGCGCGAT CATTCCGGCG AACATCAACC ACCCGGAAAC CGAGCCGATG
ATCATCGGCC GCAACTTCCT CGTGAAGATC AACGCGAACA TCGGCAACTC GGCCGTCACG
TCGTCGATCG GCGAGGAAGT CGACAAGATG ACGTGGGCGA TCCGCTGGGG CGGCGACACG
GTGATGGACT TGTCGACCGG CAAGCACATC CATGAAACGC GCGAGTGGAT CATCCGCAAC
AGCCCGGTGC CGATCGGCAC GGTGCCGATC TACCAGGCGC TGGAAAAGGT CAACGGCAAG
GCCGAGGACC TGACCTGGGA AATCTTCCGC GACACGCTGA TCGAGCAGGC CGAGCAAGGC
GTCGACTACT TCACGATCCA CGCGGGCGTG CGCCTGCAGT ACGTGCCGCT CACCGCGAAC
CGGATGACGG GCATCGTGTC GCGCGGCGGC TCGATCATGG CGAAGTGGTG TCTCGCGCAT
CACAAGGAGA GCTTCCTGTA CGAGCACTTC GAAGAGATCT GCGAAATCAT GAAGGCGTAC
GACGTGAGCT TCTCGCTCGG CGACGGCCTG CGCCCCGGCT CGATCTACGA CGCGAACGAC
GAAGCGCAGT TGGGCGAACT GAAGACGCTC GGCGAGCTCA CGCAAATCGC GTGGAAGCAC
GACGTGCAGG TGATGATCGA AGGCCCCGGC CACGTGCCGA TGCAGTTGAT CAAGGAGAAC
ATGGATCTGC AGCTCGACTG GTGCAAGGAA GCGCCGTTCT ACACGCTCGG GCCGCTCACG
ACCGACATCG CGCCCGGCTA CGACCACATC ACGTCGGGCA TCGGCGCCGC GATGATCGGC
TGGTTCGGCA CCGCGATGCT CTGCTACGTG ACACCGAAGG AACACCTCGG CCTGCCGAAC
AAGGACGACG TGAAGGAAGG CATCATCACG TACAAGCTCG CCGCGCATGC CGCGGACCTC
GCGAAGGGCC ACCCGGGCGC GCAGGTGCGC GACAACGCGT TGTCGAAGGC GCGCTTCGAG
TTCCGCTGGG AAGACCAGTT CAACCTCGGC CTCGATCCGG ACAAGGCGCG CGAATTCCAC
GACGAAACGC TGCCGAAGGA TTCGGCGAAG GTCGCGCACT TCTGCTCGAT GTGCGGCCCG
CACTTCTGCT CGATGAAGAT CACGCAGGAC GTGCGCGAGT TCGCCGCGCA GCAAGGCATG
TCGGAAGACG ATGCGCTGAA GAAGGGGATG GAAGTGAAGG CGGTCGAGTT CGTGAAGACC
GGCTCGGAGA TCTATCACCG CCAGTAA
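
A GC-content figure such as the one listed for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base blocks shown above; `gc_percent` is a hypothetical helper name, and the demonstration uses only the first block of the gene rather than the full sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above.
print(gc_percent("ATGAACGCCA"))  # 50.0
```

Passing the complete gene sequence to the same function gives the gene-wide value to compare against the GC content field.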
 
Protein sequence
MNANPKFLSA DARVDAAAVA PLPNSRKVYV TGSQPDIRVP MREITQADTP TSFGGEKNPP 
IYVYDTSGPY TDPDAKIDIR AGLPALRQRW IDARGDTETL AGLTSEYGRE RAADPATAEL
RFPDLHRHPR RAKAGRNVTQ MHYARQGIIT PEMEFIAIRE NQRRAEYLES LKASGPNGAK
LAAMMGRQHA GQAFGAAAFG ANEAGTNMLT EITPEFVRSE VACGRAIIPA NINHPETEPM
IIGRNFLVKI NANIGNSAVT SSIGEEVDKM TWAIRWGGDT VMDLSTGKHI HETREWIIRN
SPVPIGTVPI YQALEKVNGK AEDLTWEIFR DTLIEQAEQG VDYFTIHAGV RLQYVPLTAN
RMTGIVSRGG SIMAKWCLAH HKESFLYEHF EEICEIMKAY DVSFSLGDGL RPGSIYDAND
EAQLGELKTL GELTQIAWKH DVQVMIEGPG HVPMQLIKEN MDLQLDWCKE APFYTLGPLT
TDIAPGYDHI TSGIGAAMIG WFGTAMLCYV TPKEHLGLPN KDDVKEGIIT YKLAAHAADL
AKGHPGAQVR DNALSKARFE FRWEDQFNLG LDPDKAREFH DETLPKDSAK VAHFCSMCGP
HFCSMKITQD VREFAAQQGM SEDDALKKGM EVKAVEFVKT GSEIYHRQ
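
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table by hand instead of relying on a library. It assumes the gene is given 5' to 3' on its coding strand and starts with ATG, so the special start-codon handling of translation table 11 does not come into play (table 11 uses the same codon-to-amino-acid assignments as the standard code). `translate` is an illustrative name.

```python
# Codon table in NCBI order: first, second, and third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a codon containing anything other than
    A, C, G, or T raises KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAACGCCA ACCCCAAGTT CCTGTCGGCC"))  # MNANPKFLSA
# Final line of the gene sequence, ending in the TAA stop codon.
print(translate("GGCTCGGAGA TCTATCACCG CCAGTAA"))     # GSEIYHRQ
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.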