Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2844 |
Symbol | |
ID | 3848206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 3261445 |
End bp | 3263391 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637842512 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_443356 |
Protein GI | 83719533 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCA ACCCCAAGTT CCTGTCGGCC GACGCCCGCG TCGACGCCGC CGCCGTCGCA CCACTGCCGA ATTCGCGCAA GGTCTACGTG ACGGGCTCGC AACCCGACAT CCGCGTGCCG ATGCGTGAAA TCACGCAAGC CGATACGCCG ACGAGCTTCG GCGGCGAGAA GAATCCGCCG ATCTACGTCT ACGACACATC GGGCCCGTAC ACGGACCCGG ACGCGAAGAT CGACATCCGC GCGGGCCTGC CCGCGCTGCG CCAGCGCTGG ATCGACGCGC GCGGCGACAC CGAGACGCTC GCGGGCCTCA CGAGCGAATA CGGCCGCGAG CGCGCGGCCG ACCCGGCGAC CGCCGAGCTG CGCTTCCCCG ATCTGCACCG TCATCCGCGC CGCGCCAAAG CCGGCAGGAA CGTCACGCAG ATGCACTACG CGCGCCAGGG CATCATCACG CCGGAAATGG AATTCATCGC GATCCGCGAG AACCAGCGCC GCGCCGAGTA TCTGGAAAGC CTGAAGGCGA GCGGCCCGAA CGGCGCGAAG CTCGCCGCGA TGATGGGCCG CCAGCACGCG GGCCAGGCGT TCGGCGCCGC CGCGTTCGGC GCAAACGAGG CCGGCACGAA TATGCTGACC GAGATCACGC CGGAATTCGT GCGCTCGGAA GTGGCGTGCG GCCGCGCGAT CATTCCGGCG AACATCAACC ACCCGGAAAC CGAGCCGATG ATCATCGGCC GCAACTTCCT CGTGAAGATC AACGCGAACA TCGGCAACTC GGCCGTCACG TCGTCGATCG GCGAGGAAGT CGACAAGATG ACGTGGGCGA TCCGCTGGGG CGGCGACACG GTGATGGACT TGTCGACCGG CAAGCACATC CATGAAACGC GCGAGTGGAT CATCCGCAAC AGCCCGGTGC CGATCGGCAC GGTGCCGATC TACCAGGCGC TGGAAAAGGT CAACGGCAAG GCCGAGGACC TGACCTGGGA AATCTTCCGC GACACGCTGA TCGAGCAGGC CGAGCAAGGC GTCGACTACT TCACGATCCA CGCGGGCGTG CGCCTGCAGT ACGTGCCGCT CACCGCGAAC CGGATGACGG GCATCGTGTC GCGCGGCGGC TCGATCATGG CGAAGTGGTG TCTCGCGCAT CACAAGGAGA GCTTCCTGTA CGAGCACTTC GAAGAGATCT GCGAAATCAT GAAGGCGTAC GACGTGAGCT TCTCGCTCGG CGACGGCCTG CGCCCCGGCT CGATCTACGA CGCGAACGAC GAAGCGCAGT TGGGCGAACT GAAGACGCTC GGCGAGCTCA CGCAAATCGC GTGGAAGCAC GACGTGCAGG TGATGATCGA AGGCCCCGGC CACGTGCCGA TGCAGTTGAT CAAGGAGAAC ATGGATCTGC AGCTCGACTG GTGCAAGGAA GCGCCGTTCT ACACGCTCGG GCCGCTCACG ACCGACATCG CGCCCGGCTA CGACCACATC ACGTCGGGCA TCGGCGCCGC GATGATCGGC TGGTTCGGCA CCGCGATGCT CTGCTACGTG ACACCGAAGG AACACCTCGG CCTGCCGAAC AAGGACGACG TGAAGGAAGG CATCATCACG TACAAGCTCG CCGCGCATGC CGCGGACCTC GCGAAGGGCC ACCCGGGCGC GCAGGTGCGC GACAACGCGT TGTCGAAGGC GCGCTTCGAG TTCCGCTGGG AAGACCAGTT CAACCTCGGC CTCGATCCGG ACAAGGCGCG CGAATTCCAC GACGAAACGC TGCCGAAGGA TTCGGCGAAG GTCGCGCACT TCTGCTCGAT GTGCGGCCCG CACTTCTGCT CGATGAAGAT CACGCAGGAC GTGCGCGAGT TCGCCGCGCA GCAAGGCATG TCGGAAGACG ATGCGCTGAA GAAGGGGATG GAAGTGAAGG CGGTCGAGTT CGTGAAGACC GGCTCGGAGA TCTATCACCG CCAGTAA
|
Protein sequence | MNANPKFLSA DARVDAAAVA PLPNSRKVYV TGSQPDIRVP MREITQADTP TSFGGEKNPP IYVYDTSGPY TDPDAKIDIR AGLPALRQRW IDARGDTETL AGLTSEYGRE RAADPATAEL RFPDLHRHPR RAKAGRNVTQ MHYARQGIIT PEMEFIAIRE NQRRAEYLES LKASGPNGAK LAAMMGRQHA GQAFGAAAFG ANEAGTNMLT EITPEFVRSE VACGRAIIPA NINHPETEPM IIGRNFLVKI NANIGNSAVT SSIGEEVDKM TWAIRWGGDT VMDLSTGKHI HETREWIIRN SPVPIGTVPI YQALEKVNGK AEDLTWEIFR DTLIEQAEQG VDYFTIHAGV RLQYVPLTAN RMTGIVSRGG SIMAKWCLAH HKESFLYEHF EEICEIMKAY DVSFSLGDGL RPGSIYDAND EAQLGELKTL GELTQIAWKH DVQVMIEGPG HVPMQLIKEN MDLQLDWCKE APFYTLGPLT TDIAPGYDHI TSGIGAAMIG WFGTAMLCYV TPKEHLGLPN KDDVKEGIIT YKLAAHAADL AKGHPGAQVR DNALSKARFE FRWEDQFNLG LDPDKAREFH DETLPKDSAK VAHFCSMCGP HFCSMKITQD VREFAAQQGM SEDDALKKGM EVKAVEFVKT GSEIYHRQ
|
| |