Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I0718 |
Symbol | |
ID | 3848222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 827218 |
End bp | 828084 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637840391 |
Product | HesA/MoeB/ThiF family protein |
Protein accession | YP_441274 |
Protein GI | 83720758 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1179] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0036619 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGCA CCGACGCCAT TGCGACGCTT CACGATGTTA CTCCGCAGCC ATCCGGCGAG CTTGACGCGG ATCGCGCCCG GCGCTTCGGC GGCGTCGCCC GGCTGTACGG CGCCAACGCG CTCGCCGCGT TCGAGCGCGC GCGCGTCGCC GTGATCGGCA TCGGCGGCGT CGGCTCGTGG GCGGCTGAGG CGCTCGCGCG CAGCGCCGTC GGGGAACTGA CCCTGATCGA TCTCGACAAC GTCGCCGAAA GCAACACGAA CCGGCAGATC CACGCGCTCG ACGGCAACTA CGGCAAACCG AAGGTCGACG CGATGGCCGA GCGGATCGCG CTCATCGATT CGGCGTGCCG CGTCGTGAAG ATCGAGGATT TCGTCGAGCC GGACAATCTC GATACGCTGC TCGGCGGCGG CTTCGACTAC ATCGTCGACG CGATCGACAG CGTGCGCACG AAAGTCGCGC TGATCGCGTG GTGCGTCGCG CGCGGCCAGC CGCTCGTGAC GGTCGGCGGC GCGGGCGGCC AACTCGATCC GACCCGCATC CGGATCGACG ATCTCGCGCA GACGATCCAG GACCCGCTGC TGTCGAAGGT GCGCGCGCAG TTACGCAAGC AGCACGGCTT TGCGCGCGGG CCGAAAGCGA AATTCAAGGT GAGCGCCGTG TATTCGGACG AGCCGCTGAT CTATCCGGAG GCGGCCGTGT GCGACGTCGA CGAGGTCGCG ATGCACGCGG CAACCGACGC GCAGGCGCCG GGGCCCACCG GGCTCAATTG CGCAGGCTTC GGCTCGAGCG TGTGCGTGAC CGCGAGCTTC GGATTCGCGG CGGCCGCGCA TGCGCTGCGC GCGCTCGCCG CGCGGGCGCA GCGGTAA
|
Protein sequence | MSRTDAIATL HDVTPQPSGE LDADRARRFG GVARLYGANA LAAFERARVA VIGIGGVGSW AAEALARSAV GELTLIDLDN VAESNTNRQI HALDGNYGKP KVDAMAERIA LIDSACRVVK IEDFVEPDNL DTLLGGGFDY IVDAIDSVRT KVALIAWCVA RGQPLVTVGG AGGQLDPTRI RIDDLAQTIQ DPLLSKVRAQ LRKQHGFARG PKAKFKVSAV YSDEPLIYPE AAVCDVDEVA MHAATDAQAP GPTGLNCAGF GSSVCVTASF GFAAAAHALR ALAARAQR
|
| |