Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0905 |
Symbol | |
ID | 4903263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 890270 |
End bp | 891136 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640134135 |
Product | HesA/MoeB/ThiF family protein |
Protein accession | YP_001065186 |
Protein GI | 126453730 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1179] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.386795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGTA CCGACGCCAT TGCGACGCCT CACGATGTTA CTCCGCAGCC ATCCGGCGAG CTTGACGCGG ATCGCGCCCG GCGCTTCGGC GGCGTCGCCC GGCTCTACGG CGCCGATGCG CTCGCCGCGT TCGAGCGCGC GCGCGTCGCG GTGATCGGCA TCGGCGGCGT CGGCTCGTGG GCGGCCGAGG CGCTCGCGCG CAGCGCCGTG GGGGAACTGA CCCTGATCGA TCTCGACAAC GTCGCCGAAA GCAACACGAA CCGGCAGATC CATGCGCTCG ACGGCAATTA CGGCAAGCCG AAGGTCGACG CGATGGCCGA GCGGATCGCG CTCATCGATC CGGCGTGCCG CGTCGTGAAG ATCGAGGATT TCGTCGAGCC GGACAATCTC GACGCACTGC TCGGCGGCGG CTTCGACTAC ATCGTCGACG CGATCGACAG CGTGCGCACG AAGGTCGCGC TGATCGCGTG GTGCGTCGCG CGCGCGCAGC CGCTCGTGAC GGTCGGCGGC GCGGGCGGCC AGCTCGATCC GACCCGCATC CGCATTGACG ATCTCGCGCA GACGATCCAG GACCCGCTGC TGTCGAAGGT GCGCGCGCAA CTGCGCAAGC AGCACGGTTT CCCGCGCGGG CCGAAAGCCC GGTTCAAGGT GAGCGCCGTC TATTCGGACG AGCCGCTGAT CTATCCGGAG GCGGCCGTGT GCGACGTCGA CGATGTCGCG ATGCACACCG CAACCGACGC GCAGGCGCCG GGGCCGACCG GGCTCAATTG CGCGGGCTTC GGCTCGAGCG TGTGCGTGAC CGCGAGCTTC GGGTTCGCGG CGGCCGCGCA TGCGCTGCGT GCGCTCGCCG CGCGGGCGGG GCGCTAA
|
Protein sequence | MSRTDAIATP HDVTPQPSGE LDADRARRFG GVARLYGADA LAAFERARVA VIGIGGVGSW AAEALARSAV GELTLIDLDN VAESNTNRQI HALDGNYGKP KVDAMAERIA LIDPACRVVK IEDFVEPDNL DALLGGGFDY IVDAIDSVRT KVALIAWCVA RAQPLVTVGG AGGQLDPTRI RIDDLAQTIQ DPLLSKVRAQ LRKQHGFPRG PKARFKVSAV YSDEPLIYPE AAVCDVDDVA MHTATDAQAP GPTGLNCAGF GSSVCVTASF GFAAAAHALR ALAARAGR
|
| |