Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2097 |
Symbol | |
ID | 4902075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2091137 |
End bp | 2092384 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640135327 |
Product | EutG protein |
Protein accession | YP_001066362 |
Protein GI | 126452408 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.531832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCGTG CATTTTCGTT TCCCGGCATT ACGGGCGGCT CAACACGAGA CGCACCGACA CAACCAGGAG ACGACATGAG TAATCTGAGC AGCGCGGAGC GCACCGACAG CTTCTTCATT CCCCGCGTGA CCCTGATCGG CCCGGGCTGC GCGCGCGAGA CGGGCGCGCG CGCCAAATCG CTGGGCGCGA AAAAGGCGCT CATCGTGACC GACGCGGGCT TGCACAAGAT GGGGGTGTCC GAGATCGTCG CGGGCCATAT CCGCGAAGCG GGGCTTCAGG CCGCGATCTT TCCCGGCGCG CAGCCCAATC CGACCGACGT CAACGTTCAC GACGGCGTTG AACTCTATCG GCGGGAAGGG TGCGATTTCA TCGTGTCGCT CGGCGGCGGC TCGTCGCACG ACTGCGCGAA GGGCATCGGG CTCGTCACCG CCGGCGGCGG ACATATCCGC GACTACGAAG GCATCGACAA ATCGACGGTG CCGATGACGC CGTTGATTTC GATCAACACG ACGGCGGGCA CGGCGGCGGA GATGACGCGC TTTTGCATCA TCACGAATTC CAGCAATCAC GTGAAGATGG CGATCGTCGA CTGGCGTTGC ACGCCGCTCA TCGCGATCGA CGACCCGAGC CTGATGGTGG CGATGCCGCC CGCGCTGACG GCCGCAACCG GCATGGACGC GCTCACGCAC GCGGTGGAGG CCTACGTTTC CACCGCCGCG ACGCCGATCA CCGACGCCTG CGCCGAAAAG GCGATCGCGT TGATCGGCGA ATGGCTGCCG AAGGCCGTCG CGAACGGCGA ATCGATGCAG GCGCGCGCGG CGATGTGCTA CGCGCAGTAC CTCGCCGGGA TGGCGTTCAA CAATGCGTCG CTCGGCTATG TGCACGCGAT GGCGCACCAG CTCGGCGGGT TCTACAACCT TCCGCACGGG GTCTGCAACG CGATCCTGCT GCCGCACGTG TGCGAGTTCA ACCTGATCGC CGCGCCCGAG CGTTTCGCCG CCATCGCGCC GCTGCTCGGC GTCAGGACGG CGGGCATGAG CACCCCCGAT GCCGCCCGCG CCGCCATTGC GGCGATCCGC GCGCTCTCGG CGTCGATCGG CATCCCGTCG GGCCTGGCCG CGCTCGGCGT GAAGGCTGAA GACCATGAGG TGATGGCCGG CAACGCGCAG AAAGATGCGT GCATGCTGAC CAATCCGCGC AAGGCGACGC TCGCGCAGGT CATCGCGATC TTCGCGGCGG CGATGTGA
|
Protein sequence | MDRAFSFPGI TGGSTRDAPT QPGDDMSNLS SAERTDSFFI PRVTLIGPGC ARETGARAKS LGAKKALIVT DAGLHKMGVS EIVAGHIREA GLQAAIFPGA QPNPTDVNVH DGVELYRREG CDFIVSLGGG SSHDCAKGIG LVTAGGGHIR DYEGIDKSTV PMTPLISINT TAGTAAEMTR FCIITNSSNH VKMAIVDWRC TPLIAIDDPS LMVAMPPALT AATGMDALTH AVEAYVSTAA TPITDACAEK AIALIGEWLP KAVANGESMQ ARAAMCYAQY LAGMAFNNAS LGYVHAMAHQ LGGFYNLPHG VCNAILLPHV CEFNLIAAPE RFAAIAPLLG VRTAGMSTPD AARAAIAAIR ALSASIGIPS GLAALGVKAE DHEVMAGNAQ KDACMLTNPR KATLAQVIAI FAAAM
|
| |