Gene Information |
Locus tag | BURPS1710b_A2229 |
Symbol | |
ID | 3692611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 2720344 |
End bp | 2721489 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637732483 |
Product | putative alpha-ketoglutarate-dependent taurine dioxygenase |
Protein accession | YP_337380 |
Protein GI | 76819790 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
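The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions, which the reported gene length agrees with, and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 2720344
END_BP = 2721489


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues: codon count minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1146, matching "Gene Length"
print(protein_length(length_bp))  # 381, matching "Protein Length"
```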
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.124177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGGGAAA TCTGCGCCTG CCAAGACACC GACAGCCTGT TCGACGTGTT CAACGCTGCC GCGGCGAACC GCGCCTCCAA AGGCTATGGG GCCTCCATTG CCGCCGTGGC GTTCGGCGGC TTCGCGGCGG CCCTCATTGC CTACGCCTTT GCGCCCGAGC AGCTACAAGT GGGGCGCGAC AGCATCGTTT CGAGAAACGG TCCGCCTGAA GTCGTATCAA CCATGTCCAT GTTCAAGGAG CAAGAAGCCA TGAATGACAC TGTGCGGCGG CGCCGCTCTG AGGCGCTAAG CATTCGACCG CTCTCGGGCC ACATTGGCGC TGAGGTGCAG GGTATCCAAC TCGGCTCGCA GATGGCCCCG AACGACATTC GCTTCATCAC CCAGGCGCTG CTGACGCACC GTGTCATCTT CTTTCGGCGG CAGCACCATC TCGACGACCT GGCGCAGGAA CTGTTTGCCC AAGCCTTTGG CGAGATCGTC AAACACCCCA CCATGGGTGG CAAGACTGGC TCCGCCATTC TGGAACTGCA CTCACACGAA GGAGGGCGAG CGAACTCCTG GCACACCGAT GTGACCTTCG GTCTTCGGCC CCCGAAGCTC TCAGTCCTGC GTGCCTTGGC CCTGCCCGAT GCGGGCGGCG ACACCGTGTG GGCCAACACG GTGGCTGCCT ACCAGCATCT GCCATCTTCC TTGCAGGACC TGGTGGACAA GCTGTGGGCT GTCCATGGCA ACGACTTCGA CTATGCCGCA AGCCGCGTCG AGCTCCTGCA CGATCCCGTA GCCAAGGAGT ACCGCAAGAA GTACGCAGCC CAAGTCATCA AGACGGAGCA CCCTGTCGTG CAGATCCACC CTGAGACCGG CGAGAAGAGC TTGCTGCTGG GGCACTATGC TCAGCGCTTC GTTCAGTACG ATACCCATGA TTCGAACCGG CTCTACGAAA TCCTTCAGGC GCACATCACG CGATTGGAGA ACACAGTTCG CTGGCATTGG GCAGCCGGCG ACGTCGCGAT CTGGGACAAC CGATCCACCC AGCACTACGC CATCAATGAC TATGGCGACG CCACGCGGGT AATGCGCCGT GTGACGGTCA TCGGAGATAT TCCCGTCGCC GTGGACGGAC GCAAGAGCGT CCCCCACGAG GCTTGA
|
Protein sequence | MWEICACQDT DSLFDVFNAA AANRASKGYG ASIAAVAFGG FAAALIAYAF APEQLQVGRD SIVSRNGPPE VVSTMSMFKE QEAMNDTVRR RRSEALSIRP LSGHIGAEVQ GIQLGSQMAP NDIRFITQAL LTHRVIFFRR QHHLDDLAQE LFAQAFGEIV KHPTMGGKTG SAILELHSHE GGRANSWHTD VTFGLRPPKL SVLRALALPD AGGDTVWANT VAAYQHLPSS LQDLVDKLWA VHGNDFDYAA SRVELLHDPV AKEYRKKYAA QVIKTEHPVV QIHPETGEKS LLLGHYAQRF VQYDTHDSNR LYEILQAHIT RLENTVRWHW AAGDVAIWDN RSTQHYAIND YGDATRVMRR VTVIGDIPVA VDGRKSVPHE A
|
| |
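The gene sequence is printed in space-separated blocks of ten bases. It begins with GTG, which is read as methionine when it serves as an initiation codon in bacteria, and it ends with the stop codon TGA. The sketch below shows one way to strip the block spacing and run basic sanity checks on such a sequence. The helper names are hypothetical, and the start-codon set is an assumed common subset of the initiators allowed under translation table 11, not the full list.

```python
# Assumption: a common subset of translation table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if seq is a whole number of codons with a start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First three blocks of the gene sequence in this record.
head = clean_sequence("GTGTGGGAAA TCTGCGCCTG CCAAGACACC")
print(head[:3] in START_CODONS)

# Toy CDS built from the record's first and last codons (GTG ... GCT TGA).
toy = clean_sequence("GTG GCT TGA")
print(looks_like_cds(toy), gc_fraction(toy))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value to compare against the GC content field, and `looks_like_cds` checks that the length is a multiple of three with recognizable start and stop codons.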