Gene BURPS1710b_A2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2229 
Symbol 
ID3692611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2720344 
End bp2721489 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content62% 
IMG OID637732483 
Productputative alpha-ketoglutarate-dependent taurine dioxygenase 
Protein accessionYP_337380 
Protein GI76819790 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.124177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGGAAA TCTGCGCCTG CCAAGACACC GACAGCCTGT TCGACGTGTT CAACGCTGCC 
GCGGCGAACC GCGCCTCCAA AGGCTATGGG GCCTCCATTG CCGCCGTGGC GTTCGGCGGC
TTCGCGGCGG CCCTCATTGC CTACGCCTTT GCGCCCGAGC AGCTACAAGT GGGGCGCGAC
AGCATCGTTT CGAGAAACGG TCCGCCTGAA GTCGTATCAA CCATGTCCAT GTTCAAGGAG
CAAGAAGCCA TGAATGACAC TGTGCGGCGG CGCCGCTCTG AGGCGCTAAG CATTCGACCG
CTCTCGGGCC ACATTGGCGC TGAGGTGCAG GGTATCCAAC TCGGCTCGCA GATGGCCCCG
AACGACATTC GCTTCATCAC CCAGGCGCTG CTGACGCACC GTGTCATCTT CTTTCGGCGG
CAGCACCATC TCGACGACCT GGCGCAGGAA CTGTTTGCCC AAGCCTTTGG CGAGATCGTC
AAACACCCCA CCATGGGTGG CAAGACTGGC TCCGCCATTC TGGAACTGCA CTCACACGAA
GGAGGGCGAG CGAACTCCTG GCACACCGAT GTGACCTTCG GTCTTCGGCC CCCGAAGCTC
TCAGTCCTGC GTGCCTTGGC CCTGCCCGAT GCGGGCGGCG ACACCGTGTG GGCCAACACG
GTGGCTGCCT ACCAGCATCT GCCATCTTCC TTGCAGGACC TGGTGGACAA GCTGTGGGCT
GTCCATGGCA ACGACTTCGA CTATGCCGCA AGCCGCGTCG AGCTCCTGCA CGATCCCGTA
GCCAAGGAGT ACCGCAAGAA GTACGCAGCC CAAGTCATCA AGACGGAGCA CCCTGTCGTG
CAGATCCACC CTGAGACCGG CGAGAAGAGC TTGCTGCTGG GGCACTATGC TCAGCGCTTC
GTTCAGTACG ATACCCATGA TTCGAACCGG CTCTACGAAA TCCTTCAGGC GCACATCACG
CGATTGGAGA ACACAGTTCG CTGGCATTGG GCAGCCGGCG ACGTCGCGAT CTGGGACAAC
CGATCCACCC AGCACTACGC CATCAATGAC TATGGCGACG CCACGCGGGT AATGCGCCGT
GTGACGGTCA TCGGAGATAT TCCCGTCGCC GTGGACGGAC GCAAGAGCGT CCCCCACGAG
GCTTGA
 
Protein sequence
MWEICACQDT DSLFDVFNAA AANRASKGYG ASIAAVAFGG FAAALIAYAF APEQLQVGRD 
SIVSRNGPPE VVSTMSMFKE QEAMNDTVRR RRSEALSIRP LSGHIGAEVQ GIQLGSQMAP
NDIRFITQAL LTHRVIFFRR QHHLDDLAQE LFAQAFGEIV KHPTMGGKTG SAILELHSHE
GGRANSWHTD VTFGLRPPKL SVLRALALPD AGGDTVWANT VAAYQHLPSS LQDLVDKLWA
VHGNDFDYAA SRVELLHDPV AKEYRKKYAA QVIKTEHPVV QIHPETGEKS LLLGHYAQRF
VQYDTHDSNR LYEILQAHIT RLENTVRWHW AAGDVAIWDN RSTQHYAIND YGDATRVMRR
VTVIGDIPVA VDGRKSVPHE A