Gene BTH_I0709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I0709 
Symbol 
ID3849849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp818322 
End bp819272 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content68% 
IMG OID637840382 
Producttryptophan 2,3-dioxygenase family protein 
Protein accessionYP_441265 
Protein GI83719842 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3483] Tryptophan 2,3-dioxygenase (vermilion) 
TIGRFAM ID[TIGR03036] tryptophan 2,3-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.113949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGACAA TCGTGAATTC AGGTCACATG CAGCCGCCCC GCGACGACGA CGCGCCCCGC 
TGCCCGTTCG CCGGCGCTCA CGCGCCCGAC GCGCCGCACG TGAGCCCCGC CGCCGGCGAA
GACGACGCGC AGGCCGGCTG GCATCGCGCG CAGCTCGACT TCTCGCAGTC GATGAGCTAC
GGCGATTATC TGTCGCTCGA TCCGATCCTC GATGCGCAAC ATCCGCGCTC GCCCGATCAC
AACGAGATGC TGTTCATCAT CCAGCATCAG ACGAGCGAGC TGTGGATGAA GCTCGCGCTC
TACGAGCTGC GCGCGGCGCT CGCGTCGATC CGTGACGACG CGCTGCCGCC CGCGTTCAAG
ATGCTCGCGC GCGTGTCGCG CGTGCTCGAG CAGCTCGTGC AGGCATGGAA CGTGCTCGCG
ACGATGACGC CGTCCGAGTA TTCGGCGATG CGGCCGTACC TGGGCGCGTC GTCGGGTTTC
CAGTCGTACC AGTATCGCGA GCTCGAGTTC ATCCTCGGCA ACAAGAACGC GCAGATGCTG
CGTCCGCATG CGCACCGGCC GGCGATCCAC GCGCATCTCG AGGCGTCGCT GCAATCGCCT
TCGCTGTACG ACGAAGTGAT TCGCCTGCTC GCGCGCCGCG GCTTTCCGAT CGCGCCCGAG
CGGCTCGACG CGGACTGGAC GCAGCCGACG CGCCACGATC CGACCGTCGA GGCCGCGTGG
CTCGCCGTGT ACCGCGAGCC GAACGCGCAC TGGGAGCTGT ACGAGATGGC CGAAGAGCTC
GTCGATCTCG AGGACGCGTT CCGCCAATGG CGCTTCCGCC ACGTGACGAC GGTCGAGCGG
ATCATCGGCT TCAAGCAGGG CACGGGCGGC ACGAGCGGCG CGCCGTATCT GCGCAAGATG
CTCGACGTCG TGCTGTTCCC CGAGCTCTGG CACGTGCGCA CGACGCTGTA G
 
Protein sequence
METIVNSGHM QPPRDDDAPR CPFAGAHAPD APHVSPAAGE DDAQAGWHRA QLDFSQSMSY 
GDYLSLDPIL DAQHPRSPDH NEMLFIIQHQ TSELWMKLAL YELRAALASI RDDALPPAFK
MLARVSRVLE QLVQAWNVLA TMTPSEYSAM RPYLGASSGF QSYQYRELEF ILGNKNAQML
RPHAHRPAIH AHLEASLQSP SLYDEVIRLL ARRGFPIAPE RLDADWTQPT RHDPTVEAAW
LAVYREPNAH WELYEMAEEL VDLEDAFRQW RFRHVTTVER IIGFKQGTGG TSGAPYLRKM
LDVVLFPELW HVRTTL