Gene Information |
Locus tag | BTH_I0709 |
Symbol | |
ID | 3849849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 818322 |
End bp | 819272 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637840382 |
Product | tryptophan 2,3-dioxygenase family protein |
Protein accession | YP_441265 |
Protein GI | 83719842 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3483] Tryptophan 2,3-dioxygenase (vermilion) |
TIGRFAM ID | [TIGR03036] tryptophan 2,3-dioxygenase |
|
|
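The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above; function names are hypothetical.
START_BP = 818322
END_BP = 819272


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 951 316
```

Both results agree with the Gene Length (951 bp) and Protein Length (316 aa) rows.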
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.113949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGACAA TCGTGAATTC AGGTCACATG CAGCCGCCCC GCGACGACGA CGCGCCCCGC TGCCCGTTCG CCGGCGCTCA CGCGCCCGAC GCGCCGCACG TGAGCCCCGC CGCCGGCGAA GACGACGCGC AGGCCGGCTG GCATCGCGCG CAGCTCGACT TCTCGCAGTC GATGAGCTAC GGCGATTATC TGTCGCTCGA TCCGATCCTC GATGCGCAAC ATCCGCGCTC GCCCGATCAC AACGAGATGC TGTTCATCAT CCAGCATCAG ACGAGCGAGC TGTGGATGAA GCTCGCGCTC TACGAGCTGC GCGCGGCGCT CGCGTCGATC CGTGACGACG CGCTGCCGCC CGCGTTCAAG ATGCTCGCGC GCGTGTCGCG CGTGCTCGAG CAGCTCGTGC AGGCATGGAA CGTGCTCGCG ACGATGACGC CGTCCGAGTA TTCGGCGATG CGGCCGTACC TGGGCGCGTC GTCGGGTTTC CAGTCGTACC AGTATCGCGA GCTCGAGTTC ATCCTCGGCA ACAAGAACGC GCAGATGCTG CGTCCGCATG CGCACCGGCC GGCGATCCAC GCGCATCTCG AGGCGTCGCT GCAATCGCCT TCGCTGTACG ACGAAGTGAT TCGCCTGCTC GCGCGCCGCG GCTTTCCGAT CGCGCCCGAG CGGCTCGACG CGGACTGGAC GCAGCCGACG CGCCACGATC CGACCGTCGA GGCCGCGTGG CTCGCCGTGT ACCGCGAGCC GAACGCGCAC TGGGAGCTGT ACGAGATGGC CGAAGAGCTC GTCGATCTCG AGGACGCGTT CCGCCAATGG CGCTTCCGCC ACGTGACGAC GGTCGAGCGG ATCATCGGCT TCAAGCAGGG CACGGGCGGC ACGAGCGGCG CGCCGTATCT GCGCAAGATG CTCGACGTCG TGCTGTTCCC CGAGCTCTGG CACGTGCGCA CGACGCTGTA G
|
Protein sequence | METIVNSGHM QPPRDDDAPR CPFAGAHAPD APHVSPAAGE DDAQAGWHRA QLDFSQSMSY GDYLSLDPIL DAQHPRSPDH NEMLFIIQHQ TSELWMKLAL YELRAALASI RDDALPPAFK MLARVSRVLE QLVQAWNVLA TMTPSEYSAM RPYLGASSGF QSYQYRELEF ILGNKNAQML RPHAHRPAIH AHLEASLQSP SLYDEVIRLL ARRGFPIAPE RLDADWTQPT RHDPTVEAAW LAVYREPNAH WELYEMAEEL VDLEDAFRQW RFRHVTTVER IIGFKQGTGG TSGAPYLRKM LDVVLFPELW HVRTTL
|
| |
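The GC content and protein sequence fields can be checked directly against the gene sequence. The sketch below is one way to do that under stated assumptions: the Gene sequence row is taken to be in coding orientation already (it begins with ATG and ends with TAG, although the gene is on the minus strand), so no reverse complement is applied; and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence row.
first_blocks = "ATGGAGACAA TCGTGAATTC AGGTCACATG"
print(translate(first_blocks))  # METIVNSGHM
```

Passing the full Gene sequence value to `translate` and `gc_percent` gives values to compare with the Protein sequence and GC content rows.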