Gene Information |
Locus tag | BURPS1106A_0895 |
Symbol | kynA |
ID | 4901157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 881160 |
End bp | 882221 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640134125 |
Product | tryptophan 2,3-dioxygenase |
Protein accession | YP_001065176 |
Protein GI | 126452072 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3483] Tryptophan 2,3-dioxygenase (vermilion) |
TIGRFAM ID | [TIGR03036] tryptophan 2,3-dioxygenase |
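The length fields in the table above are consistent with each other. A minimal sketch of the arithmetic follows, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(881160, 882221)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) rows.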
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.284823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCG CCCGAGTTCG CCACGCGCGC CGCGGTGACC TGAGCCCGCG CGAGCGCCGC GCGAAGCGTG CGGCCATCGG CCGGATCGGC CGCCCGCACC GCCCCGTCTA TTCCATGGAG ACAAACGTGA ATTCAGGTCA CATGCAGCCG CCCGGCGACG ACGCCGCGCC CCGCTGCCCG TTCGCCGGCG CGCACGCGCC CGACGCGCCG CACGTGCCCG AGGCCGCCGG CGACGACGCG CAGGCCGGCT GGCATCGCGC GCAGCTCGAC TTCTCGCAAT CGATGAGCTA CGGCGATTAC CTGTCGCTCG ACCCGATCCT CGATGCGCAG CATCCGCGCT CGCCCGATCA CAACGAGATG CTGTTCATCA TCCAGCATCA GACGAGCGAG CTGTGGATGA AGCTCGCGCT CTACGAACTG CGCGCGGCGC TCGCGTCGAT CCGCGACGAC GCGCTGCCGC CCGCGTTCAA GATGCTCGCG CGCGTGTCGC GCGTGCTCGA GCAGCTCGTG CAGGCGTGGA ACGTGCTCGC GACGATGACG CCGTCCGAAT ACTCGGCGAT GCGGCCGTAT CTCGGCGCGT CGTCGGGCTT CCAGTCGTAC CAGTATCGGG AGCTGGAGTT CATCCTCGGC AACAAGAACG CGCAGATGCT GCGCCCGCAC GCGCACCGGC CGGCGATTCA TGCGCATCTG GAAGCGTCGC TGCAGGCGCC GTCGCTATAC GATGAAGTGA TTCGCCTGCT CGCGCGGCGC GGCTTTCCGA TCGCGCCCGA GCGGCTCGAC GCCGACTGGA CGCAGCCGAC GCGCCACGAT CGCACCGTCG AGGCCGCGTG GCTCGCGGTG TACCGCGAGC CGAACGCGCA CTGGGAGCTG TACGAGATGG CCGAAGAGCT CGTCGATCTC GAGGACGCGT TCCGCCAATG GCGCTTCCGC CACGTGACGA CGGTCGAGCG GATCATCGGC TTCAAGCAGG GCACGGGCGG CACGAGCGGC GCGCCGTATC TGCGCAAGAT GCTCGACGTC GTGCTGTTCC CCGAACTCTG GCACGTGCGC ACGACGCTGT AG
|
Protein sequence | MARARVRHAR RGDLSPRERR AKRAAIGRIG RPHRPVYSME TNVNSGHMQP PGDDAAPRCP FAGAHAPDAP HVPEAAGDDA QAGWHRAQLD FSQSMSYGDY LSLDPILDAQ HPRSPDHNEM LFIIQHQTSE LWMKLALYEL RAALASIRDD ALPPAFKMLA RVSRVLEQLV QAWNVLATMT PSEYSAMRPY LGASSGFQSY QYRELEFILG NKNAQMLRPH AHRPAIHAHL EASLQAPSLY DEVIRLLARR GFPIAPERLD ADWTQPTRHD RTVEAAWLAV YREPNAHWEL YEMAEELVDL EDAFRQWRFR HVTTVERIIG FKQGTGGTSG APYLRKMLDV VLFPELWHVR TTL
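The gene sequence above is listed in coding orientation (it begins with ATG although the gene lies on the minus strand), so it can be translated directly. The sketch below is one way to check the protein sequence and the GC content against the nucleotide sequence using only the Python standard library. It assumes translation table 11, whose codon-to-amino-acid assignments are the same as those of the standard code. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments of
# translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base groups of the gene sequence listed above.
print(translate("ATGGCGCGCG CCCGAGTTCG"))  # MARARV
```

The printed prefix matches the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison for the whole protein and for the reported GC content.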