Gene Information |
Locus tag | BMASAVP1_A0650 |
Symbol | kynA |
ID | 4679695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | - |
Start bp | 663755 |
End bp | 664675 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639844924 |
Product | tryptophan 2,3-dioxygenase |
Protein accession | YP_991996 |
Protein GI | 121600523 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3483] Tryptophan 2,3-dioxygenase (vermilion) |
TIGRFAM ID | [TIGR03036] tryptophan 2,3-dioxygenase |
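The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; the record does not state either convention. The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(663755, 664675)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (921 bp) and Protein Length (306 aa).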
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00581911 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCCGC CCGGCGACGA CGCCGCGCCC CGCTGCCCGT TTGCCGGCGC GCACGCGCCC GACGCGCCGC ACGTGCCCGA GGCCGCCGGC GACGACGTGC AGGCCGGCTG GCATCGCGCG CAGCTCGACT TCTCGCAATC GATGAGCTAC GGCGATTACC TGTCGCTCGA CCCGATCCTC GATGCGCAGC ATCCGCGCTC GCCCGATCAC AACGAGATGC TGTTCATCAT CCAGCATCAG ACGAGCGAGC TGTGGATGAA GCTCGCGCTC TACGAGCTGC GCGCGGCGCT CGCGTCGATC CGCGACGACG CGCTGCCGCC CGCGTTCAAG ATGCTCGCGC GCGTGTCGCG CGTGCTCGAG CAGCTCGTGC AGGCGTGGAA CGTGCTCGCG ACGATGACGC CGTCCGAATA CTCGGCGATG CGGCCGTATC TCGGCGCGTC GTCGGGCTTC CAGTCGTACC AGTATCGGGA GCTGGAGTTC ATCCTCGGCA ACAAGAACGC GCAGATGCTG CGCCCGCACG CGCACCGGCC GGCGATTCAT GCGCATCTGG AAGCGTCGCT GCAGGCGCCG TCGCTATACG ATGAAGTGAT TCGCCTGCTC GCGCGGCGCG GCTTTCCGAT CGCGCCCGAG CGACTCGACG CCGACTGGAC GCAGCCGACG CGCCACGATC GCACCGTCGA GACCGCGTGG CTCGCGGTGT ACCGCGAGCC GAACGCGCAC TGGGAGCTGT ACGAGATGGC CGAAGAGCTC GTCGATCTCG AGGACGCGTT CCGCCAATGG CGCTTCCGCC ACGTGACGAC GGTCGAGCGG ATCATCGGCT TCAAGCAGGG CACGGGCAGC ACGAGCGGCG CGCCGTATCT GCGCAAGATG CTCGACGTCG TGCTGTTCCC CGAACTCTGG CACGTGCGCA CGACGCTGTA G
|
Protein sequence | MQPPGDDAAP RCPFAGAHAP DAPHVPEAAG DDVQAGWHRA QLDFSQSMSY GDYLSLDPIL DAQHPRSPDH NEMLFIIQHQ TSELWMKLAL YELRAALASI RDDALPPAFK MLARVSRVLE QLVQAWNVLA TMTPSEYSAM RPYLGASSGF QSYQYRELEF ILGNKNAQML RPHAHRPAIH AHLEASLQAP SLYDEVIRLL ARRGFPIAPE RLDADWTQPT RHDRTVETAW LAVYREPNAH WELYEMAEEL VDLEDAFRQW RFRHVTTVER IIGFKQGTGS TSGAPYLRKM LDVVLFPELW HVRTTL
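The two sequence fields can be checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it assumes the gene sequence is given in coding orientation (it begins with ATG), strips the spaces used to group bases in tens, and translates with the standard codon assignments, which translation table 11 shares with the standard code (the tables differ only in permitted start codons). The names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (same assignments as table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for b in bases if b in "GC") / len(bases)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence field, as grouped in the record.
prefix = "ATGCAGCCGC CCGGCGACGA CGCCGCGCCC"
print(translate(prefix))
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after removing its spaces) gives a quick consistency check; `gc_percent` on the same input can be compared with the listed GC content.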