Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_2857 |
Symbol | |
ID | 4074086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008010 |
Strand | - |
Start bp | 166510 |
End bp | 167481 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641228623 |
Product | tryptophan 2,3-dioxygenase |
Protein accession | YP_594360 |
Protein GI | 94972320 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3483] Tryptophan 2,3-dioxygenase (vermilion) |
TIGRFAM ID | [TIGR03036] tryptophan 2,3-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.202528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGATC TCCAGCGTGA GGCCCGCAAG GTCGAGGATG GGGGAAGGCA TGGACCCAGG GTACCGCCGC CGAGAAAGAG GCCAGCGCCG CTGCGCACTT CCGCTACTCT TCTAGACATG GCAGACGCGG GCGACAGACA CGTGACCGAC CGGGACGCAC CCGAGCGGGC ACAACTGAAT TTTGAGCAGC GGCTCAGCTA CGGGGACTAC CTGCGAACCG ACGTGCTGCT GAGTGCACAC CGGCCCATCA CACCCGCGCA CGACGAGCAC CTCTTTATCA CCGTGCACCA TGTCTCGGAG CTGTGGCTGG GCCTGATTAT CCGTGAGGTG CAGGCGGCGA TGGCGCTGCT CTCGGCGGGC GTGATCGATA CGCCGCTGAA GCTGCTGACG CGGGTGGTGC GCGCACAGGA ACAGTTGACA GCGGCCTGGG AGGTGCTCAA GACCATGACG CCCGCCGACT ACCTCCAGTT TCGGGGGGCC TTCGGCCAGG CGTCGGGCTT TCAATCGGCG CAGTACCGCA TGCTGGAGGT ATTGCTGGGC AACCGCAACC CCACGCTGCT GCGGCCCTTT GAGCACCGCC CGGACCTGCA TGAACCGCTG CTCGCGGCGC TGCACGCTCC CAGCCTGTAC GACCTGACGC TGCGCCTGCT GGCCGCGCGC GGCTTCGCGC TTCCCAGGGA GGTGCTGGAG CGCGACTTCA GCCAGCCCCC CAGCGAGCAC ACCGCTGTTC TGGACGCCTG GCTGGCAGTC TACCGCGATC CCGAGCGCTT CTGGGACCTC TACGAGCTGG CCGAGAAGCT ACTGGACGTA GAAGACCATT TCCGTGCCTG GCGCTTCAAC CACCTCACGA CGGTGGAGCG CACCATCGGC TTCAAACCCG GCAGCGGAGG CACCAGCGGA GCGGGTTACC TGCGCCGCGC CCTCAGCGTC GTGCTGTTTC CAGAACTCTG GCAGGTGCGC ACCCACCTGT AG
|
Protein sequence | MVDLQREARK VEDGGRHGPR VPPPRKRPAP LRTSATLLDM ADAGDRHVTD RDAPERAQLN FEQRLSYGDY LRTDVLLSAH RPITPAHDEH LFITVHHVSE LWLGLIIREV QAAMALLSAG VIDTPLKLLT RVVRAQEQLT AAWEVLKTMT PADYLQFRGA FGQASGFQSA QYRMLEVLLG NRNPTLLRPF EHRPDLHEPL LAALHAPSLY DLTLRLLAAR GFALPREVLE RDFSQPPSEH TAVLDAWLAV YRDPERFWDL YELAEKLLDV EDHFRAWRFN HLTTVERTIG FKPGSGGTSG AGYLRRALSV VLFPELWQVR THL
|
| |