Gene Dgeo_2857 details

Gene Information

Locus tag: Dgeo_2857
Symbol:
ID: 4074086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008010
Strand:
Start bp: 166510
End bp: 167481
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 66%
IMG OID: 641228623
Product: tryptophan 2,3-dioxygenase
Protein accession: YP_594360
Protein GI: 94972320
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3483] Tryptophan 2,3-dioxygenase (vermilion)
TIGRFAM ID: [TIGR03036] tryptophan 2,3-dioxygenase
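
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 166510
end_bp = 167481
gene_length_bp = 972
protein_length_aa = 323


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for the values listed in this record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions, 167481 - 166510 + 1 gives 972 bp, and 972 / 3 - 1 gives 323 aa, which matches the listed gene and protein lengths.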


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.202528
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGATC TCCAGCGTGA GGCCCGCAAG GTCGAGGATG GGGGAAGGCA TGGACCCAGG 
GTACCGCCGC CGAGAAAGAG GCCAGCGCCG CTGCGCACTT CCGCTACTCT TCTAGACATG
GCAGACGCGG GCGACAGACA CGTGACCGAC CGGGACGCAC CCGAGCGGGC ACAACTGAAT
TTTGAGCAGC GGCTCAGCTA CGGGGACTAC CTGCGAACCG ACGTGCTGCT GAGTGCACAC
CGGCCCATCA CACCCGCGCA CGACGAGCAC CTCTTTATCA CCGTGCACCA TGTCTCGGAG
CTGTGGCTGG GCCTGATTAT CCGTGAGGTG CAGGCGGCGA TGGCGCTGCT CTCGGCGGGC
GTGATCGATA CGCCGCTGAA GCTGCTGACG CGGGTGGTGC GCGCACAGGA ACAGTTGACA
GCGGCCTGGG AGGTGCTCAA GACCATGACG CCCGCCGACT ACCTCCAGTT TCGGGGGGCC
TTCGGCCAGG CGTCGGGCTT TCAATCGGCG CAGTACCGCA TGCTGGAGGT ATTGCTGGGC
AACCGCAACC CCACGCTGCT GCGGCCCTTT GAGCACCGCC CGGACCTGCA TGAACCGCTG
CTCGCGGCGC TGCACGCTCC CAGCCTGTAC GACCTGACGC TGCGCCTGCT GGCCGCGCGC
GGCTTCGCGC TTCCCAGGGA GGTGCTGGAG CGCGACTTCA GCCAGCCCCC CAGCGAGCAC
ACCGCTGTTC TGGACGCCTG GCTGGCAGTC TACCGCGATC CCGAGCGCTT CTGGGACCTC
TACGAGCTGG CCGAGAAGCT ACTGGACGTA GAAGACCATT TCCGTGCCTG GCGCTTCAAC
CACCTCACGA CGGTGGAGCG CACCATCGGC TTCAAACCCG GCAGCGGAGG CACCAGCGGA
GCGGGTTACC TGCGCCGCGC CCTCAGCGTC GTGCTGTTTC CAGAACTCTG GCAGGTGCGC
ACCCACCTGT AG
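
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before its length or GC content can be computed. The sketch below shows one way to do that; the helper names are hypothetical, and the short input string is a toy example, not the sequence from this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("ATGC GGCC\n")
print(len(toy), gc_fraction(toy))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` would let the listed gene length and GC content be compared with values computed directly from the sequence.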
 
Protein sequence
MVDLQREARK VEDGGRHGPR VPPPRKRPAP LRTSATLLDM ADAGDRHVTD RDAPERAQLN 
FEQRLSYGDY LRTDVLLSAH RPITPAHDEH LFITVHHVSE LWLGLIIREV QAAMALLSAG
VIDTPLKLLT RVVRAQEQLT AAWEVLKTMT PADYLQFRGA FGQASGFQSA QYRMLEVLLG
NRNPTLLRPF EHRPDLHEPL LAALHAPSLY DLTLRLLAAR GFALPREVLE RDFSQPPSEH
TAVLDAWLAV YRDPERFWDL YELAEKLLDV EDHFRAWRFN HLTTVERTIG FKPGSGGTSG
AGYLRRALSV VLFPELWQVR THL