Gene Dgeo_2455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2455 
Symbol 
ID4073683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp28766 
End bp29956 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID641228498 
Productrhamnose isomerase-related protein 
Protein accessionYP_593963 
Protein GI94971923 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4952] Predicted sugar isomerase 
TIGRFAM ID[TIGR02635] L-rhamnose isomerase, Streptomyces subtype 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAG ACACCCTGTT CCAGGCCCTC GACCGGCAGC GCATTGAGAC GCCCTCGTGG 
GGCTACGGTA ACTCGGGGAC CCGCTTCAAG ACGTTTGCCG CGCCGGGTGC GGCGCGCACC
ATCTGGGAGA AGCTTGAGGA CGCCGCCGAA GTGCACCGTC TCAGCGGCAT CGCGCCCACT
GTCGCCCTGC ATCTGCCGTG GGATGAGGTT GAAGACTACG CCGCACTGCG CCGCTTTGCC
GAGGAACGCG GTCTCACCCT CGGGGCAATC AATCCCAATG TGTTTCAGGC CGACACCTAC
CGGCTTGGGA GCATCGCCAA TCCGGATCCG AAGGTGCGGG AGCAGGCCCT CGCGCATGTG
CTGGACTGCG TAGCGGTGAT GTCGCAGACC GGTGCCCGTG ACCTCTCCCT GTGGGTGGCG
GACGGCACGA ACTACGCCGG GCAAGATGAC CTGCGGGCCA GGAAACGCCG CGTGCGGGAG
GCGTTGCAGC GGGTGCATGA CGCGCTTCCC GACGGCACGC GTTTGCTGGT TGAATACAAG
CTGTTTGAAC CCGCCTTCTA TGCCACCGAT CTCTTTGATT GGGGGGCTGC CTACGCCCAC
TGTCTCGCGG TGGGGGAAAA GGCCCAGGTG CTGGTGGACC TAGGGCACCA CGCGCAGAGT
GTGAATATCG AGCAGATCGT TGCCTTTCTC CTCGATGAGG GACGGCTGGG CGGTTTTCAC
TTCAATGCCC GCCGCTATGC CGATGACGAT CTGATCGTGG GCACCACCAA TCCCTTTGAG
CTGTTTTGCA TCTACGCAGA ACTTGTGGCG GCGGAGGAAG CCGCAGACGA CCTCACCCGT
GCCACCGCGC GCAACGTTGC CTACATGATC GACCAGAGTC ACAACATTGA GCCGAAAGTA
GAAGCGATGC TCCAGAGCGT TTTGAATTGC CAAGAAGCAT ATGCCAAAGC GCTGCTGATT
GACCGTGAAC GCCTGCAGGC CGCGCAGCAG GGCGGGGATG TGCTGGAGGC GCACCGGGTG
TTGCTCGAGG CTTTCCGCAC CGATGTTCGC CCCCTCCTCG CCGACTGGCG CCGCCGGCGC
GGTCTGCCGG AAGACCCCAT CGCCGCCCAC CGCGCGAGTG GCTACCAGGC GAGGGTGGCC
GCAGAGCGCG GGACCGTCGC CGCGGCTGGA GGGTTCCCGG TGGGGAGTTG A
 
Protein sequence
MNPDTLFQAL DRQRIETPSW GYGNSGTRFK TFAAPGAART IWEKLEDAAE VHRLSGIAPT 
VALHLPWDEV EDYAALRRFA EERGLTLGAI NPNVFQADTY RLGSIANPDP KVREQALAHV
LDCVAVMSQT GARDLSLWVA DGTNYAGQDD LRARKRRVRE ALQRVHDALP DGTRLLVEYK
LFEPAFYATD LFDWGAAYAH CLAVGEKAQV LVDLGHHAQS VNIEQIVAFL LDEGRLGGFH
FNARRYADDD LIVGTTNPFE LFCIYAELVA AEEAADDLTR ATARNVAYMI DQSHNIEPKV
EAMLQSVLNC QEAYAKALLI DRERLQAAQQ GGDVLEAHRV LLEAFRTDVR PLLADWRRRR
GLPEDPIAAH RASGYQARVA AERGTVAAAG GFPVGS