Gene Dgeo_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0054 
Symbol 
ID4058495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp50524 
End bp52458 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content71% 
IMG OID641229050 
Producthypothetical protein 
Protein accessionYP_603526 
Protein GI94984162 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCCG CTCCTAACCC TTCTGAACCC CTCCACCGTC TCCCGGTGCG GCTGCTGGGC 
GATCTCATCT CGCCACGCGC GCTGGAGCGC ATCTTGCAAG ACGCAGCGCA GGCCCGGGGC
CGGACGCCCG AGACCCTCGA CGCCCCCACG CTCGAAGACA TCCTCAAGCG CGAGGTCTTT
AAGCGGCTGC AACTGAGCGT GCCCGCTGCG CTCGCCAAGC GCCGGGTGTC GGACGTGATC
AAAGAGGTGC TGGCCGCCAC TCCAGCCCCG CAAACCCCTC GCAGCGGGGA ACAATCACTG
GAGGTCCTGG AGGAGGGCGC GCGGCGCTTC ACCCTCTATT TTGATTGGCC GGAGACGCAG
CGGCTGCGCG GCGTGCTGGG GGTGGCCCGG CAGCAACAGC AGGCCGGACA AGACATCACG
GCGCTGGTGC GCGAAGGCCA GGACCTGATC AACCTGATGG AGCGCCGTCT GCAAGAAGGG
TTGGTCACGC AGGCACAAGA CCTGGCGGAG CTGCAGGCGG CTTACCAGCG GGTCCAGAGT
ATGGGCGGTA AGGATGTCCG CCGCCTGGAA GGCCTGATCG CGCAGATCAA GGAGGCGCAG
AGCCAGGGCG TGCTGCTGCC CGCCGAAGTG GAGCGGGCGC GCACCATCAC GTTCACCCTG
CGCAAGCTGC TGGAGTCGTC GGTGGTGCAG CCGCTTGAGT CCGGCAAAGC CCCGCCCCTT
CTCGATCCCG AGGCGCAGGC ACGGGTGCTG GCGCTCGAAC AAGAGCACGT TGCTCGGCAA
CTGGCCGACC TCGCCCGCGA GTTTGGGCCG TTGGTGCGCG CCCGCCCAGA GCTGGAAACG
CGCCTGCAGA TCATCCGCAG TCAGCATGCC AGCGGTACGC TCAAGGCCGA GACGGTCGAT
CTCTGGCGCG CTGAGCTGGA AGCCACCCGC GACCTGGTCC TGGCCTCCCA GCGGGAAGAA
CTTGCGGGCC TGGAAGCCCG GCTGGCAGCG CTGCCGGAAA GCCCCGAACT GGCCGAGGCC
CGCACCGCAC TGAACGTGGC TCGGCTCACA CTGGCGGGAG GCGGCCTCGC CACCGATGAG
CTGCGCGACC TGGGGGGTAC CCTGGCCGCG CTGGAGGCGG CCCCCGCGCT GGCTGCGCGC
CTGCTCGCCG GTCAGCGCGA ACTGGCCGAA GTGGAGCGGG CCGCTCGGGA CGTGCCCGGA
GCCAGCGCCG AACTCGCCCC GCAGCTCGCC GCCGCCCGCG AGGCGCTGGC CCGGGGTGAG
GATGTGGAGA TCGACGCGCT GTGGGCTGCC TTGGAGCGCC GCATGGGTCA GGCGGCCCAG
CAGCGCCAAG ACTTCGACGC CCGCGCCGAC TTTGTCATCC GCGAATACGA CACCGTGCGG
CATCTGGCGG GCGAAACCAT TCAGCGACTG GGGCGCCTGG CCGATACCCT GCGCGCGCAG
CGCCGCCTGG GGCCGATGTC AGCGGACGCC CGCGAGCGGT ACGCCCAGAC CCTCGCGGAC
GCTGAGGCCC TGCTGACAGA GGCTCGCGCC GAGTATCAGG CGGCCCAGGA GGTGACGGCC
AGCTTCGGCG CCGAGGCCCT CAGCGGCCTG CTTGATGTCT TTGATTTCGG GGGGGACCCG
GCGGGCGATC TGTTTGGCGC CGCGGCCCCC ACTGAACAGA TGCCCGATGC CGCCGGTCTC
CCCGACGACA CCTGGCTGAT TCGGGGGCGC ACGGTGGTGG CAGGCCGCAC GGACCCGGCG
GTGTCCGGCA TCGCCGCCCT GCTCGAACAG GCCGCGCTGC TGGACGTGCG GGTCCTGCGC
TTTGAAGATC CTCAGGGGGC TTGGGCGGCA CGGCAAGACG GGGGAGGCGG CTGGCGGCTG
GCCCGCGGCC CCAACGCCGC CTCCCTAGAA GACCGAGTGG GCGACTGGCT GGCGAGCGGC
GAGCTCCGGC GTTAG
 
Protein sequence
MTAAPNPSEP LHRLPVRLLG DLISPRALER ILQDAAQARG RTPETLDAPT LEDILKREVF 
KRLQLSVPAA LAKRRVSDVI KEVLAATPAP QTPRSGEQSL EVLEEGARRF TLYFDWPETQ
RLRGVLGVAR QQQQAGQDIT ALVREGQDLI NLMERRLQEG LVTQAQDLAE LQAAYQRVQS
MGGKDVRRLE GLIAQIKEAQ SQGVLLPAEV ERARTITFTL RKLLESSVVQ PLESGKAPPL
LDPEAQARVL ALEQEHVARQ LADLAREFGP LVRARPELET RLQIIRSQHA SGTLKAETVD
LWRAELEATR DLVLASQREE LAGLEARLAA LPESPELAEA RTALNVARLT LAGGGLATDE
LRDLGGTLAA LEAAPALAAR LLAGQRELAE VERAARDVPG ASAELAPQLA AAREALARGE
DVEIDALWAA LERRMGQAAQ QRQDFDARAD FVIREYDTVR HLAGETIQRL GRLADTLRAQ
RRLGPMSADA RERYAQTLAD AEALLTEARA EYQAAQEVTA SFGAEALSGL LDVFDFGGDP
AGDLFGAAAP TEQMPDAAGL PDDTWLIRGR TVVAGRTDPA VSGIAALLEQ AALLDVRVLR
FEDPQGAWAA RQDGGGGWRL ARGPNAASLE DRVGDWLASG ELRR