Gene Dgeo_1417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1417 
Symbol 
ID4059050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1503561 
End bp1505381 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content65% 
IMG OID641230433 
Producthypothetical protein 
Protein accessionYP_604881 
Protein GI94985517 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0731587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0956406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGGC CAAGTGTGAG TAGGCCGGAC GAGCGCATCG GCATGGTGCT GGGCACCGAG 
GACGCTACCC CGGTGACCTT TTGGTTCGCG GTCAGTCCTG GCGCGAGCGT ACAGATGGAC
GACCTGGTGG TGGTTCGCAC GCAGAAACCG AACGGGCAGC CGGTCCATTT CTACGGCATC
GTTGATCACG TCCGCACCCG GCATGAGGGC GTCACTTTCG ACAGTGACGT GGCCGACGTG
GTGGCGGGTC TGTTGCCGGC CAGCGTGAGC TATGCCGCGC GCGTGCTGGT GACGCGGGTA
GACCCCGAAG ACTTTATTCC GCCTCAGCCC GGCGACGAGG TGCGGCACGC GCGGGGCGAC
GACCTGCGCC TGGCGCTCAG CGCCGACAAG ATGGACTACG CCTTTGCGGG CGGCCTCCTC
GCGGACGGGC AGGTCTTGCC GGTCAATTAC CAGTTCGTGA ACGGGGAGCA GGGCGGCCAC
ATCAACATCA GTGGGATTTC CGGTGTGGCG ACCAAGACGA GTTACGCCCT TTTCCTGCTG
CATTCGATCT TCCGGGGCGG CGTGCTGGCG CAGCGGCGTG AGGGGCACAA CACCCGCGCC
CTGATCTTCA ACGTGAAGGG CGAGGACCTG CTCTTTCTGG ACAAGCCCAA CGCAAAGGTG
GGCCAGAAAG AGGAGGGCGT GCGGGCGCGT AAGGGTTGGC GCGAGGGCCG CTACGACCTG
CTGGGCCTGC CCACCGAACC GTTCCGTGAC GTGCAGTTTC TCGCGCCGCC CAAGGGGGGA
GCGGGCGACG TGATCGTACC GGACGTAGAG CAGCGCTCGG AGGGCGTCAT CCCCTTCGTT
TTCAGCCTGC GCGAATTCTG CACGAAGCGG ATGCTGCCCT ACGTCTTCTC GGACGCCGGA
ACCAGCCTGA ACCTGGGCTT CGTGATCGGC AACATCGAGG AAAAGCTCGC GCGCCTGGCC
GCCGGGGACG ACGCGCCCTA TCTCACGGTC GAGGACTGGC AGCCTGACAC GGAAGTGCTG
CTCTCCGAAA ACCTGCGCTT TGATGAGATG GGCAAGACGC GCATCGAAAC CTTTGGGGCG
CTCGTCTCCT ACCTCGAATA CAAGCTGCTG GAGCAAAACG ACGGCGAGGG CGATTCCAAG
TGGGTGCTCA AGCAAAACGG GGGCACCCTG CGCGCCTTTA TCCGCCGCTT GCGTGGCGTG
CAAAAGCACC TCTCGCCGCT GGTGCGCGGT GACCTCACAC CAGCGCAGGC CGCAAAGTAC
CGCCCGGACA TTCTGAGACA AGGGGTGCAG ACCAGCGTGG TCGACATCCA CAAGCTGGGC
GCACACGCGC AGAGCTTTGT GGTGGGCGTG CTGCTGCGTG ACCTCTTCGA GCACAAGGAG
CGCTACGGGC GGCAAGACAC GGTGTTTGTT GTCCTCGATG AGCTGAACAA GTACGCGCCG
CGCGAGGGAG ATAGCCCAAT CAAGGACGTG CTGCTGGACA TCGCAGAACG CGGGCGCTCC
CTCGGGATCA TCCTGATCGG GGCGCAGCAG ACCGCTTCGG AAGTCGAGCG GCGCATTGTC
TCCAATGCAG CCATTCGTGT GGTGGGTCGC CTCGATCTCG CGGAAGCTGA GCGGCCCGAG
TACCGCTTCC TCCCGCAGAG CTTCCGCGCC CGCGCGGGCA TCCTTCAGCC GGGGACCATG
TTGGTCTCTC AGCCCGACGT CCCCAATCCG GTCCTAGTGG GCTACCCCTT TCCCGCCTGG
GCCACCCGCC GCGACGAGGT GGCCGAGAGT GTGACGGTGC AGGAGACAGA GGACACGGGC
AAAGACTGGC TGGGCCTATA G
 
Protein sequence
MTGPSVSRPD ERIGMVLGTE DATPVTFWFA VSPGASVQMD DLVVVRTQKP NGQPVHFYGI 
VDHVRTRHEG VTFDSDVADV VAGLLPASVS YAARVLVTRV DPEDFIPPQP GDEVRHARGD
DLRLALSADK MDYAFAGGLL ADGQVLPVNY QFVNGEQGGH INISGISGVA TKTSYALFLL
HSIFRGGVLA QRREGHNTRA LIFNVKGEDL LFLDKPNAKV GQKEEGVRAR KGWREGRYDL
LGLPTEPFRD VQFLAPPKGG AGDVIVPDVE QRSEGVIPFV FSLREFCTKR MLPYVFSDAG
TSLNLGFVIG NIEEKLARLA AGDDAPYLTV EDWQPDTEVL LSENLRFDEM GKTRIETFGA
LVSYLEYKLL EQNDGEGDSK WVLKQNGGTL RAFIRRLRGV QKHLSPLVRG DLTPAQAAKY
RPDILRQGVQ TSVVDIHKLG AHAQSFVVGV LLRDLFEHKE RYGRQDTVFV VLDELNKYAP
REGDSPIKDV LLDIAERGRS LGIILIGAQQ TASEVERRIV SNAAIRVVGR LDLAEAERPE
YRFLPQSFRA RAGILQPGTM LVSQPDVPNP VLVGYPFPAW ATRRDEVAES VTVQETEDTG
KDWLGL