Gene Dgeo_3109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_3109 
Symbol 
ID5687572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_009939 
Strand
Start bp199490 
End bp200770 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content60% 
IMG OID641262572 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001527846 
Protein GI158421619 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAATC CGAACTGGAA CTGGCGCCCT TTGGGTGAGC TGTTTGAGAT CGGCGCCGGC 
AAAACGATGT CCGCAGCGGC GCGGGCGGGG GCCGACAAGG TGCCGTTTCT GCGCACATCG
AACGTCCTCT GGGACGAGAT CGATCTCACC CAGGTCGACG AAATGTCGAT TTCCCCGACC
GAGTTGGTCG ACAAGAGCCT CAAGGCTGGG GATCTGCTGG TCTGCGAGGG GGGGGAGATC
GGGCGTGCGG CCGTCTGGGA TGGTCGCGTG CCGGTGATGT CCTTCCAGAA CCACCTTCAT
CGACTACGCC GCAAACAGGA CGATGTCGAT GCACATTTCT ATGTGTACTT TCTGCAGAGC
GCGTTCACCC AGCTCGGCAT CTTCGAGGGC GCCGGCAACA AGACAACGAT CCCGAATCTC
TCGCGCAACC GGCTCGCGGC CCTGGATGTA CCCCACCCCC CTAAGCCGGA ACAGCAGTCC
GTGGCACAGG TGCTGGCCAA GGTGCGAGAA GCCATCGCTG TTCACGATCA GGCGACATCT
ACCGCTTTGG AGCTGAAACA TGCGGTGATG AACGACCTGT TCACGCGCGG CCTACGTGGC
GAGCCCCAGA AAGAAACCGA GATCGGGCTG GTGCCGGAAA GCTGGGCCGA GGTTTCCATC
GCGGACCTGG GTGAAATCGT TACCGGCACC ACGCCGCCAA CAAGGGAGCG CGCCTACTAC
GATGACGGGA ACATTCCTTT CATCTCGCCG GGTGACATTG AACACGGGAC CCCCATTGCC
TCAACGCAGA AGTGCATCAC GGACTCTGGA CTTGCCGTTT CGCGCGCACT TCCCGCAGGC
ACGACTTGCG TGGTGTGCAT TGGCTCGACC ATCGGCAAGG TCGGACGCAC AACGGCGGCA
GCCAGTGCCA CCAACCAACA AATCAACGCC ATCGTTCCGG GCGTGGGCTA TGACCCGAAC
TATCTTTCGC ACTTGCTCAC TTACCAGTCA AACATTGTGC GCAACGCAGC CTCACCCAGT
CCAGTTCCGA TTCTGAGCAA GGGCGCATTC GAGAAACTCG TCTTGTTCAC CTCGACGAAT
CCCGATGAAC AGGTAGAGAT TGCCACCATC CTTGACGCCG TCGACCGCAA GATCGACCTG
CACCAGAAGA AGCGCAAGGT GGTGGAGGAG CTCTTCGAGT CCCTGCTACA CAAGCTCATG
ACCGGCGAGA TCGCCGTGTC GGATCTGGAT CTGTCGGCAC TAGCCCCGGC CTCGACGCAA
CTCGAGGAGG CCACGGCATG A
 
Protein sequence
MTNPNWNWRP LGELFEIGAG KTMSAAARAG ADKVPFLRTS NVLWDEIDLT QVDEMSISPT 
ELVDKSLKAG DLLVCEGGEI GRAAVWDGRV PVMSFQNHLH RLRRKQDDVD AHFYVYFLQS
AFTQLGIFEG AGNKTTIPNL SRNRLAALDV PHPPKPEQQS VAQVLAKVRE AIAVHDQATS
TALELKHAVM NDLFTRGLRG EPQKETEIGL VPESWAEVSI ADLGEIVTGT TPPTRERAYY
DDGNIPFISP GDIEHGTPIA STQKCITDSG LAVSRALPAG TTCVVCIGST IGKVGRTTAA
ASATNQQINA IVPGVGYDPN YLSHLLTYQS NIVRNAASPS PVPILSKGAF EKLVLFTSTN
PDEQVEIATI LDAVDRKIDL HQKKRKVVEE LFESLLHKLM TGEIAVSDLD LSALAPASTQ
LEEATA