Gene Dgeo_1704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1704 
Symbol 
ID4058947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1809837 
End bp1812857 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content66% 
IMG OID641230727 
Producttransposase Tn3 
Protein accessionYP_605168 
Protein GI94985804 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.019193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTTG AATTCCTGAG CGACGACCAG GCCGCCCGCT ATGGCTGCTA CCATACCGAC 
CCCACCCCCG AGCAGCTCAC CCGCTTCTTC TATCTCAGTG AACAGGACCA CGCCTTCATT
GCGCAGCGGC GCCGGGAACA CAACAAGCTG GGCTGCGCTC TCCAGCTCTG CACCCTCCGC
TTCCTGGGCA CCTTCCAACT CGACCCCACC GCCGTCCCTG TGGTCGCCAT TCAGAATGTC
GCTGACCAAC TACAGCTCGA CCCCGCAGTG CTGCCCCAGT ACCTTCGGCG GGTCAAAACG
CGCTTCCAGC ACCAGCAACT GATCCTCGCC CACCTGGACT ACCAGCCGTT CGACGACGTG
CAGGCCTTCC GGTTGATCCG CTGGCTGTAC GCGCAGGTCG CCACCAGCAC CGTCCGTCCC
AGCGTGCTGT TCGATCTGGC GACCGCCCAT CTGGTGTCCC AACGGGTCGT GCTGCCCGGC
GTAACCACAC TCGCCCGTCT CATCGCCCGT GTCCGGGACC GGCTCAGCCG CAAAACCTTT
GAGGGTCTCA GCCATCGCCT GACATCTGAA CAGCGGGCCA ACCTGGAGGC GCTGCTGGTC
CTGTCCGAGG GCGAGCGGCT GACGCCCCTG GAAGTGCTCC GCACTTCCCC CACCCGGGTC
ACCAGCCCGG CCCTGCTGGC CGCCTTGTTG CGGATCGGAC AACTCCGCGA GATCGGCGTC
GGCTCGCTCA ACTTGAGTGA CGTGCCAGAA GGGAGGCGGG CGTTGCTGGC GCGGCACGCT
CAGACAGCCT GGGCGCAGAC CTTGTTGCGA ATGGGCGAGG ACCGGCGGCT GGCGACCCTG
CTGGTCTTTG TCCAGCACCT GGAGCGCACG GCCACCGATG ATGTTCTTGA CCTGTTCGAT
GCCCTGATGA CCTCGCTAGC GCTCAAGGGG GAAGCCAAAC GCCGTCAGGA ACGGTTGCGG
ACCCTCCGTG ACCTCGACCA GGCGGCCCTG GTCTTGCAGG ACGCAGTCCG CGTCCTGCTG
GACGAGTCGG TTCCGGAAGT GGACCTTCGC CGGATAGTGT TCTCCCGGGT CGGGCAGACT
CGGCTCTGGG AGGCGGTGGG AACCGTCCAG GCACTTGCAA GTGAGGACGA TGACACGACG
CCGGAGGCCC TGAGCGGCAG CTATGCCACT GTCCGGCGTT TCCTGCCCAC CTTCCTTAAA
ACCGTCGAGT TCCAGGGGAC ACCGACGGCC AAGCCGCTGC TAGAGGCCTG GCGTTTCCTG
GCCCGGCAGG AAGAGGGTGG ACGGGGCAAG CCGAAATGGA CCGAGGCCCC TCGATCTTTT
GTGCCCAGAG CCTGGGAACG GCGGGTGTTC CCCGCCAAGG GTGAGGTCAA CCGGCAAGCC
TACACCCTCT GCGTGCTGGA CCGGCTCCAA CAGGCGCTGA AGCGCCGTGA GGTCTTCGCG
CCCAGAAGCG AGCGGTACGG CGATCCCCGA GCAGAATTGT TGCAGGGGCC AGGCTGGGAG
GCCGCGCGGG ACGATGTGTC CCGCGCTCTG GGCCGTTCCC TTGACCCCCA GGCGGAACTC
GAACTGTTGC GGACGGAATT GAACGCGGCC TACCGGGAAG TGGAAGAGAA CCTGCCGCAG
AACACGGCAC TCAAACTGGA AGTTCGGGAC GGCCACACGC AGGTCAGCCT GACCCCCCTG
GACGCTCAAC CGGAGCCGCC CAGCCTGGTT CGCCTGCGGG AACAGGTGAC GCTGCGACTG
CCGCAGGTGG AACTTGCGGC TCTGCTGCTC GAAATCCATG CCTTCACGGG CTTCGCGTCG
GCCTTCACGC ACCTGACAGA TGGGAGAGTG CAGGTCAAAG ACCTGCCACT CAGCGTCTGC
GCGGTCCTGC TCGCGCAGGC CTGCAACATC GGGTTGAAAG CCGTGGCCCG GCAGGATGTT
CCTGCCCTGA CCCTTTCCCG CTTGTCGTGG GTCCAGCAGA ACTATGTTCG GGCGGAAACG
ATCACGGCTG CCAATGCCCG CCTGGTGGAT GCCCAGCTTG ACCTGCCGCT GGCACAGTCC
TGGGGCGGCG GGGAAGTGGC GTCGGCGGAC GGGCTGCGCT TCATCGTCCC GGTGCGGACC
ATCCACGCGG GCTGGAACAG CAAGTATTTC GGCTCGCAGC GGGGCGTGAC CTACGACAAC
TTCACCAGCG ACCAGTTCAC GGGTTTTCAC GGGATCGTGG TGCCGGGGAC GCTGCGGGAC
TCACTGTTCA TCCTGGCGGG GCTGCTGGAG CAGCAGACCC GGCTCGACCC CCGCGAGATC
ATGGCGGATA CGCACGGGTA CAGCGATGTC GTGTTCGGCC TGTTCGCCCT GCTGGGCTAC
CAGTTCAGCC CCCGCCTCGC TGATCTTGCA GATCAGCGGT TCTGGCGCTT GGAGAAGAAC
GCCGATTACG GCGCGTTGAA CGATCTGAGC CGCCATGTGG TGAACGAACG GCTGATCGCG
GAACACTGGG AGGACCTGTT GCGGCTGGCG GGGTCATTGA AGCTGGGAAA GGTCAAGGCG
ACGGCGGTGA TGCGGACCTT GCAACGCGGC GGCAGCCTGT CCGGCCTGGG GCGGGCGGTG
GCAGAACTGG GGCGGATCGA GAAGACGCTG TACCTGCTGG CCTATGTGCA GGACGAGGCG
TACCGGCGAC GGATTCTGGG GCAACTGAAT CGCGGGGAGG GACGGCACAG GGTAGGCCGT
GCCGTGTTCC ACGGCCACAA GGGGGAGTTG CGGCAACGCT ACCGGGAGGG AATGGAGGAC
CAGTTGGGGG CGCTGGGGCT GGTGGTGAAT GCCATCGTGC TGTGGAACAC CCGCTACATG
CAGGTGGCGC TGGAGGACCT GCGCCACGCG GGGATGGACG TTCAAGAGGA GGACGTGGCC
CGACTCTCGC CGCTGCTCCA CGAGCATGTG AACATGCTCG GGAAGTACGA TTTCACGCTG
CCCGACGAGG TGGCCGGAGG GCAGCTCCGG CCCCTGCGTG ACCCGAATAG TCCAGAGGAG
TGGCTGGCGC AACTCGCTTA G
 
Protein sequence
MPVEFLSDDQ AARYGCYHTD PTPEQLTRFF YLSEQDHAFI AQRRREHNKL GCALQLCTLR 
FLGTFQLDPT AVPVVAIQNV ADQLQLDPAV LPQYLRRVKT RFQHQQLILA HLDYQPFDDV
QAFRLIRWLY AQVATSTVRP SVLFDLATAH LVSQRVVLPG VTTLARLIAR VRDRLSRKTF
EGLSHRLTSE QRANLEALLV LSEGERLTPL EVLRTSPTRV TSPALLAALL RIGQLREIGV
GSLNLSDVPE GRRALLARHA QTAWAQTLLR MGEDRRLATL LVFVQHLERT ATDDVLDLFD
ALMTSLALKG EAKRRQERLR TLRDLDQAAL VLQDAVRVLL DESVPEVDLR RIVFSRVGQT
RLWEAVGTVQ ALASEDDDTT PEALSGSYAT VRRFLPTFLK TVEFQGTPTA KPLLEAWRFL
ARQEEGGRGK PKWTEAPRSF VPRAWERRVF PAKGEVNRQA YTLCVLDRLQ QALKRREVFA
PRSERYGDPR AELLQGPGWE AARDDVSRAL GRSLDPQAEL ELLRTELNAA YREVEENLPQ
NTALKLEVRD GHTQVSLTPL DAQPEPPSLV RLREQVTLRL PQVELAALLL EIHAFTGFAS
AFTHLTDGRV QVKDLPLSVC AVLLAQACNI GLKAVARQDV PALTLSRLSW VQQNYVRAET
ITAANARLVD AQLDLPLAQS WGGGEVASAD GLRFIVPVRT IHAGWNSKYF GSQRGVTYDN
FTSDQFTGFH GIVVPGTLRD SLFILAGLLE QQTRLDPREI MADTHGYSDV VFGLFALLGY
QFSPRLADLA DQRFWRLEKN ADYGALNDLS RHVVNERLIA EHWEDLLRLA GSLKLGKVKA
TAVMRTLQRG GSLSGLGRAV AELGRIEKTL YLLAYVQDEA YRRRILGQLN RGEGRHRVGR
AVFHGHKGEL RQRYREGMED QLGALGLVVN AIVLWNTRYM QVALEDLRHA GMDVQEEDVA
RLSPLLHEHV NMLGKYDFTL PDEVAGGQLR PLRDPNSPEE WLAQLA