Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_1704 |
Symbol | |
ID | 4058947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | + |
Start bp | 1809837 |
End bp | 1812857 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641230727 |
Product | transposase Tn3 |
Protein accession | YP_605168 |
Protein GI | 94985804 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.019193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTTG AATTCCTGAG CGACGACCAG GCCGCCCGCT ATGGCTGCTA CCATACCGAC CCCACCCCCG AGCAGCTCAC CCGCTTCTTC TATCTCAGTG AACAGGACCA CGCCTTCATT GCGCAGCGGC GCCGGGAACA CAACAAGCTG GGCTGCGCTC TCCAGCTCTG CACCCTCCGC TTCCTGGGCA CCTTCCAACT CGACCCCACC GCCGTCCCTG TGGTCGCCAT TCAGAATGTC GCTGACCAAC TACAGCTCGA CCCCGCAGTG CTGCCCCAGT ACCTTCGGCG GGTCAAAACG CGCTTCCAGC ACCAGCAACT GATCCTCGCC CACCTGGACT ACCAGCCGTT CGACGACGTG CAGGCCTTCC GGTTGATCCG CTGGCTGTAC GCGCAGGTCG CCACCAGCAC CGTCCGTCCC AGCGTGCTGT TCGATCTGGC GACCGCCCAT CTGGTGTCCC AACGGGTCGT GCTGCCCGGC GTAACCACAC TCGCCCGTCT CATCGCCCGT GTCCGGGACC GGCTCAGCCG CAAAACCTTT GAGGGTCTCA GCCATCGCCT GACATCTGAA CAGCGGGCCA ACCTGGAGGC GCTGCTGGTC CTGTCCGAGG GCGAGCGGCT GACGCCCCTG GAAGTGCTCC GCACTTCCCC CACCCGGGTC ACCAGCCCGG CCCTGCTGGC CGCCTTGTTG CGGATCGGAC AACTCCGCGA GATCGGCGTC GGCTCGCTCA ACTTGAGTGA CGTGCCAGAA GGGAGGCGGG CGTTGCTGGC GCGGCACGCT CAGACAGCCT GGGCGCAGAC CTTGTTGCGA ATGGGCGAGG ACCGGCGGCT GGCGACCCTG CTGGTCTTTG TCCAGCACCT GGAGCGCACG GCCACCGATG ATGTTCTTGA CCTGTTCGAT GCCCTGATGA CCTCGCTAGC GCTCAAGGGG GAAGCCAAAC GCCGTCAGGA ACGGTTGCGG ACCCTCCGTG ACCTCGACCA GGCGGCCCTG GTCTTGCAGG ACGCAGTCCG CGTCCTGCTG GACGAGTCGG TTCCGGAAGT GGACCTTCGC CGGATAGTGT TCTCCCGGGT CGGGCAGACT CGGCTCTGGG AGGCGGTGGG AACCGTCCAG GCACTTGCAA GTGAGGACGA TGACACGACG CCGGAGGCCC TGAGCGGCAG CTATGCCACT GTCCGGCGTT TCCTGCCCAC CTTCCTTAAA ACCGTCGAGT TCCAGGGGAC ACCGACGGCC AAGCCGCTGC TAGAGGCCTG GCGTTTCCTG GCCCGGCAGG AAGAGGGTGG ACGGGGCAAG CCGAAATGGA CCGAGGCCCC TCGATCTTTT GTGCCCAGAG CCTGGGAACG GCGGGTGTTC CCCGCCAAGG GTGAGGTCAA CCGGCAAGCC TACACCCTCT GCGTGCTGGA CCGGCTCCAA CAGGCGCTGA AGCGCCGTGA GGTCTTCGCG CCCAGAAGCG AGCGGTACGG CGATCCCCGA GCAGAATTGT TGCAGGGGCC AGGCTGGGAG GCCGCGCGGG ACGATGTGTC CCGCGCTCTG GGCCGTTCCC TTGACCCCCA GGCGGAACTC GAACTGTTGC GGACGGAATT GAACGCGGCC TACCGGGAAG TGGAAGAGAA CCTGCCGCAG AACACGGCAC TCAAACTGGA AGTTCGGGAC GGCCACACGC AGGTCAGCCT GACCCCCCTG GACGCTCAAC CGGAGCCGCC CAGCCTGGTT CGCCTGCGGG AACAGGTGAC GCTGCGACTG CCGCAGGTGG AACTTGCGGC TCTGCTGCTC GAAATCCATG CCTTCACGGG CTTCGCGTCG GCCTTCACGC ACCTGACAGA TGGGAGAGTG CAGGTCAAAG ACCTGCCACT CAGCGTCTGC GCGGTCCTGC TCGCGCAGGC CTGCAACATC GGGTTGAAAG CCGTGGCCCG GCAGGATGTT CCTGCCCTGA CCCTTTCCCG CTTGTCGTGG GTCCAGCAGA ACTATGTTCG GGCGGAAACG ATCACGGCTG CCAATGCCCG CCTGGTGGAT GCCCAGCTTG ACCTGCCGCT GGCACAGTCC TGGGGCGGCG GGGAAGTGGC GTCGGCGGAC GGGCTGCGCT TCATCGTCCC GGTGCGGACC ATCCACGCGG GCTGGAACAG CAAGTATTTC GGCTCGCAGC GGGGCGTGAC CTACGACAAC TTCACCAGCG ACCAGTTCAC GGGTTTTCAC GGGATCGTGG TGCCGGGGAC GCTGCGGGAC TCACTGTTCA TCCTGGCGGG GCTGCTGGAG CAGCAGACCC GGCTCGACCC CCGCGAGATC ATGGCGGATA CGCACGGGTA CAGCGATGTC GTGTTCGGCC TGTTCGCCCT GCTGGGCTAC CAGTTCAGCC CCCGCCTCGC TGATCTTGCA GATCAGCGGT TCTGGCGCTT GGAGAAGAAC GCCGATTACG GCGCGTTGAA CGATCTGAGC CGCCATGTGG TGAACGAACG GCTGATCGCG GAACACTGGG AGGACCTGTT GCGGCTGGCG GGGTCATTGA AGCTGGGAAA GGTCAAGGCG ACGGCGGTGA TGCGGACCTT GCAACGCGGC GGCAGCCTGT CCGGCCTGGG GCGGGCGGTG GCAGAACTGG GGCGGATCGA GAAGACGCTG TACCTGCTGG CCTATGTGCA GGACGAGGCG TACCGGCGAC GGATTCTGGG GCAACTGAAT CGCGGGGAGG GACGGCACAG GGTAGGCCGT GCCGTGTTCC ACGGCCACAA GGGGGAGTTG CGGCAACGCT ACCGGGAGGG AATGGAGGAC CAGTTGGGGG CGCTGGGGCT GGTGGTGAAT GCCATCGTGC TGTGGAACAC CCGCTACATG CAGGTGGCGC TGGAGGACCT GCGCCACGCG GGGATGGACG TTCAAGAGGA GGACGTGGCC CGACTCTCGC CGCTGCTCCA CGAGCATGTG AACATGCTCG GGAAGTACGA TTTCACGCTG CCCGACGAGG TGGCCGGAGG GCAGCTCCGG CCCCTGCGTG ACCCGAATAG TCCAGAGGAG TGGCTGGCGC AACTCGCTTA G
|
Protein sequence | MPVEFLSDDQ AARYGCYHTD PTPEQLTRFF YLSEQDHAFI AQRRREHNKL GCALQLCTLR FLGTFQLDPT AVPVVAIQNV ADQLQLDPAV LPQYLRRVKT RFQHQQLILA HLDYQPFDDV QAFRLIRWLY AQVATSTVRP SVLFDLATAH LVSQRVVLPG VTTLARLIAR VRDRLSRKTF EGLSHRLTSE QRANLEALLV LSEGERLTPL EVLRTSPTRV TSPALLAALL RIGQLREIGV GSLNLSDVPE GRRALLARHA QTAWAQTLLR MGEDRRLATL LVFVQHLERT ATDDVLDLFD ALMTSLALKG EAKRRQERLR TLRDLDQAAL VLQDAVRVLL DESVPEVDLR RIVFSRVGQT RLWEAVGTVQ ALASEDDDTT PEALSGSYAT VRRFLPTFLK TVEFQGTPTA KPLLEAWRFL ARQEEGGRGK PKWTEAPRSF VPRAWERRVF PAKGEVNRQA YTLCVLDRLQ QALKRREVFA PRSERYGDPR AELLQGPGWE AARDDVSRAL GRSLDPQAEL ELLRTELNAA YREVEENLPQ NTALKLEVRD GHTQVSLTPL DAQPEPPSLV RLREQVTLRL PQVELAALLL EIHAFTGFAS AFTHLTDGRV QVKDLPLSVC AVLLAQACNI GLKAVARQDV PALTLSRLSW VQQNYVRAET ITAANARLVD AQLDLPLAQS WGGGEVASAD GLRFIVPVRT IHAGWNSKYF GSQRGVTYDN FTSDQFTGFH GIVVPGTLRD SLFILAGLLE QQTRLDPREI MADTHGYSDV VFGLFALLGY QFSPRLADLA DQRFWRLEKN ADYGALNDLS RHVVNERLIA EHWEDLLRLA GSLKLGKVKA TAVMRTLQRG GSLSGLGRAV AELGRIEKTL YLLAYVQDEA YRRRILGQLN RGEGRHRVGR AVFHGHKGEL RQRYREGMED QLGALGLVVN AIVLWNTRYM QVALEDLRHA GMDVQEEDVA RLSPLLHEHV NMLGKYDFTL PDEVAGGQLR PLRDPNSPEE WLAQLA
|
| |