Gene Information |
Locus tag | Dde_1742 |
Symbol | |
ID | 3756742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 1797091 |
End bp | 1800063 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637782623 |
Product | type III restriction system endonuclease |
Protein accession | YP_388234 |
Protein GI | 78356785 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
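The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of this non-spliced CDS is a stop codon that does not count toward the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1797091
end_bp = 1800063
gene_length_bp = 2973
protein_length_aa = 990

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
encoded_residues = codons - 1

print(span_bp, remainder, encoded_residues)  # 2973 0 990
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.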
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.109484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTC ACTTCGAAGA TAACCTTGAC TATCAGCTTG ATGCTATACG TGCTGTTACT GACATATTTA AAGGGCAGGA GCGCAACGCC TCTCTGTTCT CCGTTTCCAA GACGTATGCC AAGGGCATGA TTGAATCTGA CAAGGGTATT GGTAACAGGC TCGAACTTCT CGACGACGAG CTTCAGGAGA ACCTCAACGC TATTCAGCTG AGGAACGGTC TACGCCCAAC GCATTCTCTG ACCAGCGGCG ACTTCACTGT CGAAATGGAG ACCGGAACAG GCAAGACCTA CGTCTATCTC CGCACGATTT TCGAGCTGAA TAAGCGCTAC GGATTCACAA AATTCGTCAT TGTTGTGCCT TCAGTGGCCA TCAAGGAAGG CGTCTACAAA ACGCTTCAGA TTACGAGAGA TCACTTCGAG AACCTCTATC CCGAATCGAA GGGATATGAC TATTTTTCTT ATGATTCATC GAAGATGGGA CAAGTTCGCA ACTTTGCGAC AAGTCCAAAC GTTCAGATCA TGGTTGTGAC TGTCGGTGCC ATCCATAAAA AGGACGTCAA CAATCTTTAC AAGGAAAGCG AGAAGACAGG CGGAGATAAG CCCATCGACC TTGTGAAGGC GACACACCCT GTCATCATCG TTGATGAACC GCAAAGCGTT GATGGTGGCT TGAAGGGCCA AGGCAAGAAA GCCCTTGAAT CAATGAATCC GCTCTGCACG TTGCGTTATT CTGCCACCCA CGTGGACAAA CATCACATGG TGTATCGTCT CGATGCTGTT GATGCTTACG AAAAGCAGCT TGTTAAACAG ATTGAAGTCG CTTCGCTTGA AATTGATGGC GGTTACAATA AGCCTTACGT CAAGCTTATT TCTACACAGA ACAAGCGCGG TGTCATATCC GCAAAGATTG AAGTCCATAT CCAAAGTGGC AAATCTGTCC GTCCTCAGAT CATCACTGTT CAGGACGGCG ACGACTTGGA GCAAGAAACC GGTCGCGCCA TTTATGAAAA CTTTCGTATC GGCACGATCA ACTGCGGCAA GGGAAATCAG TCCATTGAGC TGAGCGTGCC TAACGGCGAA TATGTCCTGT ATCCCGACGA TGAAATTGGT GGAGTCAATC AAGACGACAT CAAACGCCTC ATGATCCGCC GCACAATTAA AGAGCACCTT GATAAGGAAC TTCGCTTTGC CGCGCTCGGT CACAGAGTGA AAGTGCTCAG CCTGTTCTTC ATCGACAAGG TTGAATTTTA CCGCCAATAC GATGAAGACG GGAACGAGAT TCAAGGCAAG TATGCCAAGA TATTTGAAGA AGAGTATCGC AAGCTGGCAC ATAGCGACGA TTACCGCACG CTCTTTGAGG GGATCGATCT TGAATCCGAT GCTTCGGAAG TCCACAACGG CTACTTTTCC ATCGACAAGA AAGGCAAGCT GACCGACACA GCCGAAAACA ACCAGACCAA CCGCGACAAC GCAGAAAGGG CGTACAACCT GATCATGCGA GACAAGGAAA AGCTTTTGAG TTTCACCTCA AAGCTCAAAT TCATTTTCTC GCACTCTGCG CTCCGTGAAG GCTGGGATAA CCCCAACGTC TTTCAGATTT GTGTGTTGAG AGACATGGGA ACTGAACTGG CAAGAAGGCA GTCCATTGGG CGAGGGCTTC GCCTTTGCGT CAATCAGGAC GGCCAACGAC TTAGGGGCTT TGACGTAAAC ACGCTGACGG TCATTGCTAA CGAAAGCTAT GAACAGTTTG CGGAAACGCT CCAAAAGGAA ATCGAAGAGG ACGCCGGAAT CCGCTTTGGT 
GTTGTTGAAA AGCACCAGTT CGCAAACATT CCTGTTCAAG GAGCTGATGG CGAACTTCAT CCCCTCGGTT TCAAGAAGTC GGAAGAGCTT TGGCGTATTC TCAAAGAGAG AGAGTTCATC GATAAAAATG GCAAGGTGCA GGACACTCTT AGGACTGCCC TGAAAGAGAA GACGCTCGAA CTTCCTGAAG AATTCAAGCC TGAACTTGAT GCTATATCTT CTGTTCTTCG AAAGCTGGCT GGCAGGCTCG ATATTAAAAA CGCCGATGAA CGTATCACGG TAAAGCCAAA CAAAGAGCGG CTGCTAAGTC CTGAGTTCCA AGAGCTTTGG GAGCGCATCA AGCACAAGAC GACCTATCGT GTCGATTTCG ACAATGAGAA ACTGATTCAG GATTGTGCAA AATCCATCCT TGAAGGACCG CCCATAACCA AGACAAGAGC GCGGTTCAGA AATGCGAATC TGGCGATTGG TGAGGGCGGC ATTGAGGCAG ATGAAAAAGC TGTTAGCCAG TACACGACGA TAAATGAGTC TGACATCGAA TTGCCGGACA TCCTCACCGA TTTGCAGGAT AAGACGCAGT TGACGCGAAA AAGCATCGTC CGCATTCTCA CGTCTTGCCG AAGACTGAAC GATTTCAAAC GTAACCCTCA AGAATTTATT CAGCTGGCGG CAGAGGCTAT CAACCGGACA AAGCGTTTGG CTCTTGTGGA AGGCATCAAG TACCGCAGAA TCGGCGACGA ATACTACTAC GCTCAGGAAC TCTTTGAGAA AGAAGAGCTG ACCGGATACC TCAAGAATAC CGTTGAGGTC CAAAAATCCG TTTATGAACG CGTCGTGTAT GATTCTGCCG GAGTTGAACA GCAATTTGCC CAAGCACTGG AAAAGAACGA ATCAGTCAAA GTGTACGCCA AGCTCCCCGG CTGGTTCAAA GTTCCTACCC CGCTTGGCAC CTACAACCCG GACTGGGCCG TCCTTGTCGA GGAAGACGGA CATGAGAAGC TCTATCTGGT CGTAGAAACA AAGGGTAGCC TCTGGTGGGA CGATCTTCGT CATCACGAGG GGGCAAAGAT CAAGTGCGGC AAAGAACATT TCGCCGTTCT CGCCGAACAC GCCGATAATC CCGCGCGGTT CATCAAAGCT AAAACGGTAG AAGACATGCT GTCATACGAT TAG
|
Protein sequence | MKLHFEDNLD YQLDAIRAVT DIFKGQERNA SLFSVSKTYA KGMIESDKGI GNRLELLDDE LQENLNAIQL RNGLRPTHSL TSGDFTVEME TGTGKTYVYL RTIFELNKRY GFTKFVIVVP SVAIKEGVYK TLQITRDHFE NLYPESKGYD YFSYDSSKMG QVRNFATSPN VQIMVVTVGA IHKKDVNNLY KESEKTGGDK PIDLVKATHP VIIVDEPQSV DGGLKGQGKK ALESMNPLCT LRYSATHVDK HHMVYRLDAV DAYEKQLVKQ IEVASLEIDG GYNKPYVKLI STQNKRGVIS AKIEVHIQSG KSVRPQIITV QDGDDLEQET GRAIYENFRI GTINCGKGNQ SIELSVPNGE YVLYPDDEIG GVNQDDIKRL MIRRTIKEHL DKELRFAALG HRVKVLSLFF IDKVEFYRQY DEDGNEIQGK YAKIFEEEYR KLAHSDDYRT LFEGIDLESD ASEVHNGYFS IDKKGKLTDT AENNQTNRDN AERAYNLIMR DKEKLLSFTS KLKFIFSHSA LREGWDNPNV FQICVLRDMG TELARRQSIG RGLRLCVNQD GQRLRGFDVN TLTVIANESY EQFAETLQKE IEEDAGIRFG VVEKHQFANI PVQGADGELH PLGFKKSEEL WRILKEREFI DKNGKVQDTL RTALKEKTLE LPEEFKPELD AISSVLRKLA GRLDIKNADE RITVKPNKER LLSPEFQELW ERIKHKTTYR VDFDNEKLIQ DCAKSILEGP PITKTRARFR NANLAIGEGG IEADEKAVSQ YTTINESDIE LPDILTDLQD KTQLTRKSIV RILTSCRRLN DFKRNPQEFI QLAAEAINRT KRLALVEGIK YRRIGDEYYY AQELFEKEEL TGYLKNTVEV QKSVYERVVY DSAGVEQQFA QALEKNESVK VYAKLPGWFK VPTPLGTYNP DWAVLVEEDG HEKLYLVVET KGSLWWDDLR HHEGAKIKCG KEHFAVLAEH ADNPARFIKA KTVEDMLSYD
|
| |
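The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how the blocks might be joined and the start of the gene translated is shown below. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the strand is "-"), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in permitted start codons. The helper names `join_blocks` and `translate` are hypothetical and chosen for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks between ten-character blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence and first block of the protein sequence.
gene_start = "ATGAAACTTC ACTTCGAAGA TAACCTTGAC"
protein_start = "MKLHFEDNLD"

print(translate(join_blocks(gene_start)))  # MKLHFEDNLD
```

The same two helpers could be applied to the full gene sequence after joining its two lines. The final block of the gene sequence, TAG, translates to a stop under this table.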