Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DET1114 |
Symbol | |
ID | 3229569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | + |
Start bp | 1018023 |
End bp | 1021082 |
Gene Length | 3060 bp |
Protein Length | 1019 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637120678 |
Product | type III restriction-modification system, restriction endonuclease subunit |
Protein accession | YP_181829 |
Protein GI | 57234103 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.189519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTC AATACAAACA TCAAAAATTC CAAGCTGATG CTGCCAAGGC GGTCGTGGAT GTTTTTGCAG GCCAGCCTTA TCTGACGCCT ACCTATATGA TGGATAGAGG AACCGGTATG TTTCAGATTG GATTGAATGA GGAACGTGAT TATACCGGTT GGAGTAACCA GAAGATTGTG CCGGAGCTTT CCGATCATCT GATTCTGGAG CATATACAGA AAATTCAGCG TACAAATCAA ATCCAGCCGT CTTCCAAGCT GGAAGGCCGC AGCGACGGTT TCAATCTCAC CATCGAGATG GAGACCGGTG TTGGCAAGAC TTATACCTAT ATCAAGACAA TGTATGAGCT GAACCGCGCC TATGGCTGGA GTAAGTTCAT CATTGTGGTT CCGAGTATCG CTATACGTGA GGGCGTGTAT AAGTCCTTCC AGATGACGCA AGAGCATTTC GCAGAAGAGT ACGGAAAAAA GATCCGCTTC TTTATCTATA ATTCTGCACA GCTCACGGAG ATTGACCGCT TTGCTTCGGA TAGCTCTATT AATGTCATGA TCATCAATTC GCAGGCGTTC AATGCCAAGG GCAAGGATGC AAGGCGTATT TACATGAAGC TGGACGAGTT CCGCAGCCGT CGCCCGATTG ACATCATTGC GAAGACGAAT CCGATTCTGA TCATCGACGA GCCACAGTCT GTGGAAGGTA AGCAGACAAA GGAGCGCCTG AAGGAATTCC AGCCGCTGCT GACGCTGCGT TATTCCGCTA CGCATAAGTC AGACAGCATT TACAATATGA TATATCGGCT GGACGCTATG GAAGCATATA ACAAGCGTCT GGTTAAGAAG ATCGCCGTTA AGGGTATCAC CGAGACCGGC AGCACGGCCA CGGACAGCTA TCTCTATCTG GAAGGTCTGA ATCTTTCCAA GGGTGATCCT ACTGCTACGC TGCAGTTTGA GGTCAAGATG GCCAGTGGCA CTCCAAAGAA AAAGAGCCGC GTTGTGAAAA TCGGAGATAA TCTCTACGAC TATTCCGGCG GGCTGGAAGA ATACAAAAAC GGCTTTGTGG TCAAGCAGAT CGATGGCCGG GACGACTCCG TGGAATTCCT GAACGGCATC AAGATTTATG CCGGTGACGT GATGGGCGCG GTCGATGAAG ACCAGCTGCG CCGTATTCAG ATCAGAGAGA CAATCTTGTC GCATATTCAG AGAGAACGTC AGCTCTTCTA TAAGGGCATC AAGGTGTTGT CCCTGTTCTT TATTGACGAG GTTGCAAACT ACAGAGAGTA TGATGCTGCA GGTCAGCCGG TAAATGGAAA GTACGCCAGA ATGTTTGAAG AGGAGTACGA GGACATCATC AGCAATATGC AGCTTAGTAT AGGCGAGGAT GAATACATCA AATACCTGCA GAGCATCAAG GCTTCTAAGA CGCATGCGGG ATATTTCTCA GTGGACGGAA AAGGCAAGAT GATCAATTCC AAGGTCGGTC GTAAAGAAAC GACCTCAGAC GATGTGAGCG CCTATGAGCT GATTATGAAG AACAAGGAGC TGCTTCTTGA TCGTGATCCG AAGAAGTCTC CGGTGCGCTT CATCTTTTCA CACTCCGCTC TGCGTGAAGG ATGGGACAAT CCTAATGTGT TCCAGATCTG TACGCTGAAG CAGTCCAGCA GTGATGTTCG TAAGCGTCAG GAAGTAGGCC GTGGCCTGCG TCTTTGCGTC AATCAGGACG GTGAGCGTAT GGATACCAAT GCGCTCGGAA ATGACGTTCA TAACGTGAAT GTTCTGACGG TTATTGCCAG CGAGAGCTAC GATTCCTTTG CCAAAGGCCT GCAGACCGAG ATGGCTGACG CCGTTGCTGA TAGACCGCGT GCGGTTACCA TCGATCTCTT TGTCGGAAAA GTAATCAAGG ACGACAAGGG TAATGAGCAG GTTATCGATC AGGATACTGC TTCCGCAATC CACTACGATA TGATCGTTAA CGGTTATATC GACCGTAAGG GCGTTCTGAC AGATAAGTAT TACGAGGATA AGGCAAACGG AGAAATCAAG GTCGCAGAGG AAGTGGCTGA TTCCGCAGCC TCTGTTATTG AGATTGTTGA CTCCATTTAC GATGCTCGTA GCATGCAGCC TGAGAATGCC CGCAGCAATA ACGTCGAGCT TCAGGTCGAT GAAGAGAAAC TGGCTATGCC GGAATTCAAA GCGCTGTGGT CAAAGATCAA TGCTAAGTCC GTATATGTTG TTGATTTTGA TACCGACGAG CTGATCAGGA AGTCCATCGC CTCCCTTGAT GCCAAGCTGC GTGTATCGAA GATTTACTTC CGCGTAGAGT CCGGTGCAAT GGATAACATC AAGTCGAAGG AAGAGCTTGT ATCCGGAGCC TCCTTCGTAA AAGAAGAGTC CGCCAGCTAC GGTGTGACCA TCACAGCGAA CTCGAATGTA AAGTATGACC TGATTGGGAA ACTGGTGGAC GAGACCGGGC TTACCCGTAA GGCGGTCATT GCAATCCTTC AGGGAATCAA GCCGTTTGTG TTCGACCAGT TTAAGGATAA TCCGGAGGAG TTTATCGTCA AGGCCGCCGC GCTGATCAAT GACGAGAAAG CCACGGCCAT TATCGAGCAC ATTACCTACG ACGTTCTGGA TGAGCATTAT GGCATGGATG TTTTCACCGA TCCCACCATC AAGGGCAAGC TGGGCGTTAA TGCGATGAAG GCAAAGAAGC ACCTGTACGA TCACATCGTT TACGATTCAT CGAACGAGCG CGACTTTGCT ACAGACCTTG ATACAAACAC CGATGTTGCG GTTTACGTGA AGCTGCCGGA TGGATTCTAT ATTTCCACTC CTGTCGGCCA CTACAATCCC GACTGGGCAA TTGCCTTCTA CGAGGGCAAG GTAAAGCACA TATACTTCGT GGCAGAGACG AAGGGCTCCA TGAGCTCGAT GCAGCTGCGG CTGATTGAGG AGTCCAAGAT TCACTGCGCG AGAGAACACT TTAAGGCCAT TTCCAATGGG AATGTGGTAT ACGACGTAGT CGATAGCTAT AAGTCCCTAT TGGAGAAGGT AATGAAATAA
|
Protein sequence | MRIQYKHQKF QADAAKAVVD VFAGQPYLTP TYMMDRGTGM FQIGLNEERD YTGWSNQKIV PELSDHLILE HIQKIQRTNQ IQPSSKLEGR SDGFNLTIEM ETGVGKTYTY IKTMYELNRA YGWSKFIIVV PSIAIREGVY KSFQMTQEHF AEEYGKKIRF FIYNSAQLTE IDRFASDSSI NVMIINSQAF NAKGKDARRI YMKLDEFRSR RPIDIIAKTN PILIIDEPQS VEGKQTKERL KEFQPLLTLR YSATHKSDSI YNMIYRLDAM EAYNKRLVKK IAVKGITETG STATDSYLYL EGLNLSKGDP TATLQFEVKM ASGTPKKKSR VVKIGDNLYD YSGGLEEYKN GFVVKQIDGR DDSVEFLNGI KIYAGDVMGA VDEDQLRRIQ IRETILSHIQ RERQLFYKGI KVLSLFFIDE VANYREYDAA GQPVNGKYAR MFEEEYEDII SNMQLSIGED EYIKYLQSIK ASKTHAGYFS VDGKGKMINS KVGRKETTSD DVSAYELIMK NKELLLDRDP KKSPVRFIFS HSALREGWDN PNVFQICTLK QSSSDVRKRQ EVGRGLRLCV NQDGERMDTN ALGNDVHNVN VLTVIASESY DSFAKGLQTE MADAVADRPR AVTIDLFVGK VIKDDKGNEQ VIDQDTASAI HYDMIVNGYI DRKGVLTDKY YEDKANGEIK VAEEVADSAA SVIEIVDSIY DARSMQPENA RSNNVELQVD EEKLAMPEFK ALWSKINAKS VYVVDFDTDE LIRKSIASLD AKLRVSKIYF RVESGAMDNI KSKEELVSGA SFVKEESASY GVTITANSNV KYDLIGKLVD ETGLTRKAVI AILQGIKPFV FDQFKDNPEE FIVKAAALIN DEKATAIIEH ITYDVLDEHY GMDVFTDPTI KGKLGVNAMK AKKHLYDHIV YDSSNERDFA TDLDTNTDVA VYVKLPDGFY ISTPVGHYNP DWAIAFYEGK VKHIYFVAET KGSMSSMQLR LIEESKIHCA REHFKAISNG NVVYDVVDSY KSLLEKVMK
|
| |