Gene DET1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDET1114 
Symbol 
ID3229569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides ethenogenes 195 
KingdomBacteria 
Replicon accessionNC_002936 
Strand
Start bp1018023 
End bp1021082 
Gene Length3060 bp 
Protein Length1019 aa 
Translation table11 
GC content49% 
IMG OID637120678 
Producttype III restriction-modification system, restriction endonuclease subunit 
Protein accessionYP_181829 
Protein GI57234103 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.189519 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTC AATACAAACA TCAAAAATTC CAAGCTGATG CTGCCAAGGC GGTCGTGGAT 
GTTTTTGCAG GCCAGCCTTA TCTGACGCCT ACCTATATGA TGGATAGAGG AACCGGTATG
TTTCAGATTG GATTGAATGA GGAACGTGAT TATACCGGTT GGAGTAACCA GAAGATTGTG
CCGGAGCTTT CCGATCATCT GATTCTGGAG CATATACAGA AAATTCAGCG TACAAATCAA
ATCCAGCCGT CTTCCAAGCT GGAAGGCCGC AGCGACGGTT TCAATCTCAC CATCGAGATG
GAGACCGGTG TTGGCAAGAC TTATACCTAT ATCAAGACAA TGTATGAGCT GAACCGCGCC
TATGGCTGGA GTAAGTTCAT CATTGTGGTT CCGAGTATCG CTATACGTGA GGGCGTGTAT
AAGTCCTTCC AGATGACGCA AGAGCATTTC GCAGAAGAGT ACGGAAAAAA GATCCGCTTC
TTTATCTATA ATTCTGCACA GCTCACGGAG ATTGACCGCT TTGCTTCGGA TAGCTCTATT
AATGTCATGA TCATCAATTC GCAGGCGTTC AATGCCAAGG GCAAGGATGC AAGGCGTATT
TACATGAAGC TGGACGAGTT CCGCAGCCGT CGCCCGATTG ACATCATTGC GAAGACGAAT
CCGATTCTGA TCATCGACGA GCCACAGTCT GTGGAAGGTA AGCAGACAAA GGAGCGCCTG
AAGGAATTCC AGCCGCTGCT GACGCTGCGT TATTCCGCTA CGCATAAGTC AGACAGCATT
TACAATATGA TATATCGGCT GGACGCTATG GAAGCATATA ACAAGCGTCT GGTTAAGAAG
ATCGCCGTTA AGGGTATCAC CGAGACCGGC AGCACGGCCA CGGACAGCTA TCTCTATCTG
GAAGGTCTGA ATCTTTCCAA GGGTGATCCT ACTGCTACGC TGCAGTTTGA GGTCAAGATG
GCCAGTGGCA CTCCAAAGAA AAAGAGCCGC GTTGTGAAAA TCGGAGATAA TCTCTACGAC
TATTCCGGCG GGCTGGAAGA ATACAAAAAC GGCTTTGTGG TCAAGCAGAT CGATGGCCGG
GACGACTCCG TGGAATTCCT GAACGGCATC AAGATTTATG CCGGTGACGT GATGGGCGCG
GTCGATGAAG ACCAGCTGCG CCGTATTCAG ATCAGAGAGA CAATCTTGTC GCATATTCAG
AGAGAACGTC AGCTCTTCTA TAAGGGCATC AAGGTGTTGT CCCTGTTCTT TATTGACGAG
GTTGCAAACT ACAGAGAGTA TGATGCTGCA GGTCAGCCGG TAAATGGAAA GTACGCCAGA
ATGTTTGAAG AGGAGTACGA GGACATCATC AGCAATATGC AGCTTAGTAT AGGCGAGGAT
GAATACATCA AATACCTGCA GAGCATCAAG GCTTCTAAGA CGCATGCGGG ATATTTCTCA
GTGGACGGAA AAGGCAAGAT GATCAATTCC AAGGTCGGTC GTAAAGAAAC GACCTCAGAC
GATGTGAGCG CCTATGAGCT GATTATGAAG AACAAGGAGC TGCTTCTTGA TCGTGATCCG
AAGAAGTCTC CGGTGCGCTT CATCTTTTCA CACTCCGCTC TGCGTGAAGG ATGGGACAAT
CCTAATGTGT TCCAGATCTG TACGCTGAAG CAGTCCAGCA GTGATGTTCG TAAGCGTCAG
GAAGTAGGCC GTGGCCTGCG TCTTTGCGTC AATCAGGACG GTGAGCGTAT GGATACCAAT
GCGCTCGGAA ATGACGTTCA TAACGTGAAT GTTCTGACGG TTATTGCCAG CGAGAGCTAC
GATTCCTTTG CCAAAGGCCT GCAGACCGAG ATGGCTGACG CCGTTGCTGA TAGACCGCGT
GCGGTTACCA TCGATCTCTT TGTCGGAAAA GTAATCAAGG ACGACAAGGG TAATGAGCAG
GTTATCGATC AGGATACTGC TTCCGCAATC CACTACGATA TGATCGTTAA CGGTTATATC
GACCGTAAGG GCGTTCTGAC AGATAAGTAT TACGAGGATA AGGCAAACGG AGAAATCAAG
GTCGCAGAGG AAGTGGCTGA TTCCGCAGCC TCTGTTATTG AGATTGTTGA CTCCATTTAC
GATGCTCGTA GCATGCAGCC TGAGAATGCC CGCAGCAATA ACGTCGAGCT TCAGGTCGAT
GAAGAGAAAC TGGCTATGCC GGAATTCAAA GCGCTGTGGT CAAAGATCAA TGCTAAGTCC
GTATATGTTG TTGATTTTGA TACCGACGAG CTGATCAGGA AGTCCATCGC CTCCCTTGAT
GCCAAGCTGC GTGTATCGAA GATTTACTTC CGCGTAGAGT CCGGTGCAAT GGATAACATC
AAGTCGAAGG AAGAGCTTGT ATCCGGAGCC TCCTTCGTAA AAGAAGAGTC CGCCAGCTAC
GGTGTGACCA TCACAGCGAA CTCGAATGTA AAGTATGACC TGATTGGGAA ACTGGTGGAC
GAGACCGGGC TTACCCGTAA GGCGGTCATT GCAATCCTTC AGGGAATCAA GCCGTTTGTG
TTCGACCAGT TTAAGGATAA TCCGGAGGAG TTTATCGTCA AGGCCGCCGC GCTGATCAAT
GACGAGAAAG CCACGGCCAT TATCGAGCAC ATTACCTACG ACGTTCTGGA TGAGCATTAT
GGCATGGATG TTTTCACCGA TCCCACCATC AAGGGCAAGC TGGGCGTTAA TGCGATGAAG
GCAAAGAAGC ACCTGTACGA TCACATCGTT TACGATTCAT CGAACGAGCG CGACTTTGCT
ACAGACCTTG ATACAAACAC CGATGTTGCG GTTTACGTGA AGCTGCCGGA TGGATTCTAT
ATTTCCACTC CTGTCGGCCA CTACAATCCC GACTGGGCAA TTGCCTTCTA CGAGGGCAAG
GTAAAGCACA TATACTTCGT GGCAGAGACG AAGGGCTCCA TGAGCTCGAT GCAGCTGCGG
CTGATTGAGG AGTCCAAGAT TCACTGCGCG AGAGAACACT TTAAGGCCAT TTCCAATGGG
AATGTGGTAT ACGACGTAGT CGATAGCTAT AAGTCCCTAT TGGAGAAGGT AATGAAATAA
 
Protein sequence
MRIQYKHQKF QADAAKAVVD VFAGQPYLTP TYMMDRGTGM FQIGLNEERD YTGWSNQKIV 
PELSDHLILE HIQKIQRTNQ IQPSSKLEGR SDGFNLTIEM ETGVGKTYTY IKTMYELNRA
YGWSKFIIVV PSIAIREGVY KSFQMTQEHF AEEYGKKIRF FIYNSAQLTE IDRFASDSSI
NVMIINSQAF NAKGKDARRI YMKLDEFRSR RPIDIIAKTN PILIIDEPQS VEGKQTKERL
KEFQPLLTLR YSATHKSDSI YNMIYRLDAM EAYNKRLVKK IAVKGITETG STATDSYLYL
EGLNLSKGDP TATLQFEVKM ASGTPKKKSR VVKIGDNLYD YSGGLEEYKN GFVVKQIDGR
DDSVEFLNGI KIYAGDVMGA VDEDQLRRIQ IRETILSHIQ RERQLFYKGI KVLSLFFIDE
VANYREYDAA GQPVNGKYAR MFEEEYEDII SNMQLSIGED EYIKYLQSIK ASKTHAGYFS
VDGKGKMINS KVGRKETTSD DVSAYELIMK NKELLLDRDP KKSPVRFIFS HSALREGWDN
PNVFQICTLK QSSSDVRKRQ EVGRGLRLCV NQDGERMDTN ALGNDVHNVN VLTVIASESY
DSFAKGLQTE MADAVADRPR AVTIDLFVGK VIKDDKGNEQ VIDQDTASAI HYDMIVNGYI
DRKGVLTDKY YEDKANGEIK VAEEVADSAA SVIEIVDSIY DARSMQPENA RSNNVELQVD
EEKLAMPEFK ALWSKINAKS VYVVDFDTDE LIRKSIASLD AKLRVSKIYF RVESGAMDNI
KSKEELVSGA SFVKEESASY GVTITANSNV KYDLIGKLVD ETGLTRKAVI AILQGIKPFV
FDQFKDNPEE FIVKAAALIN DEKATAIIEH ITYDVLDEHY GMDVFTDPTI KGKLGVNAMK
AKKHLYDHIV YDSSNERDFA TDLDTNTDVA VYVKLPDGFY ISTPVGHYNP DWAIAFYEGK
VKHIYFVAET KGSMSSMQLR LIEESKIHCA REHFKAISNG NVVYDVVDSY KSLLEKVMK