Gene Information |
Locus tag | GM21_0430 |
Symbol | |
ID | 8135739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 507673 |
End bp | 510765 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644868048 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_003020268 |
Protein GI | 253699079 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
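
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 507673, End bp 510765.
length = gene_length_bp(507673, 510765)
print(length)                     # 3093
print(protein_length_aa(length))  # 1030
```

Both results match the Gene Length (3093 bp) and Protein Length (1030 aa) fields listed in the record.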
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 124 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCTGAAC AGCCCCGATC GGAACGCAAG ACGCAGAACC GCGTTGTCTC CCTTTTCACC GATGGGAGCC GCCAAGATGG CCTCGGCTAC CGCTACCTCG GGGACTGGCA TGAACGCGCG GACAACGGTC CGATCGAACC GGCACTATTG CGGCAAAACC TCAAGGCTAG AGGCTATTCC GACGCTCACA TCTCCGCAGC CCTGCAAAAG CTCGAAAGAG CGGCGGAAGT CACGGGCACC ACGCTCTACC AAGCAAACCT CCGCACTTAT CAGTTGCTGC GTTACGGCGT GCCGGTCCAG GTCGCAGCAG GGCGCCCGCA TGAGACGGTG CACCTCGTCG ACTGGGAGCA CACGGATAAA AACGACTTCG CACTGGCCGA GGAAGTGACC TTGCGCGGCG GATACGAGCG CAGGCCCGAC CTGGTCATCT ACATGAATGG CATTACGGTG GGGGTGATCG AACTTAAGCG CAGTTCCGTG GATGTGGCCG ACGGTGTGCG CCAGCTCCTA ACAAACCAGG AAGAGATCTT CAACGAGTGG TTCTTCAGTA CCATCCAGTT CGTCTTTGCA GGAAGCGACT CGCAGGGATT GCGCTACGGC ACTGTCGGCA CGCCGGAAAA GTTCTTCGTG GAGTGGAAGG CACCTGGACA CGATGGAGAA TCTTCCCTTC CCGGCGCGCT CCTGGACCGG CCTCTTGCAG AGATGTGCGA GAAGGGGCGC CTGCTCGATC TGATCCGTAA CTTCATCATC TTCGACGCCG GACAGAAGAA GGTGCCGCGA CCGCACCAGT TCGCGGCGGT CAAGGCGGCC CAGGAGCGGA TCAGGACGCG CGAGGGAGGC GTCATCTGGC ACACCCAAGG GAGCGGCAAA AGTATCCTGA TGGTGCTGCT GGCCAAGTGG CTTCTGGAGC ACGACCCCGA GGGGCGAATC CTGATCGTCA CGGACCGCGA CGAGTTGGAT AAACAGATCG AAGGGGTGAT GAAAAACGCC GGAGTCGTGG GGGCGGAATC CCCTTCGCCG CGCATCACCT CCCGCGCACA GTTCTCCGAC AAGCTTGCCG CCACCTCGCC CCGGCTCTTA TGCGCACTGA TTCACAAGTT CGACTTGGAC CCCAAAAGCG AGCCCCCGCC CATACGCGGG CGCTTCTATG TGTTCGTGGA CGAATGTCAC CGCACCCAGG GCGGCGACAT GAACAAGCAG ATGAAACGCT GGCTGGAAAA CGCCATCTTC ATCGGCTTCA CCGGAACTCC TCTTCTGCGC CGCGACAAGC AAACCACCCG CGAGATCTTC GGCACCTACA TCCACACCTA CAAGTTCCAC CAAGCGGTGG AAGACAAGGT GGTCCTGGAC CTGAAATACG AGGCCCGCAC CGTCCCTCAG CGTCTCACCT CCAGAAAGGC GATCGATGAT TGGTTCGATC AGAAGACCCG GGGGCTGAAC AACTACCAGC GCGCAGTCCT CCGCAAGAGG TGGGCGACGA TGGAGGAGCT GATGAGCTCC GGCGAGCGCA AGCAGCGCAT CATCGCCGAC ATCATCCATG ACTTCGGGGT GCAGCAACGC CTGAACAATG ACCGGGGCAC AGCCATGCTC GTGGCCGACT CCATCTACGA CGCCTGCCAT TACTTCCGCC TGCTCCAGAA CACTTCTTTT GGTGCCTACT GCGGCATCGT CACCTCCTAC GAACCAAACC ACAACGCCAT CTCCCGTGAA CCGAAGGATA GCGACGAGCG TTACAAGTTC GACACCTACA CGCGCCATGT ACTGAGAGCG GGCCAGACTA CGAAGCAGTA CGAGGATGAA 
ACGAAGCGTC GCTTCATCGA GGAGCCGGCC AACTGCAAAC TCCTCATCGT GGTGAGCAAG CTTCTCACCG GCTTCGACGC ACCTTCCTGC ACTTACATCT ATCTCGACAG CAAGATGCAG GACCACACCC TTTTTCAGGC CATCTGCAGG ACCAACCGGC TCGACGGGGA CGACAAGTCC TTCGGCCATA TCGTCGATTA CAAGGAGCTG TTCGAGAAGG TGCAGAACTC CATCGCGGTC TACAGCTCCG ATGAGCTGGA CATCGAAAAC GGGGACGGCG ACAACAACGT GAGGCTCAAG GACTGGCTGA GAGAGGGTAA GGCGCAGCTC GACGCAGCCC GCGAGGCGCT CCGCTACCTC TGCGAACCGG TGATGCAGCC GCGCGAGATG GAGCAGTATG TGGAGTACTT CTGCGGGTAC GGCAACGGAC TCGAAGCCCT CGCTGCAACA GAGCCGCTGC GCATCTCGTA CTACAAGTCC GTCGTCACCT TCCTGCGCGC CTATGCCGAT GTAGCGCAAA ACCTTGCCGA GGCGGGTTAT TCCAGTGTCG AGATCGCCGC TTTGGAGCAG GAGACAAAAT TCCATTGCGA CACCCGGGCG GCGATAAAGA ACCATTCGGG GGAAGAACTG GACATCAAGC CCTACGAGGC GGACATGCGC CACCTCATCA ACACCTACAT ACAGGCCGAC CCCGCCACCG ACCTGGGCAA CCTGAGTTCG CTCTCCCTTA CCCAGCTCAT CATAGAGACC GGCATCCACG ACGCCATAGC CCGGAAGCTC AACCAGCAGG GGAAGCTCTC CCGTAACGCC ATCGCGGAGG GGATCATCAA CAACGTCCGC AAGACGATCA TCCGAGAGAA GCTTACCGAC CCCAGATTCT ACGAGGAGAT GTCTAAACTG CTCGAGGACT TGATCCGGCA GAAGCGGAAC GGCACGAAGT CCTACGAGGA ATTCCTGAAA AAGGCAGAGG AGCTGGTCAG GCGCTTAGCG CGGAAGCATC CAGAGGCTGG CATCCCTGTA GTTCTGTACG GCAAACCGGA AGCCATCGCG CTATTCAACA ATCTCCGCGG TATCCCGGCG ACCAGCTTTC GCTATCCCAC GGATGATGAG GAGAGGGCGC AGCTAGCCCT GGATCTGGAC CGGGTGGTAT GCGCTGAGGC TCCGGCCAAC TGGATAGGGG ACGATGCACG GGAAAAGCAG GTCCTGAACG CCCTGTTCCC CAAGCTCGCG CGCGACCGCG AAGCCACGCT GGCAATCTTT GAAATCGTCA AGAACCAACC AGGCTACAAA TGA
|
Protein sequence | MPEQPRSERK TQNRVVSLFT DGSRQDGLGY RYLGDWHERA DNGPIEPALL RQNLKARGYS DAHISAALQK LERAAEVTGT TLYQANLRTY QLLRYGVPVQ VAAGRPHETV HLVDWEHTDK NDFALAEEVT LRGGYERRPD LVIYMNGITV GVIELKRSSV DVADGVRQLL TNQEEIFNEW FFSTIQFVFA GSDSQGLRYG TVGTPEKFFV EWKAPGHDGE SSLPGALLDR PLAEMCEKGR LLDLIRNFII FDAGQKKVPR PHQFAAVKAA QERIRTREGG VIWHTQGSGK SILMVLLAKW LLEHDPEGRI LIVTDRDELD KQIEGVMKNA GVVGAESPSP RITSRAQFSD KLAATSPRLL CALIHKFDLD PKSEPPPIRG RFYVFVDECH RTQGGDMNKQ MKRWLENAIF IGFTGTPLLR RDKQTTREIF GTYIHTYKFH QAVEDKVVLD LKYEARTVPQ RLTSRKAIDD WFDQKTRGLN NYQRAVLRKR WATMEELMSS GERKQRIIAD IIHDFGVQQR LNNDRGTAML VADSIYDACH YFRLLQNTSF GAYCGIVTSY EPNHNAISRE PKDSDERYKF DTYTRHVLRA GQTTKQYEDE TKRRFIEEPA NCKLLIVVSK LLTGFDAPSC TYIYLDSKMQ DHTLFQAICR TNRLDGDDKS FGHIVDYKEL FEKVQNSIAV YSSDELDIEN GDGDNNVRLK DWLREGKAQL DAAREALRYL CEPVMQPREM EQYVEYFCGY GNGLEALAAT EPLRISYYKS VVTFLRAYAD VAQNLAEAGY SSVEIAALEQ ETKFHCDTRA AIKNHSGEEL DIKPYEADMR HLINTYIQAD PATDLGNLSS LSLTQLIIET GIHDAIARKL NQQGKLSRNA IAEGIINNVR KTIIREKLTD PRFYEEMSKL LEDLIRQKRN GTKSYEEFLK KAEELVRRLA RKHPEAGIPV VLYGKPEAIA LFNNLRGIPA TSFRYPTDDE ERAQLALDLD RVVCAEAPAN WIGDDAREKQ VLNALFPKLA RDREATLAIF EIVKNQPGYK