Gene GM21_0430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0430 
Symbol 
ID8135739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp507673 
End bp510765 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content60% 
IMG OID644868048 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_003020268 
Protein GI253699079 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones124 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTGAAC AGCCCCGATC GGAACGCAAG ACGCAGAACC GCGTTGTCTC CCTTTTCACC 
GATGGGAGCC GCCAAGATGG CCTCGGCTAC CGCTACCTCG GGGACTGGCA TGAACGCGCG
GACAACGGTC CGATCGAACC GGCACTATTG CGGCAAAACC TCAAGGCTAG AGGCTATTCC
GACGCTCACA TCTCCGCAGC CCTGCAAAAG CTCGAAAGAG CGGCGGAAGT CACGGGCACC
ACGCTCTACC AAGCAAACCT CCGCACTTAT CAGTTGCTGC GTTACGGCGT GCCGGTCCAG
GTCGCAGCAG GGCGCCCGCA TGAGACGGTG CACCTCGTCG ACTGGGAGCA CACGGATAAA
AACGACTTCG CACTGGCCGA GGAAGTGACC TTGCGCGGCG GATACGAGCG CAGGCCCGAC
CTGGTCATCT ACATGAATGG CATTACGGTG GGGGTGATCG AACTTAAGCG CAGTTCCGTG
GATGTGGCCG ACGGTGTGCG CCAGCTCCTA ACAAACCAGG AAGAGATCTT CAACGAGTGG
TTCTTCAGTA CCATCCAGTT CGTCTTTGCA GGAAGCGACT CGCAGGGATT GCGCTACGGC
ACTGTCGGCA CGCCGGAAAA GTTCTTCGTG GAGTGGAAGG CACCTGGACA CGATGGAGAA
TCTTCCCTTC CCGGCGCGCT CCTGGACCGG CCTCTTGCAG AGATGTGCGA GAAGGGGCGC
CTGCTCGATC TGATCCGTAA CTTCATCATC TTCGACGCCG GACAGAAGAA GGTGCCGCGA
CCGCACCAGT TCGCGGCGGT CAAGGCGGCC CAGGAGCGGA TCAGGACGCG CGAGGGAGGC
GTCATCTGGC ACACCCAAGG GAGCGGCAAA AGTATCCTGA TGGTGCTGCT GGCCAAGTGG
CTTCTGGAGC ACGACCCCGA GGGGCGAATC CTGATCGTCA CGGACCGCGA CGAGTTGGAT
AAACAGATCG AAGGGGTGAT GAAAAACGCC GGAGTCGTGG GGGCGGAATC CCCTTCGCCG
CGCATCACCT CCCGCGCACA GTTCTCCGAC AAGCTTGCCG CCACCTCGCC CCGGCTCTTA
TGCGCACTGA TTCACAAGTT CGACTTGGAC CCCAAAAGCG AGCCCCCGCC CATACGCGGG
CGCTTCTATG TGTTCGTGGA CGAATGTCAC CGCACCCAGG GCGGCGACAT GAACAAGCAG
ATGAAACGCT GGCTGGAAAA CGCCATCTTC ATCGGCTTCA CCGGAACTCC TCTTCTGCGC
CGCGACAAGC AAACCACCCG CGAGATCTTC GGCACCTACA TCCACACCTA CAAGTTCCAC
CAAGCGGTGG AAGACAAGGT GGTCCTGGAC CTGAAATACG AGGCCCGCAC CGTCCCTCAG
CGTCTCACCT CCAGAAAGGC GATCGATGAT TGGTTCGATC AGAAGACCCG GGGGCTGAAC
AACTACCAGC GCGCAGTCCT CCGCAAGAGG TGGGCGACGA TGGAGGAGCT GATGAGCTCC
GGCGAGCGCA AGCAGCGCAT CATCGCCGAC ATCATCCATG ACTTCGGGGT GCAGCAACGC
CTGAACAATG ACCGGGGCAC AGCCATGCTC GTGGCCGACT CCATCTACGA CGCCTGCCAT
TACTTCCGCC TGCTCCAGAA CACTTCTTTT GGTGCCTACT GCGGCATCGT CACCTCCTAC
GAACCAAACC ACAACGCCAT CTCCCGTGAA CCGAAGGATA GCGACGAGCG TTACAAGTTC
GACACCTACA CGCGCCATGT ACTGAGAGCG GGCCAGACTA CGAAGCAGTA CGAGGATGAA
ACGAAGCGTC GCTTCATCGA GGAGCCGGCC AACTGCAAAC TCCTCATCGT GGTGAGCAAG
CTTCTCACCG GCTTCGACGC ACCTTCCTGC ACTTACATCT ATCTCGACAG CAAGATGCAG
GACCACACCC TTTTTCAGGC CATCTGCAGG ACCAACCGGC TCGACGGGGA CGACAAGTCC
TTCGGCCATA TCGTCGATTA CAAGGAGCTG TTCGAGAAGG TGCAGAACTC CATCGCGGTC
TACAGCTCCG ATGAGCTGGA CATCGAAAAC GGGGACGGCG ACAACAACGT GAGGCTCAAG
GACTGGCTGA GAGAGGGTAA GGCGCAGCTC GACGCAGCCC GCGAGGCGCT CCGCTACCTC
TGCGAACCGG TGATGCAGCC GCGCGAGATG GAGCAGTATG TGGAGTACTT CTGCGGGTAC
GGCAACGGAC TCGAAGCCCT CGCTGCAACA GAGCCGCTGC GCATCTCGTA CTACAAGTCC
GTCGTCACCT TCCTGCGCGC CTATGCCGAT GTAGCGCAAA ACCTTGCCGA GGCGGGTTAT
TCCAGTGTCG AGATCGCCGC TTTGGAGCAG GAGACAAAAT TCCATTGCGA CACCCGGGCG
GCGATAAAGA ACCATTCGGG GGAAGAACTG GACATCAAGC CCTACGAGGC GGACATGCGC
CACCTCATCA ACACCTACAT ACAGGCCGAC CCCGCCACCG ACCTGGGCAA CCTGAGTTCG
CTCTCCCTTA CCCAGCTCAT CATAGAGACC GGCATCCACG ACGCCATAGC CCGGAAGCTC
AACCAGCAGG GGAAGCTCTC CCGTAACGCC ATCGCGGAGG GGATCATCAA CAACGTCCGC
AAGACGATCA TCCGAGAGAA GCTTACCGAC CCCAGATTCT ACGAGGAGAT GTCTAAACTG
CTCGAGGACT TGATCCGGCA GAAGCGGAAC GGCACGAAGT CCTACGAGGA ATTCCTGAAA
AAGGCAGAGG AGCTGGTCAG GCGCTTAGCG CGGAAGCATC CAGAGGCTGG CATCCCTGTA
GTTCTGTACG GCAAACCGGA AGCCATCGCG CTATTCAACA ATCTCCGCGG TATCCCGGCG
ACCAGCTTTC GCTATCCCAC GGATGATGAG GAGAGGGCGC AGCTAGCCCT GGATCTGGAC
CGGGTGGTAT GCGCTGAGGC TCCGGCCAAC TGGATAGGGG ACGATGCACG GGAAAAGCAG
GTCCTGAACG CCCTGTTCCC CAAGCTCGCG CGCGACCGCG AAGCCACGCT GGCAATCTTT
GAAATCGTCA AGAACCAACC AGGCTACAAA TGA
 
Protein sequence
MPEQPRSERK TQNRVVSLFT DGSRQDGLGY RYLGDWHERA DNGPIEPALL RQNLKARGYS 
DAHISAALQK LERAAEVTGT TLYQANLRTY QLLRYGVPVQ VAAGRPHETV HLVDWEHTDK
NDFALAEEVT LRGGYERRPD LVIYMNGITV GVIELKRSSV DVADGVRQLL TNQEEIFNEW
FFSTIQFVFA GSDSQGLRYG TVGTPEKFFV EWKAPGHDGE SSLPGALLDR PLAEMCEKGR
LLDLIRNFII FDAGQKKVPR PHQFAAVKAA QERIRTREGG VIWHTQGSGK SILMVLLAKW
LLEHDPEGRI LIVTDRDELD KQIEGVMKNA GVVGAESPSP RITSRAQFSD KLAATSPRLL
CALIHKFDLD PKSEPPPIRG RFYVFVDECH RTQGGDMNKQ MKRWLENAIF IGFTGTPLLR
RDKQTTREIF GTYIHTYKFH QAVEDKVVLD LKYEARTVPQ RLTSRKAIDD WFDQKTRGLN
NYQRAVLRKR WATMEELMSS GERKQRIIAD IIHDFGVQQR LNNDRGTAML VADSIYDACH
YFRLLQNTSF GAYCGIVTSY EPNHNAISRE PKDSDERYKF DTYTRHVLRA GQTTKQYEDE
TKRRFIEEPA NCKLLIVVSK LLTGFDAPSC TYIYLDSKMQ DHTLFQAICR TNRLDGDDKS
FGHIVDYKEL FEKVQNSIAV YSSDELDIEN GDGDNNVRLK DWLREGKAQL DAAREALRYL
CEPVMQPREM EQYVEYFCGY GNGLEALAAT EPLRISYYKS VVTFLRAYAD VAQNLAEAGY
SSVEIAALEQ ETKFHCDTRA AIKNHSGEEL DIKPYEADMR HLINTYIQAD PATDLGNLSS
LSLTQLIIET GIHDAIARKL NQQGKLSRNA IAEGIINNVR KTIIREKLTD PRFYEEMSKL
LEDLIRQKRN GTKSYEEFLK KAEELVRRLA RKHPEAGIPV VLYGKPEAIA LFNNLRGIPA
TSFRYPTDDE ERAQLALDLD RVVCAEAPAN WIGDDAREKQ VLNALFPKLA RDREATLAIF
EIVKNQPGYK