Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_4037 |
Symbol | |
ID | 5166252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 4691718 |
End bp | 4694972 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640551516 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001232754 |
Protein GI | 148266048 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAT ATCTCCACGT AGAAAAACCA TTCCTTGACC AGCTCGCCGC ACTGGGCTGG ACGGTAATCG ACCAAGGTCA GGGGTTTATC CCCTCCGATC CTACGGCCAG CCTGCGCGAC ACCTTCCGCG AATGGCTGCT CCCTGATGTC TTCCACAATG CCGTGCGCGC CATCAACCGC ACCGCCGACG GTACCCCCTG GCTGACCGAT CGCCAACTCG ATGACCTCCG GTCACAAATC CTGCGCCAGC CAAACCGTAC CCTGCTTGAA GCCAACGAGG CGGTACAGGC GCTCTTCCTC AAGGCACAAG TGGATCGCAA TGAGATCAGC GGCGAGCCAG ATCCTGTGGT CCAGCTCATT GACTTCTCCC ACCCCGAGCG CAATCAGTTC CACGCCATCA ACCAGTTCCG CATCGACACG CCCGGCTACG TGAAGAAATG CATCATCCCG GATATCGTAC TCTTTGTGAA CGGCATCCCG CTGGTGGTGA TCGAGGCCAA GATCGGCGAC GCCACCACGG CCAACCCGAT GCACGCCGCC TTCGAGCAAC TGCTTCGCTA CCGCAACGGC CGCCCGGAAA CGGCTGCTGC CGGCCTGCGC GAAGGAGAGC CGCGCCTGTT CCACACGAAT TTGCTGCTGA TACGCACCTG TGGCGAGAAA GCGGAGTTTG GCAGCATCAC CTCGGGCCAC GAGCACTTCT ACGCCTGGAA GGACATTTGG CCGGAGGAGA ACCGCAATTA CACGCCGCCG CTGGGGATCG AGCGCGAACA GGAGCGGCTG ATCCAGGGGC TGCTCGCCCC AACGACGCTC CTGGATGTGC TGCGCACCTG CACGGTCTTC ATGGATACCG ACTCCGGCAA GCGGGTGAAG GTGGTCTGCC GTTACCAACA GTACCGCGCT GCCCACCGGA TCGTTGAGCG CCTGCGCACC GGCAAGACAT CAGAGGAGCG CTCTGGCGTC GTCTGGCACA CCCAGGGCTC GGGCAAGTCA CTGACTATGG TTTTCGTGGC CCGCATGCTG CGGGCATCAC AAGACCTGGC CGACTTCAAG ATCCTGCTGG TCAACGACCG CGTCGATCTG GAGGATCAAC TTGCAGCCAC TGCCAAGCTG ATCGGCGGCA AGGTTAATGT GATTGAGAGT ACGGCAGCGT TGCGCGAGCA CCTGAGTACC GATACCTCCG ACATTAACAT GGTGATGGTA CACAAGTTCA TGGAACGTGC CGAAGCGCTG CCGACCATGG TTGCCGAGGC GCTTGCAGCC TACCGCCCGC CGCCGTCCGG AGCAACTTTT GGCGTGGTCA ATCCGTCCGA ACGCATCCTG CTGATGATTG ACGAGGCGCA CAGGACGCAA GGCTCCGACC TGGGTGAAAA CATTTTCGAG GCATTCCCCA ACGCTACCCG CATCGCCTTC ACCGGCACCC CGCTTATCTC CGAGCAACAC GGCAGCAAGC GCACGGTGAA GCGTTTCGGC GAATACATCG ACACCTACAA GCTGATGGAC GCGGTGCATG ACGGCGCGAC GCTGCAGATT CTCTACGAAG GACGCACGGC GGACACGGCG TTGAAGGACA AGCACGGCTT CGACACCAAG TTCGAAGACC TCTTCAAGAA CCGGAGCGAA GAAGAACTCC TCGTCATCAA GAAGAAATAT GGCGCCAGCG GCGACATCCT GGAAGCCGAG CAGCGCATTG CGGCCATTGC CCGCGACTTG GTGACGCACT ACGTCGACAA CATCCTCCCC GACGGTTTCA AGGCGCAGGT TGTCTGCCAC TCCAAGCTGG CGGCGATCCG CTATCAGAAA TCGATCCGTG AAGCCCTGGC CGAGCGCCTT GACGTGGAGA AGCTCAAACA GAAGCCCGAC ACCGAACTGA TTCGCCGCAT CGCCTTCCTG AAGGCCGTGG TAGTGGTTTC CGCTGACGCC ACCAACGAAC TGGCCGCCAT CACCGAGGCG CGCAAGGAGG CGAAACGCTG GAACGCGGTG GAGAACTTCT GCAAACCTTT TGATCTGGAC GACCCGGACA AGGATCTGAC TGGAATCGCC TTCCTGATCG TCTGTGACAT GCTGCTGACC GGCTTCGACG CCCCGGTTGA GCAGGTGATG TATATCGACA AGCGTTTGCG CGAGCACAAC CTGCTACAGG CCATTGCCCG GGTGAACCGG GTGACGAAGA ACAAGCACCG GGGGTTCATC GTCGATTACA TCGGCCTGGC CAACCACCTG ATGGAGGCGC TCAGCATCTA TTCCGATGAG GATGCAGAAG ACATCCAGCA AGGGCTGAAA AACCGGCTCA CCGAACTGCC GATTCTGGAA GAACGCTATC AGCGTCTCCT GCAGCATTTC CGCGCAGCCG GGGTAGGGGA GATCGAAGCC TTCGTAAAGG GTGAACTGGC AACACCAGAG TCTGACGTGG CCGTGGTGCA TGCGGCTGTG GGAGCGATGC AGGACATCAA GCGCCGGGCG GACTTCGAGG TCTACCTGAA GAAGTATCTC CAAAGCCTGA ACCTGATCCT CCCCCACGCG TCGGGGCATC CCTACCGCGG CCCGGCAAGG CGTTTCGGCT ATCTCCTCCG CATGGTCAAG GAGCGCTACA AGGACGACTC CCTCGACATC TCCGACGCCG GTGAGAAGGT GAAAGCGCTG ATCAACGAGC ACCTTATCGA CCTGGGGATA AATCCCAAGA TCCCCCCCAT AGAGCTTCTG TCGGACGACT TCATCTCCAA TGTGCAGAAG CATTCCCAGG GAAACCCGGA GGCGAAGGCG AGTGAGATGG AGCATGCCAT CCGCAAACAC TGCACAATCC ACTTCGACGA AGACCCGGCC TTCTACAAGC GGCTGAGCGA GAAGCTGGAA AAGCTGATCC AGGAGCACAA GAACCAATGG GAGGTGCTGG CCGAGGGGTA TGAACAACTC AGAAGCGAAG CCCTGGCGGG CCGGACGGAT GCTGAGGAAG GCTTGACCAG GGAGGCTACC ACCTTCTACG ACTACGTGGT GCAGCTCGCT TTCGAGAATG GCGAAGTGCC CGGGAATCAT CGGCATCAGT TGAAGAGGCT GATGGCTGGC ATCGTGGAGA TGTTGCAAGG TACTATCGGC ATCATCGACT TCTGGAAGAA GCCAATCGAG GTAAAGAAAC TTCGGGGGAA CATCGACACC GAGATCCTGC TGACCGAGAT CCCGCAACTC ATCGACAAAC ACGAACGGAT CGCGGTGGAG ATTGTGAAGC TCGCCGAAAA ACGCCACCAG GAGCTGACAC GATGA
|
Protein sequence | MSEYLHVEKP FLDQLAALGW TVIDQGQGFI PSDPTASLRD TFREWLLPDV FHNAVRAINR TADGTPWLTD RQLDDLRSQI LRQPNRTLLE ANEAVQALFL KAQVDRNEIS GEPDPVVQLI DFSHPERNQF HAINQFRIDT PGYVKKCIIP DIVLFVNGIP LVVIEAKIGD ATTANPMHAA FEQLLRYRNG RPETAAAGLR EGEPRLFHTN LLLIRTCGEK AEFGSITSGH EHFYAWKDIW PEENRNYTPP LGIEREQERL IQGLLAPTTL LDVLRTCTVF MDTDSGKRVK VVCRYQQYRA AHRIVERLRT GKTSEERSGV VWHTQGSGKS LTMVFVARML RASQDLADFK ILLVNDRVDL EDQLAATAKL IGGKVNVIES TAALREHLST DTSDINMVMV HKFMERAEAL PTMVAEALAA YRPPPSGATF GVVNPSERIL LMIDEAHRTQ GSDLGENIFE AFPNATRIAF TGTPLISEQH GSKRTVKRFG EYIDTYKLMD AVHDGATLQI LYEGRTADTA LKDKHGFDTK FEDLFKNRSE EELLVIKKKY GASGDILEAE QRIAAIARDL VTHYVDNILP DGFKAQVVCH SKLAAIRYQK SIREALAERL DVEKLKQKPD TELIRRIAFL KAVVVVSADA TNELAAITEA RKEAKRWNAV ENFCKPFDLD DPDKDLTGIA FLIVCDMLLT GFDAPVEQVM YIDKRLREHN LLQAIARVNR VTKNKHRGFI VDYIGLANHL MEALSIYSDE DAEDIQQGLK NRLTELPILE ERYQRLLQHF RAAGVGEIEA FVKGELATPE SDVAVVHAAV GAMQDIKRRA DFEVYLKKYL QSLNLILPHA SGHPYRGPAR RFGYLLRMVK ERYKDDSLDI SDAGEKVKAL INEHLIDLGI NPKIPPIELL SDDFISNVQK HSQGNPEAKA SEMEHAIRKH CTIHFDEDPA FYKRLSEKLE KLIQEHKNQW EVLAEGYEQL RSEALAGRTD AEEGLTREAT TFYDYVVQLA FENGEVPGNH RHQLKRLMAG IVEMLQGTIG IIDFWKKPIE VKKLRGNIDT EILLTEIPQL IDKHERIAVE IVKLAEKRHQ ELTR
|
| |