Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_0547 |
Symbol | |
ID | 5165527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 640745 |
End bp | 643831 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640548049 |
Product | hypothetical protein |
Protein accession | YP_001229334 |
Protein GI | 148262628 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTCA ATGAAAACGC CAGAGTGAAA ATACCGGCAA TATTACATTT ATGCCGTTTG GGCTATGACT ACCTGTCGCT ATCGGCAGCA ATCCGTGATG AAAGCACCAA TATTTTCACC GACCTGTTTT CTGAAAGCAT CAGGCGCATC AATCCTGATC TTGAGGAAAG CGCAATAAAG CGGTTATTGG AAGATATTTC ACTGATTCTG GATAATGAGG ACCTGGGCGA AGCATTTTAC CGGATGCTCA CCGCTCGTTC TGGCGTGAAA CTGATCGACT TCAATGACTT CAGCAACAAC AGTTTTCACG TAGTAACTGA ATTGACCTGC AAAAACGGCG ATGACGAATT CCGACCGGAT ATCACGCTTT TGATCAACGG TATGCCCCTC GCTTTTATCG AAGTCAAAAT ACCGAACAAT CAGGAAGGGA TTCTCAAAGA ACGGGACCGG ATGCTTGTTC GTTTCCGGAA CAAGAAGTTC CGGAAGTTCA TCAATATTTC CCAGCTGCTG GTCTTTTCCA ATAACATGGA ATACGATCAG GACTCAGTTG AACCTATCCA GGGAGCGTTT TACTCGTCTA CCTCCACCGG TGACGTTCAC TTCAACTGTT TCCGCGAAGA AGAACAGTTC GACCTGTCAC AACTGCTGCA GCCGGAAGAC AACGCCCGGG AAGACTTAGT CCTGAAGGAC AACAATCTTA ACGTCATCAA GCATTCACCG GAGTTCATCA CCAACAAGGA ACCCACCACC CCCACAAACC GGCTGCTCAC CTCGCTGTTT AGCTGTAATC GTTTGGCAAT GCTGCTTCAG TACGGTTTCG CCTACGTGCA GGAATCGAAG GGTGTTGAGA AGCATATCAT GCGCTATCCG CAACTGTTCG CCACCAAGGC GATCCAGCAT AAGCTCGATA ACGGCATCAA AAAGGGGATC ATCTGGCATA CCCAAGGAAG CGGCAAGACG GCGCTTTCCT ATTACAACAT CCACTACCTG ACGGACTATT TTCAGCAGAA GGGAGTCATT CCCAAGTTCT ATTTCATTGT TGACCGCATC GACCTGATGG AACAGGCCAA ACGGGAGTTT TCCATCAGGG GGCTGGTCGT CCATACCGTA AATTCCAGAG AAGAGCTGCT GAAAAGCTTT CGGGTGCGCA AGGCGATTCA CAACCTGTCC GGCAAGCGGG AAGTTACGGT CGTCAATATC CAGAAGTTCC AGGACGATAC CCATCTGCTC CAGATGCAGG ACTATGACAT CGCTATCCAG CGGGTCTATT TCCTCGACGA AGTGCACCGG AGCTACAACC CGAAAGGGAG TTTTCTGGCC AACCTTATCA GTTCAGATCG GGAGGCCATT CTCATTGGCC TGACCGGCAC CCCTCTTATT ATGGCAGACC GGAAGTCCAG GGATCTGTTC GGCGACTATA TCCATAAATA CTACTACAAT GCTTCCATCG CCGACGGTTA TACCCTGCGG CTGATCAGGG AAGGGATTGA GACCAACTAC AAAATCCAGT TGGAGCAAGC GCTGAAGGAA GTGGAAATTC TCAAAGGTGA TGCGGATAAG CGGGTGATTT TTGCCCACGA AAAATTCGTT GAACCGATGC TGGATTATAT CGTCGAGGAT TTCATCAACA GCAGAATCCG CCTGGGGGAT CACACCATCT GCGGCATGGT GGTCTGCGAT AGTGCCGAGC AGGCTCGCAG GTTGTTCGAC ATCTTTATCG CCAAGTACAA TCCTGATCAG AAAACGGTCG AAGATGTTTC AACGGAGTAT CTCAAGGTCG CCGAACCGGT TGTCGCCTAT GGGGAGTATC TGAACAAGCG GAAAAGCAGA CTGACCGCCT CGCTGATCCT GCATGACGTT GGCTCAAAGG ATGACCGCAA AGACGAAGTA GAGGATTTCA AGGAAGGCAA GATCGATTTT CTTTTTGTCT ATGGTATGCT CCTGACCGGC TTTGACGCCA AGCGCCTGAA AAAGCTTTAT CTTGGTCGGA TTATCAAGGA TCACAACCTG CTGCAAACCC TTACTCGGGT CAATCGACCG TATAAAAAGT TCCGCTATGG TTTCGTGGTT GATTTTGCCG ACATCCGCAA GGAGTTTGAC GCCACCAACA AGGCATATTT CGAGGAATTA CAGGAGGAGC TTGGTGATGA AATCGGGACC TACTCCAACC TGTTCAAGAC CAAGGAAGAG ATTGACGACG AGATCAGCGA CATCAAGGAA AAGCTGTTCC ACTACGACCT GACAAACGCT GAAATTTTTT CACAGCAGAT CAGCCGGATA GAAGACCGGA AAACAATGCT GGAGATCAAG AAGGCTCTGG AGAATGCCCG CAACCTCTAC AACATCATCC GACTCCTCGG CCATTTCGAA CTGCTGGAAC ATACCGACTT CAATAAACTG AACCAGCTTT ATCGAGAGGC GGTTCGGCAC CTGGAGCTGC TGAGTCTGAA GGAATCGGTA CAGAACAATG TCGATGCCAC TAACCTGCTG AATGTGGCCC TGGAAAACGT GCTCTTTATG TTCCGCAAGG TTTCGGAAGA AGAACTGATT ATTGCAGACA AGCTGAAGGA CATGCTGCGC AAGACCAGGG AAGCGCTGGG GAATAACTTC GACCGGAAAG ACCCGGAATT TGTTTCCCTT TACGAAGAAC TAAAGCGGCT TTTTGGCAAG AAGAATCTGG ACGAAATAAC CCAGGACGAG ATGAAGCGAA ATATCGGCTC GTTGCAGCAA ATTTTTGATA AAGTGACCGA GCTGAACCGC AGGAACAACC TGCTAAAAGC GAAGTACGAG AACGACGCCA AATATGCCCG CATTCACAAA CGGTTGGTGG AGAAGGGAAC TGTCTCAAAA CGGGAAAGCG CCATCCATGC CACCTTGATG GGCATAAAAA AACAGGCTGA CGATCAGGTG TTGTTGAATG CCAAGATGCT TGAGAACGAG GCCTATTTCG ACCAAAAACT TATGCAAATG GTAATCGGCG GCTTCGGCAA GGCAAAGATC AACCTCGATC CGGAGAGTGC CCATTTCATC AACTCCTGCG TGACGCGGGA GTATCTTAAC GAATACCGGG GAGTAGCTGC GTGGTAG
|
Protein sequence | MSFNENARVK IPAILHLCRL GYDYLSLSAA IRDESTNIFT DLFSESIRRI NPDLEESAIK RLLEDISLIL DNEDLGEAFY RMLTARSGVK LIDFNDFSNN SFHVVTELTC KNGDDEFRPD ITLLINGMPL AFIEVKIPNN QEGILKERDR MLVRFRNKKF RKFINISQLL VFSNNMEYDQ DSVEPIQGAF YSSTSTGDVH FNCFREEEQF DLSQLLQPED NAREDLVLKD NNLNVIKHSP EFITNKEPTT PTNRLLTSLF SCNRLAMLLQ YGFAYVQESK GVEKHIMRYP QLFATKAIQH KLDNGIKKGI IWHTQGSGKT ALSYYNIHYL TDYFQQKGVI PKFYFIVDRI DLMEQAKREF SIRGLVVHTV NSREELLKSF RVRKAIHNLS GKREVTVVNI QKFQDDTHLL QMQDYDIAIQ RVYFLDEVHR SYNPKGSFLA NLISSDREAI LIGLTGTPLI MADRKSRDLF GDYIHKYYYN ASIADGYTLR LIREGIETNY KIQLEQALKE VEILKGDADK RVIFAHEKFV EPMLDYIVED FINSRIRLGD HTICGMVVCD SAEQARRLFD IFIAKYNPDQ KTVEDVSTEY LKVAEPVVAY GEYLNKRKSR LTASLILHDV GSKDDRKDEV EDFKEGKIDF LFVYGMLLTG FDAKRLKKLY LGRIIKDHNL LQTLTRVNRP YKKFRYGFVV DFADIRKEFD ATNKAYFEEL QEELGDEIGT YSNLFKTKEE IDDEISDIKE KLFHYDLTNA EIFSQQISRI EDRKTMLEIK KALENARNLY NIIRLLGHFE LLEHTDFNKL NQLYREAVRH LELLSLKESV QNNVDATNLL NVALENVLFM FRKVSEEELI IADKLKDMLR KTREALGNNF DRKDPEFVSL YEELKRLFGK KNLDEITQDE MKRNIGSLQQ IFDKVTELNR RNNLLKAKYE NDAKYARIHK RLVEKGTVSK RESAIHATLM GIKKQADDQV LLNAKMLENE AYFDQKLMQM VIGGFGKAKI NLDPESAHFI NSCVTREYLN EYRGVAAW
|
| |