Gene Gura_0547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_0547 
Symbol 
ID5165527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp640745 
End bp643831 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content49% 
IMG OID640548049 
Producthypothetical protein 
Protein accessionYP_001229334 
Protein GI148262628 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTCA ATGAAAACGC CAGAGTGAAA ATACCGGCAA TATTACATTT ATGCCGTTTG 
GGCTATGACT ACCTGTCGCT ATCGGCAGCA ATCCGTGATG AAAGCACCAA TATTTTCACC
GACCTGTTTT CTGAAAGCAT CAGGCGCATC AATCCTGATC TTGAGGAAAG CGCAATAAAG
CGGTTATTGG AAGATATTTC ACTGATTCTG GATAATGAGG ACCTGGGCGA AGCATTTTAC
CGGATGCTCA CCGCTCGTTC TGGCGTGAAA CTGATCGACT TCAATGACTT CAGCAACAAC
AGTTTTCACG TAGTAACTGA ATTGACCTGC AAAAACGGCG ATGACGAATT CCGACCGGAT
ATCACGCTTT TGATCAACGG TATGCCCCTC GCTTTTATCG AAGTCAAAAT ACCGAACAAT
CAGGAAGGGA TTCTCAAAGA ACGGGACCGG ATGCTTGTTC GTTTCCGGAA CAAGAAGTTC
CGGAAGTTCA TCAATATTTC CCAGCTGCTG GTCTTTTCCA ATAACATGGA ATACGATCAG
GACTCAGTTG AACCTATCCA GGGAGCGTTT TACTCGTCTA CCTCCACCGG TGACGTTCAC
TTCAACTGTT TCCGCGAAGA AGAACAGTTC GACCTGTCAC AACTGCTGCA GCCGGAAGAC
AACGCCCGGG AAGACTTAGT CCTGAAGGAC AACAATCTTA ACGTCATCAA GCATTCACCG
GAGTTCATCA CCAACAAGGA ACCCACCACC CCCACAAACC GGCTGCTCAC CTCGCTGTTT
AGCTGTAATC GTTTGGCAAT GCTGCTTCAG TACGGTTTCG CCTACGTGCA GGAATCGAAG
GGTGTTGAGA AGCATATCAT GCGCTATCCG CAACTGTTCG CCACCAAGGC GATCCAGCAT
AAGCTCGATA ACGGCATCAA AAAGGGGATC ATCTGGCATA CCCAAGGAAG CGGCAAGACG
GCGCTTTCCT ATTACAACAT CCACTACCTG ACGGACTATT TTCAGCAGAA GGGAGTCATT
CCCAAGTTCT ATTTCATTGT TGACCGCATC GACCTGATGG AACAGGCCAA ACGGGAGTTT
TCCATCAGGG GGCTGGTCGT CCATACCGTA AATTCCAGAG AAGAGCTGCT GAAAAGCTTT
CGGGTGCGCA AGGCGATTCA CAACCTGTCC GGCAAGCGGG AAGTTACGGT CGTCAATATC
CAGAAGTTCC AGGACGATAC CCATCTGCTC CAGATGCAGG ACTATGACAT CGCTATCCAG
CGGGTCTATT TCCTCGACGA AGTGCACCGG AGCTACAACC CGAAAGGGAG TTTTCTGGCC
AACCTTATCA GTTCAGATCG GGAGGCCATT CTCATTGGCC TGACCGGCAC CCCTCTTATT
ATGGCAGACC GGAAGTCCAG GGATCTGTTC GGCGACTATA TCCATAAATA CTACTACAAT
GCTTCCATCG CCGACGGTTA TACCCTGCGG CTGATCAGGG AAGGGATTGA GACCAACTAC
AAAATCCAGT TGGAGCAAGC GCTGAAGGAA GTGGAAATTC TCAAAGGTGA TGCGGATAAG
CGGGTGATTT TTGCCCACGA AAAATTCGTT GAACCGATGC TGGATTATAT CGTCGAGGAT
TTCATCAACA GCAGAATCCG CCTGGGGGAT CACACCATCT GCGGCATGGT GGTCTGCGAT
AGTGCCGAGC AGGCTCGCAG GTTGTTCGAC ATCTTTATCG CCAAGTACAA TCCTGATCAG
AAAACGGTCG AAGATGTTTC AACGGAGTAT CTCAAGGTCG CCGAACCGGT TGTCGCCTAT
GGGGAGTATC TGAACAAGCG GAAAAGCAGA CTGACCGCCT CGCTGATCCT GCATGACGTT
GGCTCAAAGG ATGACCGCAA AGACGAAGTA GAGGATTTCA AGGAAGGCAA GATCGATTTT
CTTTTTGTCT ATGGTATGCT CCTGACCGGC TTTGACGCCA AGCGCCTGAA AAAGCTTTAT
CTTGGTCGGA TTATCAAGGA TCACAACCTG CTGCAAACCC TTACTCGGGT CAATCGACCG
TATAAAAAGT TCCGCTATGG TTTCGTGGTT GATTTTGCCG ACATCCGCAA GGAGTTTGAC
GCCACCAACA AGGCATATTT CGAGGAATTA CAGGAGGAGC TTGGTGATGA AATCGGGACC
TACTCCAACC TGTTCAAGAC CAAGGAAGAG ATTGACGACG AGATCAGCGA CATCAAGGAA
AAGCTGTTCC ACTACGACCT GACAAACGCT GAAATTTTTT CACAGCAGAT CAGCCGGATA
GAAGACCGGA AAACAATGCT GGAGATCAAG AAGGCTCTGG AGAATGCCCG CAACCTCTAC
AACATCATCC GACTCCTCGG CCATTTCGAA CTGCTGGAAC ATACCGACTT CAATAAACTG
AACCAGCTTT ATCGAGAGGC GGTTCGGCAC CTGGAGCTGC TGAGTCTGAA GGAATCGGTA
CAGAACAATG TCGATGCCAC TAACCTGCTG AATGTGGCCC TGGAAAACGT GCTCTTTATG
TTCCGCAAGG TTTCGGAAGA AGAACTGATT ATTGCAGACA AGCTGAAGGA CATGCTGCGC
AAGACCAGGG AAGCGCTGGG GAATAACTTC GACCGGAAAG ACCCGGAATT TGTTTCCCTT
TACGAAGAAC TAAAGCGGCT TTTTGGCAAG AAGAATCTGG ACGAAATAAC CCAGGACGAG
ATGAAGCGAA ATATCGGCTC GTTGCAGCAA ATTTTTGATA AAGTGACCGA GCTGAACCGC
AGGAACAACC TGCTAAAAGC GAAGTACGAG AACGACGCCA AATATGCCCG CATTCACAAA
CGGTTGGTGG AGAAGGGAAC TGTCTCAAAA CGGGAAAGCG CCATCCATGC CACCTTGATG
GGCATAAAAA AACAGGCTGA CGATCAGGTG TTGTTGAATG CCAAGATGCT TGAGAACGAG
GCCTATTTCG ACCAAAAACT TATGCAAATG GTAATCGGCG GCTTCGGCAA GGCAAAGATC
AACCTCGATC CGGAGAGTGC CCATTTCATC AACTCCTGCG TGACGCGGGA GTATCTTAAC
GAATACCGGG GAGTAGCTGC GTGGTAG
 
Protein sequence
MSFNENARVK IPAILHLCRL GYDYLSLSAA IRDESTNIFT DLFSESIRRI NPDLEESAIK 
RLLEDISLIL DNEDLGEAFY RMLTARSGVK LIDFNDFSNN SFHVVTELTC KNGDDEFRPD
ITLLINGMPL AFIEVKIPNN QEGILKERDR MLVRFRNKKF RKFINISQLL VFSNNMEYDQ
DSVEPIQGAF YSSTSTGDVH FNCFREEEQF DLSQLLQPED NAREDLVLKD NNLNVIKHSP
EFITNKEPTT PTNRLLTSLF SCNRLAMLLQ YGFAYVQESK GVEKHIMRYP QLFATKAIQH
KLDNGIKKGI IWHTQGSGKT ALSYYNIHYL TDYFQQKGVI PKFYFIVDRI DLMEQAKREF
SIRGLVVHTV NSREELLKSF RVRKAIHNLS GKREVTVVNI QKFQDDTHLL QMQDYDIAIQ
RVYFLDEVHR SYNPKGSFLA NLISSDREAI LIGLTGTPLI MADRKSRDLF GDYIHKYYYN
ASIADGYTLR LIREGIETNY KIQLEQALKE VEILKGDADK RVIFAHEKFV EPMLDYIVED
FINSRIRLGD HTICGMVVCD SAEQARRLFD IFIAKYNPDQ KTVEDVSTEY LKVAEPVVAY
GEYLNKRKSR LTASLILHDV GSKDDRKDEV EDFKEGKIDF LFVYGMLLTG FDAKRLKKLY
LGRIIKDHNL LQTLTRVNRP YKKFRYGFVV DFADIRKEFD ATNKAYFEEL QEELGDEIGT
YSNLFKTKEE IDDEISDIKE KLFHYDLTNA EIFSQQISRI EDRKTMLEIK KALENARNLY
NIIRLLGHFE LLEHTDFNKL NQLYREAVRH LELLSLKESV QNNVDATNLL NVALENVLFM
FRKVSEEELI IADKLKDMLR KTREALGNNF DRKDPEFVSL YEELKRLFGK KNLDEITQDE
MKRNIGSLQQ IFDKVTELNR RNNLLKAKYE NDAKYARIHK RLVEKGTVSK RESAIHATLM
GIKKQADDQV LLNAKMLENE AYFDQKLMQM VIGGFGKAKI NLDPESAHFI NSCVTREYLN
EYRGVAAW