Gene Gura_4037 details


Gene Information

Locus tag: Gura_4037
Symbol:
ID: 5166252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4691718
End bp: 4694972
Gene Length: 3255 bp
Protein Length: 1084 aa
Translation table: 11
GC content: 60%
IMG OID: 640551516
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_001232754
Protein GI: 148266048
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
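
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4691718
END_BP = 4694972
GENE_LENGTH_BP = 3255
PROTEIN_LENGTH_AA = 1084


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```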


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAAT ATCTCCACGT AGAAAAACCA TTCCTTGACC AGCTCGCCGC ACTGGGCTGG 
ACGGTAATCG ACCAAGGTCA GGGGTTTATC CCCTCCGATC CTACGGCCAG CCTGCGCGAC
ACCTTCCGCG AATGGCTGCT CCCTGATGTC TTCCACAATG CCGTGCGCGC CATCAACCGC
ACCGCCGACG GTACCCCCTG GCTGACCGAT CGCCAACTCG ATGACCTCCG GTCACAAATC
CTGCGCCAGC CAAACCGTAC CCTGCTTGAA GCCAACGAGG CGGTACAGGC GCTCTTCCTC
AAGGCACAAG TGGATCGCAA TGAGATCAGC GGCGAGCCAG ATCCTGTGGT CCAGCTCATT
GACTTCTCCC ACCCCGAGCG CAATCAGTTC CACGCCATCA ACCAGTTCCG CATCGACACG
CCCGGCTACG TGAAGAAATG CATCATCCCG GATATCGTAC TCTTTGTGAA CGGCATCCCG
CTGGTGGTGA TCGAGGCCAA GATCGGCGAC GCCACCACGG CCAACCCGAT GCACGCCGCC
TTCGAGCAAC TGCTTCGCTA CCGCAACGGC CGCCCGGAAA CGGCTGCTGC CGGCCTGCGC
GAAGGAGAGC CGCGCCTGTT CCACACGAAT TTGCTGCTGA TACGCACCTG TGGCGAGAAA
GCGGAGTTTG GCAGCATCAC CTCGGGCCAC GAGCACTTCT ACGCCTGGAA GGACATTTGG
CCGGAGGAGA ACCGCAATTA CACGCCGCCG CTGGGGATCG AGCGCGAACA GGAGCGGCTG
ATCCAGGGGC TGCTCGCCCC AACGACGCTC CTGGATGTGC TGCGCACCTG CACGGTCTTC
ATGGATACCG ACTCCGGCAA GCGGGTGAAG GTGGTCTGCC GTTACCAACA GTACCGCGCT
GCCCACCGGA TCGTTGAGCG CCTGCGCACC GGCAAGACAT CAGAGGAGCG CTCTGGCGTC
GTCTGGCACA CCCAGGGCTC GGGCAAGTCA CTGACTATGG TTTTCGTGGC CCGCATGCTG
CGGGCATCAC AAGACCTGGC CGACTTCAAG ATCCTGCTGG TCAACGACCG CGTCGATCTG
GAGGATCAAC TTGCAGCCAC TGCCAAGCTG ATCGGCGGCA AGGTTAATGT GATTGAGAGT
ACGGCAGCGT TGCGCGAGCA CCTGAGTACC GATACCTCCG ACATTAACAT GGTGATGGTA
CACAAGTTCA TGGAACGTGC CGAAGCGCTG CCGACCATGG TTGCCGAGGC GCTTGCAGCC
TACCGCCCGC CGCCGTCCGG AGCAACTTTT GGCGTGGTCA ATCCGTCCGA ACGCATCCTG
CTGATGATTG ACGAGGCGCA CAGGACGCAA GGCTCCGACC TGGGTGAAAA CATTTTCGAG
GCATTCCCCA ACGCTACCCG CATCGCCTTC ACCGGCACCC CGCTTATCTC CGAGCAACAC
GGCAGCAAGC GCACGGTGAA GCGTTTCGGC GAATACATCG ACACCTACAA GCTGATGGAC
GCGGTGCATG ACGGCGCGAC GCTGCAGATT CTCTACGAAG GACGCACGGC GGACACGGCG
TTGAAGGACA AGCACGGCTT CGACACCAAG TTCGAAGACC TCTTCAAGAA CCGGAGCGAA
GAAGAACTCC TCGTCATCAA GAAGAAATAT GGCGCCAGCG GCGACATCCT GGAAGCCGAG
CAGCGCATTG CGGCCATTGC CCGCGACTTG GTGACGCACT ACGTCGACAA CATCCTCCCC
GACGGTTTCA AGGCGCAGGT TGTCTGCCAC TCCAAGCTGG CGGCGATCCG CTATCAGAAA
TCGATCCGTG AAGCCCTGGC CGAGCGCCTT GACGTGGAGA AGCTCAAACA GAAGCCCGAC
ACCGAACTGA TTCGCCGCAT CGCCTTCCTG AAGGCCGTGG TAGTGGTTTC CGCTGACGCC
ACCAACGAAC TGGCCGCCAT CACCGAGGCG CGCAAGGAGG CGAAACGCTG GAACGCGGTG
GAGAACTTCT GCAAACCTTT TGATCTGGAC GACCCGGACA AGGATCTGAC TGGAATCGCC
TTCCTGATCG TCTGTGACAT GCTGCTGACC GGCTTCGACG CCCCGGTTGA GCAGGTGATG
TATATCGACA AGCGTTTGCG CGAGCACAAC CTGCTACAGG CCATTGCCCG GGTGAACCGG
GTGACGAAGA ACAAGCACCG GGGGTTCATC GTCGATTACA TCGGCCTGGC CAACCACCTG
ATGGAGGCGC TCAGCATCTA TTCCGATGAG GATGCAGAAG ACATCCAGCA AGGGCTGAAA
AACCGGCTCA CCGAACTGCC GATTCTGGAA GAACGCTATC AGCGTCTCCT GCAGCATTTC
CGCGCAGCCG GGGTAGGGGA GATCGAAGCC TTCGTAAAGG GTGAACTGGC AACACCAGAG
TCTGACGTGG CCGTGGTGCA TGCGGCTGTG GGAGCGATGC AGGACATCAA GCGCCGGGCG
GACTTCGAGG TCTACCTGAA GAAGTATCTC CAAAGCCTGA ACCTGATCCT CCCCCACGCG
TCGGGGCATC CCTACCGCGG CCCGGCAAGG CGTTTCGGCT ATCTCCTCCG CATGGTCAAG
GAGCGCTACA AGGACGACTC CCTCGACATC TCCGACGCCG GTGAGAAGGT GAAAGCGCTG
ATCAACGAGC ACCTTATCGA CCTGGGGATA AATCCCAAGA TCCCCCCCAT AGAGCTTCTG
TCGGACGACT TCATCTCCAA TGTGCAGAAG CATTCCCAGG GAAACCCGGA GGCGAAGGCG
AGTGAGATGG AGCATGCCAT CCGCAAACAC TGCACAATCC ACTTCGACGA AGACCCGGCC
TTCTACAAGC GGCTGAGCGA GAAGCTGGAA AAGCTGATCC AGGAGCACAA GAACCAATGG
GAGGTGCTGG CCGAGGGGTA TGAACAACTC AGAAGCGAAG CCCTGGCGGG CCGGACGGAT
GCTGAGGAAG GCTTGACCAG GGAGGCTACC ACCTTCTACG ACTACGTGGT GCAGCTCGCT
TTCGAGAATG GCGAAGTGCC CGGGAATCAT CGGCATCAGT TGAAGAGGCT GATGGCTGGC
ATCGTGGAGA TGTTGCAAGG TACTATCGGC ATCATCGACT TCTGGAAGAA GCCAATCGAG
GTAAAGAAAC TTCGGGGGAA CATCGACACC GAGATCCTGC TGACCGAGAT CCCGCAACTC
ATCGACAAAC ACGAACGGAT CGCGGTGGAG ATTGTGAAGC TCGCCGAAAA ACGCCACCAG
GAGCTGACAC GATGA
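
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. As a rough sketch (the helper names are hypothetical), the blocks can be normalized and the GC fraction computed for comparison with the rounded "GC content" field:

```python
def normalize(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full sequence
    # to compare against the GC content reported for the whole gene.
    first_line = (
        "ATGTCTGAAT ATCTCCACGT AGAAAAACCA "
        "TTCCTTGACC AGCTCGCCGC ACTGGGCTGG"
    )
    print(len(normalize(first_line)))
    print(round(gc_content(first_line), 2))
```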
 
Protein sequence
MSEYLHVEKP FLDQLAALGW TVIDQGQGFI PSDPTASLRD TFREWLLPDV FHNAVRAINR 
TADGTPWLTD RQLDDLRSQI LRQPNRTLLE ANEAVQALFL KAQVDRNEIS GEPDPVVQLI
DFSHPERNQF HAINQFRIDT PGYVKKCIIP DIVLFVNGIP LVVIEAKIGD ATTANPMHAA
FEQLLRYRNG RPETAAAGLR EGEPRLFHTN LLLIRTCGEK AEFGSITSGH EHFYAWKDIW
PEENRNYTPP LGIEREQERL IQGLLAPTTL LDVLRTCTVF MDTDSGKRVK VVCRYQQYRA
AHRIVERLRT GKTSEERSGV VWHTQGSGKS LTMVFVARML RASQDLADFK ILLVNDRVDL
EDQLAATAKL IGGKVNVIES TAALREHLST DTSDINMVMV HKFMERAEAL PTMVAEALAA
YRPPPSGATF GVVNPSERIL LMIDEAHRTQ GSDLGENIFE AFPNATRIAF TGTPLISEQH
GSKRTVKRFG EYIDTYKLMD AVHDGATLQI LYEGRTADTA LKDKHGFDTK FEDLFKNRSE
EELLVIKKKY GASGDILEAE QRIAAIARDL VTHYVDNILP DGFKAQVVCH SKLAAIRYQK
SIREALAERL DVEKLKQKPD TELIRRIAFL KAVVVVSADA TNELAAITEA RKEAKRWNAV
ENFCKPFDLD DPDKDLTGIA FLIVCDMLLT GFDAPVEQVM YIDKRLREHN LLQAIARVNR
VTKNKHRGFI VDYIGLANHL MEALSIYSDE DAEDIQQGLK NRLTELPILE ERYQRLLQHF
RAAGVGEIEA FVKGELATPE SDVAVVHAAV GAMQDIKRRA DFEVYLKKYL QSLNLILPHA
SGHPYRGPAR RFGYLLRMVK ERYKDDSLDI SDAGEKVKAL INEHLIDLGI NPKIPPIELL
SDDFISNVQK HSQGNPEAKA SEMEHAIRKH CTIHFDEDPA FYKRLSEKLE KLIQEHKNQW
EVLAEGYEQL RSEALAGRTD AEEGLTREAT TFYDYVVQLA FENGEVPGNH RHQLKRLMAG
IVEMLQGTIG IIDFWKKPIE VKKLRGNIDT EILLTEIPQL IDKHERIAVE IVKLAEKRHQ
ELTR
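
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is an illustration rather than the database's own method: it builds the codon table from the usual TCAG ordering, relying on the fact that table 11 assigns the same amino acids as the standard code and differs in its permitted start codons, which this sketch ignores. The `translate` function is a hypothetical helper.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocks: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(blocks.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks and the final line of the gene sequence above.
    print(translate("ATGTCTGAAT ATCTCCACGT AGAAAAACCA"))
    print(translate("GAGCTGACAC GATGA"))
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above.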