Gene Gura_4301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4301 
Symbol 
ID5166808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4966999 
End bp4969380 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content57% 
IMG OID640551780 
Producthypothetical protein 
Protein accessionYP_001233017 
Protein GI148266311 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.355479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCGCCT ACGACACCGA AGGTTTCGGC CCTGAGAGCC TCCTGGGGCG AACCTCCGAC 
CGGGAGGTAG TACTCCGGAG AGACGTGGAG GCCGCACTGC GCCGCCTGAA TCCGGGCCTG
CCGGATGATG CCTACCACGA TGCGCTGGCT CAGGTAACGG CGGACGACCG CACCAAGACT
CTGCTACAGG TGAACAAGGA AAAGTACCGG CTACTGCGGG ACGGAGTGCC GGTCAAATAT
CGCGACGAAG CGGGGAGGAT GACTGACCGG CGGCTGAAAC TGGTCGATTT TGACGACCCG
GCCAACCCGA AGAAGAACCG GTTTCTGGTG GTAAGGGAAC TTTGGGTAAA GGGAGATACG
TACAGGCGAA GGCCTGACGT TCTCGGCTAT GTCAACGGAT TGCCATTGGT GTTCATCGAG
CTGAAACGCT ATGACCAGCA CATCGATAAG GCATTTAAGC AGAACTACAG CGATTACAAG
GACACCATCC CCCACCTGTT CCACTGGAAC GCGCTGATCC TGCTGTCCAA TGGCGTCGAT
GCCAAGTACG GCTCCATCAC TTCTATCATG GAGCATTTCT CTCGCTGGAA ACGGCAAAAA
GAAGAAGACC CGGAACCGAC CGCCGATCAG CCACTCCTGC CGTTGTTACT GCGCGGCATG
CTGAACAAGG AAGCCCTCCT CGATCTGGTT GAGAACTTCA TTCTCTTTGA CCGGACCGAG
GGGGAACTCC AGAAGATAGT GGCGCGTAAC CACCAGTATC TTGGCGTAAA CCAAGTGATC
GGCAAGCTGC TGTCGAAAGA GCCGGGCATG CAGGCCGAGG TAGAGGCGGG ACGGCTGGGG
GTATTCTGGC ATACCCAGGG TTCGGGGAAG TCATACTCGA TGATCTTCCT GACCGAGAAG
ACCCACCGCA AGATCTCGGC CAAGTATACC TTTGTGGTGA TGACCGACCG GAACGAACTG
GACGAACAGA TTTTCGGCAC CTATACCGGC TGCGGCGCGG CCACCAACAA GAAGGCGAAA
GCCATGGACG GCAAGGCACC GGACAGATTC ACGCTGATTC GCCGTGCCCA GGTGGAGTGG
ATGAAGGAGA CGGAGATTTG CGTCGTCGTT TCGCCGGAGC AGGGAGAAGT GGCCGAGTTC
CGCAAGTGGG AGCTGGATAT CGTTCCGCAC CGGGAAAAGA TGGTCCATCG GGATCTGAAC
TTGGAGTTCA AGAAGCCGGA ACACCCGTTC CGGGTGGTCA TCGTCTGTGC CATGTGGCTG
ACCGGTTACG ACGTGAAGTG CCTTGCCACC CTCTACCTGG ACAAACCGAT GAAGGGCCAC
ACCCTGATGC AGGCCATCGC CCGCGTGAAC CGGGTCGGCG GCGGCAAGAA GAACGGTCTC
ATCATCGACT ACAACGGCAT GCTGAAGAGT TTGCGAAAGG CGCTGGCTAC ATTCGCTCAA
GGTGACCGCA AGGGCTCTGA CCAGGACATC CTTCGTGACG ATACCGAGGC AGTGGCTGAG
TACGGCCAGT CGATCCGGGC AGCACAGGAT TTCCTGACCG GCTGCGGATT CAATCTGGAC
GAGCTGATCG CAGCCAACGG GTTCGACAAG CAAGCGATGA TCCTGCGGGG GGTAAACACT
GTCTGCGAGA CTGACGAACG GCGCAAGACC TTCGAGGTCA TGGCCGATGA CATCGCAGCC
AGGTTCCGGG GCATCTTTCC CAATCCAGGA CTGTACGCTT ACGACGAGCA GGAGAATGCA
ATCGCGGCCA TCTATAACCG GTTGCAGGAG AGCAAGGAAA GCCCGGATGT CAGCGAAATG
CTCCAGGCGC TTTATGCTGT GATAGATACG GCGGTGACCA CCGATACCTT GACCGTAAAT
GAGCCCCCTG TACGCTACGA GCTGACCAAA ATCGATATCA GCCGCTTGCA GGCTGAATTC
GAGCGCACGT GCCCCAACAT CAAGATGCTC AACCTGCGGG AAAAGATCGA AAAGCGGCTT
GAGGCGATGA TCGCACGGAA TCCGACCCGC GTGGATCTGT ACGAGCGCTA CCAGGAGATC
GTGGCGGAGT ATAACAAGGA GTATAACAAG GACAAGGATG CCGTGGAAGT GCAGAAGGTG
TTCGACCTGC TGCAGAAGGA CACCCAGACC CGGCCCGAAC GGGAACGGAT CAAGGAGGTG
GCAAAGGAAC TGCTGGACAA GCTGCTATCC GACAAGCTCC AGATCGACCA TTGGCGGGAA
AAAGCCACGG CCCAGGCCCA GGTCAAAGCA GAAATCATCA AGCATCTCTT CGTCAACCTG
CCTGAAACAG GTTATGCAGA GCACGAAATT TCCGCACGGG CAGACCTGGT GTTTGCTCAT
CTCTATCAGA CATGCGCGGG AACGATGGCA TTTCACCAAT GA
 
Protein sequence
MFAYDTEGFG PESLLGRTSD REVVLRRDVE AALRRLNPGL PDDAYHDALA QVTADDRTKT 
LLQVNKEKYR LLRDGVPVKY RDEAGRMTDR RLKLVDFDDP ANPKKNRFLV VRELWVKGDT
YRRRPDVLGY VNGLPLVFIE LKRYDQHIDK AFKQNYSDYK DTIPHLFHWN ALILLSNGVD
AKYGSITSIM EHFSRWKRQK EEDPEPTADQ PLLPLLLRGM LNKEALLDLV ENFILFDRTE
GELQKIVARN HQYLGVNQVI GKLLSKEPGM QAEVEAGRLG VFWHTQGSGK SYSMIFLTEK
THRKISAKYT FVVMTDRNEL DEQIFGTYTG CGAATNKKAK AMDGKAPDRF TLIRRAQVEW
MKETEICVVV SPEQGEVAEF RKWELDIVPH REKMVHRDLN LEFKKPEHPF RVVIVCAMWL
TGYDVKCLAT LYLDKPMKGH TLMQAIARVN RVGGGKKNGL IIDYNGMLKS LRKALATFAQ
GDRKGSDQDI LRDDTEAVAE YGQSIRAAQD FLTGCGFNLD ELIAANGFDK QAMILRGVNT
VCETDERRKT FEVMADDIAA RFRGIFPNPG LYAYDEQENA IAAIYNRLQE SKESPDVSEM
LQALYAVIDT AVTTDTLTVN EPPVRYELTK IDISRLQAEF ERTCPNIKML NLREKIEKRL
EAMIARNPTR VDLYERYQEI VAEYNKEYNK DKDAVEVQKV FDLLQKDTQT RPERERIKEV
AKELLDKLLS DKLQIDHWRE KATAQAQVKA EIIKHLFVNL PETGYAEHEI SARADLVFAH
LYQTCAGTMA FHQ