Gene Gura_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4043 
Symbol 
ID5165930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4701384 
End bp4703858 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content59% 
IMG OID640551522 
ProductN-6 DNA methylase 
Protein accessionYP_001232760 
Protein GI148266054 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAAAC TCACCCTCAG CCAACTTTCC AGCCTTCTCT TCCGCGCCTG CGACGACCTG 
CGCGGCAACA TGGACGCCTC TGAATATAAA GAATATATCT TCGGAATGCT ATTCCTGAAA
CGCCTCTCCG ACCTCTTCGA CCAGGAGCGG GAACAGCTCG CCAAAGATTT AAAAGAAAAG
GGGATGGCCG AGGCGGTCAT CGCTGGCCAG CTTAACAACC CGGATAAATA CACTTTTTTC
GTGCCGGAAG AGGCTCACTG GTCTAATATC CGCCACCTCA AGACCAATGT CGGCACTAAT
CTCAACAAGG CGCTGGAAGC GCTGGAGGAC GCAAACGTTG ATGCTCTGCA GGACGTGCTC
AAGGGGATCA ACTTCAACAA GAAGATCGGC CAGCGTTCCC TTGACGACGA TACACTTGCC
AACTTCATCC AGAACTTCGA GAAGATCCCG CTTCGCGACG AAAACTTTGA GTTCCCTGAC
CTGCTCGGGG CGGCTTACGA ATACCTGATC AAATATTTCG CTGATTCCGC CGGCAAGAAG
GCGGGAGAGT TCTACTCTCC GGCCGATGTG GTGCGCACCC TGGTCGAGAT CGTTGATCCC
CAGCCTGGCA TGAGTGTGTA CGACCCCACC TGCGGCTCGG GTGGCATGCT CATCCAGACC
CGCGACTATG TCCGCGAATG TGGCGGTGAT CCTCGCGACC TCGCCCTTGC CGGGCAGGAA
AGCATCGGCA CCACCTGGTC CATCTGCAAG ATGAACATGC TCCTCCACGG CATCGAGCAC
GCCGACATCC GCCAGGAGGA CACCCTGCGT CACCCGCAAC ACAAGGCAGA GAACAATGAA
CTGCAACGAC ACGACCGCGT CCTTGCCAAT CCCCCCTTCA GTCAGAACTA CATAAAAAAA
GACATCGACT ATCCCGGCCG TTTCGCCGTC TGGCTTCCGG AAAAGGGAAA AAAGGCCGAC
CTGATGTTTG TCCAGCACAT GCTGGCCGTG CTCAAGGCCG ACGGCAAGAT GGCCACCGTC
ATGCCCCACG GCGTGCTCTT CCGGGGAGGC GAGGAAAAGG CGGCGCGCAG GCACTTCATA
GAGCACGGCT GGCTGGAGGC GGTAATCGGT CTACCCGCAG GGCTCTTTTA CGGCACAGGC
ATCCCTGCCT GCGTGTTGGT GATGAACAAA AAGGACGCCG GTTCGGGTGA TAACGTGCGC
GACCACGTCT TCTTCATCAA TGCCGACCGG GAATATCGTG AAGGGAAGGC GCAGAATTTC
CTGCGTCCAG AGGATATTTC CAAGATCGTC CACGCCTACC GTACCATGGC GGACGTGCCC
GGCTACGCTC GCCGGGTGCC GGTGAGCGAG ATCAAAGTCG AGGATTACAA CTGCAATATC
CGCCGCTATG TGGACAACGC CCCGCCGCCC GAGCCCCACG ACGTGCGTGC CCACCTGCAC
GGCGGGGTTC CGACCGTTGA GATTGAGGCC ATGGCCCGTT ACTGGACCAA CTATCCTGGT
CTGCGAGAGC GCTGTTTTGT ACCGCGTCCG GGCGATTCCC TCTATGCCGA TTTCACCCCT
GCCGTCACCG ATCGACGAAC TTTGGCCGAA CTGGTCAAAA CCGACCCTGG CGTGGTGGTG
GCGCAGAGCC GCTTTTTGCA GACACTGGAA ACGTGGTGGG TGGAGAACCT GCCGCTGATT
GAAGATCTCG CCCCCAGGAA CGGCCAGAAG GGGAATGTCT ATGAACTGCG CCGTGGGCTT
CTTGCCACCA TCGCCTTGAC CTTTTCTGGG CAGGGCCTGT TGACCGAACA CCAGATTCGT
GGTGCCTTTG CCCGGTATAT GGATGACCTG AAGGCAGATC TCAAATCCAT CGCCGCCAGC
GGCTGGGGGC CGGAACTGAT TCCCGACGCC GACATCCTGG AGAGCCAGTT CCCGGAACTC
CTGGCCGACA TGGAACAGAA GCGCCTGCGG TTAGCCGAAC TCTCGGCTCT CTTTGCCGCC
GCCGATGAAG AGGATTACGA AGACAGCGAC GACACCGGAG TCCTCCCCGC CGAGGATGTC
AAGGCGCTCA AGTCCGAACT CAAGGAGGCC AATGCCCAGG CCAAGCTGGC GAAACGGGAA
AACCGGGACG CCACGGCTTT CACAGCCCGC GCCCAAGCCG CGGAGGCACG TCTTGCCCGG
CACAAAGCGC TGGAGGATGA GGGGAAGCAG TTGAAAGCAG AGCTGCGTGC CACGGAGAAG
AAACAGGAGG ATCTCGTTGC TGCCGCCCGA GACAAGATCG ACCACGATGC AGCCCGTCGT
GTCATCCTCG ACCGGCTGCG TCGCCTGCTG GTGCAGACCT ATGAAGGGTA TCTGCAGGCC
GACCAGCGGG CGTGTCTTGC TGCGCTGGAG AATCTGCATG GGAAATATGC CGTTACTGTC
AAGGACATTG AGGCCAGGCG GGATTTGGCT GCGGAGAAGT TGAAGGGGTT TTTGGTGGAG
TTGGGGTATG AGTGA
 
Protein sequence
MSKLTLSQLS SLLFRACDDL RGNMDASEYK EYIFGMLFLK RLSDLFDQER EQLAKDLKEK 
GMAEAVIAGQ LNNPDKYTFF VPEEAHWSNI RHLKTNVGTN LNKALEALED ANVDALQDVL
KGINFNKKIG QRSLDDDTLA NFIQNFEKIP LRDENFEFPD LLGAAYEYLI KYFADSAGKK
AGEFYSPADV VRTLVEIVDP QPGMSVYDPT CGSGGMLIQT RDYVRECGGD PRDLALAGQE
SIGTTWSICK MNMLLHGIEH ADIRQEDTLR HPQHKAENNE LQRHDRVLAN PPFSQNYIKK
DIDYPGRFAV WLPEKGKKAD LMFVQHMLAV LKADGKMATV MPHGVLFRGG EEKAARRHFI
EHGWLEAVIG LPAGLFYGTG IPACVLVMNK KDAGSGDNVR DHVFFINADR EYREGKAQNF
LRPEDISKIV HAYRTMADVP GYARRVPVSE IKVEDYNCNI RRYVDNAPPP EPHDVRAHLH
GGVPTVEIEA MARYWTNYPG LRERCFVPRP GDSLYADFTP AVTDRRTLAE LVKTDPGVVV
AQSRFLQTLE TWWVENLPLI EDLAPRNGQK GNVYELRRGL LATIALTFSG QGLLTEHQIR
GAFARYMDDL KADLKSIAAS GWGPELIPDA DILESQFPEL LADMEQKRLR LAELSALFAA
ADEEDYEDSD DTGVLPAEDV KALKSELKEA NAQAKLAKRE NRDATAFTAR AQAAEARLAR
HKALEDEGKQ LKAELRATEK KQEDLVAAAR DKIDHDAARR VILDRLRRLL VQTYEGYLQA
DQRACLAALE NLHGKYAVTV KDIEARRDLA AEKLKGFLVE LGYE