Gene Acid345_3758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3758 
Symbol 
ID4069333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4440674 
End bp4442209 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content56% 
IMG OID637985780 
ProductN-6 DNA methylase 
Protein accessionYP_592832 
Protein GI94970784 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.921564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAACG GAACCATCTC AGAAATCGAT GAGGCGAAGC TCTGGAGCAT GGCAGATGCG 
CTGCGCAACA ACATGGACGC TGCGGAGTAC AAACACGTCG TCCTCGGTCT TATCTTCCTG
AAATACATCT CCGACGCCTT CGAGGCGAAG CACGCCGAAC TTGAACAAAA GATGGATCAG
GGGGCCGATC CGGAGGACCC CGATGAGTAC CGCGCTGTCA GTATCTTTTG GGTTCCCAGA
GAAGCTCGTT GGGCACACCT GAAGGACAAC GCTCCGCAAC CCAAAATCGG GACGCTTGTC
GATGATGCAA TGGCGGCGAT TGAGCGAGAT AATCAGTCGC TGAAGGGCGT ATTGCCGAAG
GATTATGCCC GGCCTGGGCT AGACAAACAA AGGCTTGGGC AATTGATCAA TCTCGTGAGT
GGCATCGGTC TTGGTACGCC CGCCGCCCGG GCCAAGGACA TCTTAGGTCG CGTTTACGAG
TACTTCCTTG CTCAGTTTGC CAGCGCTGAA GGTAAAAAGG GCGGGCAGTT CTATACGCCC
TCTCACGTCG TTCGAATCCT CGTCGAGATG CTAGCTCCGT ACAAAGGGCG AGTCTACGAC
CCATGCTGCG GATCAGGCGG CATGTTCGTC AGCAGCGAGA AGTTCATAGA GGCTCACAGT
GGAAAACTCG GCGACATCTC GATCTACGGT CAAGAATCGA ATTACACAAC TTGGCGCTTG
GCGAAGATGA ATCTGGCGAT TCGGGGCATC GATGCTCAAA TCCAGCACGG CGACACTTTT
CACAACGATC GCCACCCAGA CCTGAAGGCC GATTGTGTTC TGGCAAACCC TCCCTTCAAT
GACAGCGATT GGCGCGGGGA ACTGCTGAAG GAGGACAAGC GTTGGGTCTT CGGTGTACCT
CCGGCCGGTA ACGCGAACTT TGCATGGATC CAGCACTTCA TCTATCACCT CGCGCCGACC
GGCTTGGCGG GCTTCGTCCT CGCTAATGGC TCGATGTCCA CGAATACCTC AGGGGAGGGT
GAGATCAGGA AAGGCATCAT CGAATCTGAT CTCGTCGATT GCATGGTGGC ACTCCCCGGA
CAGCTTTTCT ATTCGACGGG CATTCCGGTT TGCCTTTGGT TCGTCGCCCG GAGCAAGTCG
AGTGGTCGGT TCCGTAACCG CAGGGGCGAA ACGCTCTTCA TCGACGCGCG AAAGTTCGGA
TCGCTGATTG ATCGTGTGCA CCGGGAATTA AGCGACGCGG ATGTCGCCAA GATCGCCGGA
ACCTATCATG CGTGGCGTGG TGATGAGGGC GCAGGAGGCT ACGCCGATGT TGCGGGCTTT
TGTAAGGCCG CAACGTTGGA CGATATCCGG AAGCACAGTC ACATCCTCAC CCCGGGACGA
TATGTGGGTG CGAAGGAGAC TGAAGATGAC GGCGAACCAA TTGAGCAAAA GATGAAGACT
TTGACGGACG CACTGCGCAT GCAGTTAGCA GAGAGCAGAA AACTCGAAGG GGAGATAACC
GGCAATCTCA AGGAAATAGG TTATGAATTG GCCTAA
 
Protein sequence
MANGTISEID EAKLWSMADA LRNNMDAAEY KHVVLGLIFL KYISDAFEAK HAELEQKMDQ 
GADPEDPDEY RAVSIFWVPR EARWAHLKDN APQPKIGTLV DDAMAAIERD NQSLKGVLPK
DYARPGLDKQ RLGQLINLVS GIGLGTPAAR AKDILGRVYE YFLAQFASAE GKKGGQFYTP
SHVVRILVEM LAPYKGRVYD PCCGSGGMFV SSEKFIEAHS GKLGDISIYG QESNYTTWRL
AKMNLAIRGI DAQIQHGDTF HNDRHPDLKA DCVLANPPFN DSDWRGELLK EDKRWVFGVP
PAGNANFAWI QHFIYHLAPT GLAGFVLANG SMSTNTSGEG EIRKGIIESD LVDCMVALPG
QLFYSTGIPV CLWFVARSKS SGRFRNRRGE TLFIDARKFG SLIDRVHREL SDADVAKIAG
TYHAWRGDEG AGGYADVAGF CKAATLDDIR KHSHILTPGR YVGAKETEDD GEPIEQKMKT
LTDALRMQLA ESRKLEGEIT GNLKEIGYEL A