Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3758 |
Symbol | |
ID | 4069333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4440674 |
End bp | 4442209 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985780 |
Product | N-6 DNA methylase |
Protein accession | YP_592832 |
Protein GI | 94970784 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.921564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCAACG GAACCATCTC AGAAATCGAT GAGGCGAAGC TCTGGAGCAT GGCAGATGCG CTGCGCAACA ACATGGACGC TGCGGAGTAC AAACACGTCG TCCTCGGTCT TATCTTCCTG AAATACATCT CCGACGCCTT CGAGGCGAAG CACGCCGAAC TTGAACAAAA GATGGATCAG GGGGCCGATC CGGAGGACCC CGATGAGTAC CGCGCTGTCA GTATCTTTTG GGTTCCCAGA GAAGCTCGTT GGGCACACCT GAAGGACAAC GCTCCGCAAC CCAAAATCGG GACGCTTGTC GATGATGCAA TGGCGGCGAT TGAGCGAGAT AATCAGTCGC TGAAGGGCGT ATTGCCGAAG GATTATGCCC GGCCTGGGCT AGACAAACAA AGGCTTGGGC AATTGATCAA TCTCGTGAGT GGCATCGGTC TTGGTACGCC CGCCGCCCGG GCCAAGGACA TCTTAGGTCG CGTTTACGAG TACTTCCTTG CTCAGTTTGC CAGCGCTGAA GGTAAAAAGG GCGGGCAGTT CTATACGCCC TCTCACGTCG TTCGAATCCT CGTCGAGATG CTAGCTCCGT ACAAAGGGCG AGTCTACGAC CCATGCTGCG GATCAGGCGG CATGTTCGTC AGCAGCGAGA AGTTCATAGA GGCTCACAGT GGAAAACTCG GCGACATCTC GATCTACGGT CAAGAATCGA ATTACACAAC TTGGCGCTTG GCGAAGATGA ATCTGGCGAT TCGGGGCATC GATGCTCAAA TCCAGCACGG CGACACTTTT CACAACGATC GCCACCCAGA CCTGAAGGCC GATTGTGTTC TGGCAAACCC TCCCTTCAAT GACAGCGATT GGCGCGGGGA ACTGCTGAAG GAGGACAAGC GTTGGGTCTT CGGTGTACCT CCGGCCGGTA ACGCGAACTT TGCATGGATC CAGCACTTCA TCTATCACCT CGCGCCGACC GGCTTGGCGG GCTTCGTCCT CGCTAATGGC TCGATGTCCA CGAATACCTC AGGGGAGGGT GAGATCAGGA AAGGCATCAT CGAATCTGAT CTCGTCGATT GCATGGTGGC ACTCCCCGGA CAGCTTTTCT ATTCGACGGG CATTCCGGTT TGCCTTTGGT TCGTCGCCCG GAGCAAGTCG AGTGGTCGGT TCCGTAACCG CAGGGGCGAA ACGCTCTTCA TCGACGCGCG AAAGTTCGGA TCGCTGATTG ATCGTGTGCA CCGGGAATTA AGCGACGCGG ATGTCGCCAA GATCGCCGGA ACCTATCATG CGTGGCGTGG TGATGAGGGC GCAGGAGGCT ACGCCGATGT TGCGGGCTTT TGTAAGGCCG CAACGTTGGA CGATATCCGG AAGCACAGTC ACATCCTCAC CCCGGGACGA TATGTGGGTG CGAAGGAGAC TGAAGATGAC GGCGAACCAA TTGAGCAAAA GATGAAGACT TTGACGGACG CACTGCGCAT GCAGTTAGCA GAGAGCAGAA AACTCGAAGG GGAGATAACC GGCAATCTCA AGGAAATAGG TTATGAATTG GCCTAA
|
Protein sequence | MANGTISEID EAKLWSMADA LRNNMDAAEY KHVVLGLIFL KYISDAFEAK HAELEQKMDQ GADPEDPDEY RAVSIFWVPR EARWAHLKDN APQPKIGTLV DDAMAAIERD NQSLKGVLPK DYARPGLDKQ RLGQLINLVS GIGLGTPAAR AKDILGRVYE YFLAQFASAE GKKGGQFYTP SHVVRILVEM LAPYKGRVYD PCCGSGGMFV SSEKFIEAHS GKLGDISIYG QESNYTTWRL AKMNLAIRGI DAQIQHGDTF HNDRHPDLKA DCVLANPPFN DSDWRGELLK EDKRWVFGVP PAGNANFAWI QHFIYHLAPT GLAGFVLANG SMSTNTSGEG EIRKGIIESD LVDCMVALPG QLFYSTGIPV CLWFVARSKS SGRFRNRRGE TLFIDARKFG SLIDRVHREL SDADVAKIAG TYHAWRGDEG AGGYADVAGF CKAATLDDIR KHSHILTPGR YVGAKETEDD GEPIEQKMKT LTDALRMQLA ESRKLEGEIT GNLKEIGYEL A
|
| |