Gene Information |
Locus tag | Acid345_1127 |
Symbol | |
ID | 4069897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1404454 |
End bp | 1405533 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637983136 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_590204 |
Protein GI | 94968156 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
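The start and end coordinates, gene length, and protein length listed above are mutually consistent, and this can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon that is not counted in the protein. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(1404454, 1405533)
print(length, protein_length(length))
```

With the values from this record, the two functions give 1080 bp and 359 aa, matching the Gene Length and Protein Length fields.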
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000503374 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000088158 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCTAAAAT CCCGATTTAT GAAGCGCCGA GATAGCGATT CAGCTCCGTT TTTGTTGGAA GTTGGATACG AGGCAGAGAA TCCCTGGCGA ATTGCTGAAA GCCAGCCGCC GAAAGGCAAA AGCGGTTTGG TCACGCTCGA ACTCTGCGCC GGTGCTGGGG GTCAAGCTCT CGGGTTAGAA CAGGCCGGTA TCAATCATGT CGCGCTCGTA GAGATAAATA AACACGCATG CGAGACACTT CGACTTAATC GTCCCAATTG GAAAGTGGTT GAGGGCGATC TGCAAACGTT CGATCCTTCC CCGTACAAGG GCGCTGACAT TGTCTCGGCA GGATTGCCAT GCCCCCCCTT TTCTGTTGCC GGAAAGCAGT TGGGAAAGTT GGACGAGAGA AATCTCTTTC CGGCGATGGT GAATGTCGTC GACGCGGTAA GACCGCGAGC CGTTATGGTG GAGAATGTTC GTGGCATTCT TGATGCGGTA TTCATTGACT ATCGCGAGCA CGTGAGCAAG CAGCTGCGAA AACTTGGATA TACCCCGGGT TGGCATTTGA TGAACGCCTG CGAATTCGGA GTTCCTCAAC TTCGGCCGCG GGTTGTATTC GTAGCGATGC GAAAGGAGTA TTCCGAGCAC TTCGCTTGGC CGCGCGCGAC TAACGAGCCT CAGACGGTCG GAGATGTATT ATTCGACTTG ATGAGTGCGC GCGGCTGGAA AGGTGTGAAA GCTTGGCGCG CGAAAGCAAA CGAGATTGCG CCAACGATTG TAGGGGGATC CCTGAAGCAC GGCGGCCCAG ATCTTGGTCC CACGAGAGCA CGCCGCGCCT GGGAGGCACT CGGAGTGGAC GGGAAGGGGA TCGCGGACGA TGTACCGGAG CGTGAGTTCG TAGGTATGCC CCGTCTCACT GTTCGTATGG TCGCGCGCAT TCAGGGTTTT CCCGATGAAT GGCAGTTCGC GGGCAGGAAA ACGCAAGCGT ACCGCCAGGT TGGAAATGCT TTTCCGCCGC CTTTCGCTCG TGCAGTTGCG GAAAGCGTGA GTGCTTGCTT GTCGTCCGCA CGAAGGACAG TCCGAGTCAC CAGTGCTTAA
|
Protein sequence | MLKSRFMKRR DSDSAPFLLE VGYEAENPWR IAESQPPKGK SGLVTLELCA GAGGQALGLE QAGINHVALV EINKHACETL RLNRPNWKVV EGDLQTFDPS PYKGADIVSA GLPCPPFSVA GKQLGKLDER NLFPAMVNVV DAVRPRAVMV ENVRGILDAV FIDYREHVSK QLRKLGYTPG WHLMNACEFG VPQLRPRVVF VAMRKEYSEH FAWPRATNEP QTVGDVLFDL MSARGWKGVK AWRAKANEIA PTIVGGSLKH GGPDLGPTRA RRAWEALGVD GKGIADDVPE REFVGMPRLT VRMVARIQGF PDEWQFAGRK TQAYRQVGNA FPPPFARAVA ESVSACLSSA RRTVRVTSA
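The sequences above are displayed in space-separated blocks of ten characters, so they need light cleanup before analysis. The following is a minimal sketch of one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons, and that is ignored here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Only A, C, G, and T are handled; ambiguous bases raise KeyError.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence above.
print(translate("ATGCTAAAAT CCCGATTTAT G"))  # prints MLKSRFM
```

The printed residues match the first seven residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.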