Gene Information |
Locus tag | Acid345_1093 |
Symbol | |
ID | 4069553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1370411 |
End bp | 1371271 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637983102 |
Product | DNA adenine methylase |
Protein accession | YP_590170 |
Protein GI | 94968122 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0338] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00571] DNA adenine methylase (dam) |
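
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not translated. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1370411
end_bp = 1371271
gene_length_bp = 861
protein_length_aa = 286

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that yields no amino acid.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold, so the reported lengths agree with the reported coordinates.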
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.118835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000226621 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCATGA CAATCGGCTT GGTGTGCCCA AACGAATCGC CGGAAAAACG CGGGGTGAAG CCTTTCTTGC GATGGGCCGG CAGCAAACGA AAGCAGCTTT CGCGTCTGGC GCCATACTGG TCCGGGGAGC ACAAGCGTTA TGTCGAGCCA TTTGCGGGTT CTGCTTGTCT TTTCTTCGAC CTTGCTCCAC CCTCCGCGGT CCTCGGCGAT AGTAATGAGG AACTGATTCA GCTTTACCGC GTTGTTCGCG ATTCACCAGA GCGGTTGCAT CGGCGGCTTT GTCGAATTAA GCGAGACGTT GCTACATATC TTCGGTGGCG TAACCAAAAC CCGACGACCC TAGATCCCGA GACGCGCGCA CTAAGATTTC TCTATCTAAA TCGGAACTGC TTTAATGGGA TTTATCGTAC AAATCTTCAG GGCGGATTCA ACGTACCCAT GGGGAAGCGA GTCGGTGAGT ATTTTAGTAA AGAAGATCTA TTACGCTGCT CAAGGCTACT GCAGACTGCC GAACTAGTCG CTGGTGACTT CTCGGCGACC TTGGATCTTG TGAAGGTGGG TGATTTTGTC TATCTGGATC CACCTTACGC GGTGTTATCT CGCCGAATTT TCAAGGAGTA CGGAAGCAAG CTATTCGGCA CCGAAGATAT GCCTCGGTTC GAGGAGAGCC TTCACGCGAT TGTGGGTCGG GGTGCTGACT TTCTTGTGTC CTATGCAGAT TGCAAAGAAG CGAGAGCGCT CGCGCGGAAG TGGTGTTCGG TTCGACTTCC TGTAAAACGG CACGTCGCAG GCTTCGCCGG CGACCGAAAG ACCGCGTACG AATGGGTGAT CAGCAATCGG CAGCGATCAA CAACGGCTTA G
|
Protein sequence | MTMTIGLVCP NESPEKRGVK PFLRWAGSKR KQLSRLAPYW SGEHKRYVEP FAGSACLFFD LAPPSAVLGD SNEELIQLYR VVRDSPERLH RRLCRIKRDV ATYLRWRNQN PTTLDPETRA LRFLYLNRNC FNGIYRTNLQ GGFNVPMGKR VGEYFSKEDL LRCSRLLQTA ELVAGDFSAT LDLVKVGDFV YLDPPYAVLS RRIFKEYGSK LFGTEDMPRF EESLHAIVGR GADFLVSYAD CKEARALARK WCSVRLPVKR HVAGFAGDRK TAYEWVISNR QRSTTA
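
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be normalized and sanity-checked before use. The function names are hypothetical, and the short string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(field: str) -> str:
    """Remove the whitespace that separates the ten-character display blocks."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a small example.
fragment = clean_sequence("ATGACCATGA CAATCGGCTT")

print(len(fragment))               # number of bases after removing spaces
print(fragment.startswith("ATG"))  # does the fragment begin with a start codon?
print(gc_fraction(fragment))       # GC fraction of this fragment only
```

Applied to the full Gene sequence field, the cleaned length would be expected to match the Gene Length value, and the GC fraction to be close to the reported GC content.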