Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0494 |
Symbol | |
ID | 8135803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 609132 |
End bp | 610262 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644868112 |
Product | putative RNA methylase |
Protein accession | YP_003020332 |
Protein GI | 253699143 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0116] Predicted N6-adenine-specific DNA methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.00380676 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCAGG AGCGTTTTTT CGCCACCACT GCAAAAGGGG TGGAAGAGGT ACTCGCCGCC GAGTTGACGC GGCTTGGGGC AAGCGACGTG GTCGTTGACA GCGGCGGGGT CCGTTTCGGC GGGGGGATGG AAGCGGCCTA CCGGGCCAAC CTCTGGCTCA GGAGCGCGAG CCGGGTGCTG ATGCCGCTGG GGGAGTTTCC TTGCGAGACG CCGGAGCAGC TCTACCAGGG GGTGCGGGGG CTCTTCTGGG TCAACTATTT GACGCCGGCG ATGACGCTGG CGGTCGACTG CAGCCTGAGA GACTCGGCGC TCACCCATTC AGGATTCGTG GCGCTGAAGG CGAAAGACGC CATCGTAGAT GTCTTGCGCG ACCATTTCGG CAGCCGCCCC AACGTCGACA CCAAGGACCC TGACCTGCGG GTGAACCTGC GGCTGTTCCG CAACCGCTGC ACGGTGAGTC TCGACTGCTC GGGTATGCCG CTGGACCGGC GCGGTTACCG CTTGGACCGG CATGAGGCGC CGCTCAAGGA GAACCTGGCC GCGGCCCTGG TCGAGCTTTC CGGGTGGGAC GGCGCCACTC CCCTCATCGA CCCCATGTGC GGCACCGGCA CCATCGTCAT CGAGGCGGCC ATGAAGGCGC TGCGCATCCC CCCCGGCCTG TCGCGCCAGG GGTTCGGCTT CCAGCGCTGG AAAGGTTTCG ATCACGCGCT CTGGGAGCGT GTCGTCTCCG AGGCGCGCAG CGGCATCCTT TCTTCCCTTC CCGCGCCGGT GCAGGGGACG GACATCTCCC ACTCGGCCGT GGGTATGGCC GCCCAGAACG CCAAACGCGC AGGCGTATTG GAGCAGATCT CGCTTGGGCG GCAGCAGCTC TCCGAGCTCG CCCCCCCGCC GGGGCCGGGC GTCGTCATCC TGAATCCCCC CTACGGCAAG AGGCTGGGGG AGGAGGAGGC TCTCCGGCCG CTCTACAAGG AGATCGGCGA CGTGCTGAAA AAACGCTGCA AGGGATATAC GGCCTACCTC TTCACCGGAA ACCTGGAGCT CGCCAAGTCC GTGGGGCTGA AGGCCACCAG GCGCATCGTG CTCTACAACG GGCCGATCGA GTGCAGGCTG CTCAAGTACG AGATGTATTA G
|
Protein sequence | MNQERFFATT AKGVEEVLAA ELTRLGASDV VVDSGGVRFG GGMEAAYRAN LWLRSASRVL MPLGEFPCET PEQLYQGVRG LFWVNYLTPA MTLAVDCSLR DSALTHSGFV ALKAKDAIVD VLRDHFGSRP NVDTKDPDLR VNLRLFRNRC TVSLDCSGMP LDRRGYRLDR HEAPLKENLA AALVELSGWD GATPLIDPMC GTGTIVIEAA MKALRIPPGL SRQGFGFQRW KGFDHALWER VVSEARSGIL SSLPAPVQGT DISHSAVGMA AQNAKRAGVL EQISLGRQQL SELAPPPGPG VVILNPPYGK RLGEEEALRP LYKEIGDVLK KRCKGYTAYL FTGNLELAKS VGLKATRRIV LYNGPIECRL LKYEMY
|
| |