Gene GM21_0494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0494 
Symbol 
ID8135803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp609132 
End bp610262 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content66% 
IMG OID644868112 
Productputative RNA methylase 
Protein accessionYP_003020332 
Protein GI253699143 
COG category[L] Replication, recombination and repair 
COG ID[COG0116] Predicted N6-adenine-specific DNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.00380676 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCAGG AGCGTTTTTT CGCCACCACT GCAAAAGGGG TGGAAGAGGT ACTCGCCGCC 
GAGTTGACGC GGCTTGGGGC AAGCGACGTG GTCGTTGACA GCGGCGGGGT CCGTTTCGGC
GGGGGGATGG AAGCGGCCTA CCGGGCCAAC CTCTGGCTCA GGAGCGCGAG CCGGGTGCTG
ATGCCGCTGG GGGAGTTTCC TTGCGAGACG CCGGAGCAGC TCTACCAGGG GGTGCGGGGG
CTCTTCTGGG TCAACTATTT GACGCCGGCG ATGACGCTGG CGGTCGACTG CAGCCTGAGA
GACTCGGCGC TCACCCATTC AGGATTCGTG GCGCTGAAGG CGAAAGACGC CATCGTAGAT
GTCTTGCGCG ACCATTTCGG CAGCCGCCCC AACGTCGACA CCAAGGACCC TGACCTGCGG
GTGAACCTGC GGCTGTTCCG CAACCGCTGC ACGGTGAGTC TCGACTGCTC GGGTATGCCG
CTGGACCGGC GCGGTTACCG CTTGGACCGG CATGAGGCGC CGCTCAAGGA GAACCTGGCC
GCGGCCCTGG TCGAGCTTTC CGGGTGGGAC GGCGCCACTC CCCTCATCGA CCCCATGTGC
GGCACCGGCA CCATCGTCAT CGAGGCGGCC ATGAAGGCGC TGCGCATCCC CCCCGGCCTG
TCGCGCCAGG GGTTCGGCTT CCAGCGCTGG AAAGGTTTCG ATCACGCGCT CTGGGAGCGT
GTCGTCTCCG AGGCGCGCAG CGGCATCCTT TCTTCCCTTC CCGCGCCGGT GCAGGGGACG
GACATCTCCC ACTCGGCCGT GGGTATGGCC GCCCAGAACG CCAAACGCGC AGGCGTATTG
GAGCAGATCT CGCTTGGGCG GCAGCAGCTC TCCGAGCTCG CCCCCCCGCC GGGGCCGGGC
GTCGTCATCC TGAATCCCCC CTACGGCAAG AGGCTGGGGG AGGAGGAGGC TCTCCGGCCG
CTCTACAAGG AGATCGGCGA CGTGCTGAAA AAACGCTGCA AGGGATATAC GGCCTACCTC
TTCACCGGAA ACCTGGAGCT CGCCAAGTCC GTGGGGCTGA AGGCCACCAG GCGCATCGTG
CTCTACAACG GGCCGATCGA GTGCAGGCTG CTCAAGTACG AGATGTATTA G
 
Protein sequence
MNQERFFATT AKGVEEVLAA ELTRLGASDV VVDSGGVRFG GGMEAAYRAN LWLRSASRVL 
MPLGEFPCET PEQLYQGVRG LFWVNYLTPA MTLAVDCSLR DSALTHSGFV ALKAKDAIVD
VLRDHFGSRP NVDTKDPDLR VNLRLFRNRC TVSLDCSGMP LDRRGYRLDR HEAPLKENLA
AALVELSGWD GATPLIDPMC GTGTIVIEAA MKALRIPPGL SRQGFGFQRW KGFDHALWER
VVSEARSGIL SSLPAPVQGT DISHSAVGMA AQNAKRAGVL EQISLGRQQL SELAPPPGPG
VVILNPPYGK RLGEEEALRP LYKEIGDVLK KRCKGYTAYL FTGNLELAKS VGLKATRRIV
LYNGPIECRL LKYEMY