Gene Mmc1_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_0234 
Symbol 
ID4481683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp264386 
End bp265483 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content58% 
IMG OID639720980 
ProductA/G-specific DNA-adenine glycosylase 
Protein accessionYP_864167 
Protein GI117923550 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGC CCCATCTGCC CGCCGATCTG GCTCAACGTC TGCTGGCCTA CTATGATGAA 
TATGGACGGG ATCTGCCCTG GCGTCAGCAG CAGGATCTCT ACCGCATCTG GCTCTCAGAA
ATCATGTTGC AGCAGACTGG GGTGAAGACG GTTATGCCCT ATTATGAAAA GTTTTTATCC
CATTTTCCCA GCATTACACA GCTTGCAGCG GCCTCCCAGG AGCAGGTGTT GGCCCAGTGG
CAGGGCTTGG GTTACTATCG CCGGGCCCGC ATGCTGCACC AAGCGGCGCA ACAGGTGGTG
CAACAGCATG GTGGCCTTTT TCCTGAAGAG ATAACCCAAG TGCAGGCGCT ACCAGGCATC
GGTCCCAGCA CCGCAGCGGC CATTTTAGCC ATCGGGCGGA ACCAAGCCCA CACCATTTTG
GATGGCAATG TGATGCGGGT TTTGGCCCGC TTGCTAACGC TGGAGCTGCC CGTGGATAGC
ACCCCTGGCA AACAGCGGTT GTGGCAGGTA GCCCGCCAGC TTACCTCTCA GCAGCGCCCC
GGTGATTATG CACAGGCAAT CATGGATCTT GGGGCAACCC TCTGCACCCG TAGTCAACCG
GCCTGCTCAC GTTGCCCCTG GGGCGGCGCC TGCGCGGCGC GACAACACGG TAGCTGGGCA
GAATACCCAA AAAAGCGGGA GAAAAAACCA AAACCCCACC ACTACCAATG CATGTGGGTG
TTACTCGATA CACAGCAGCG CATCTTTTTA CGTAAACGCC CCCTAGAAGG GCTGTTAGGA
GGCCTGTGGG AACCCCTCGG CGAGCCTCTA CTCGAAACCC CGCCCCTGGG TAATTTGGTA
CAGCGGGCCA GCCATCACTT GACCGCCTTG GGGATCCAGG GCCAGCCGCT GCTTGAGGCG
CAGCCGGTCG ATCATATCTT TACCCATTTC CGCCTGACGG TTTATCCTAT TCTGGTGGTG
GCGGCCAGCG GTGCGCCCAT ACTCAACGAT GCCAACTGGT GGCCCCTTGC TCAACTGGAT
CAGCGCCCCA TCGCCACCTT GCATCGAAAA GTGAATGAGA ACGCTGTGGG TCTGGTGGAG
CTTAGCCTAA ATGGATAG
 
Protein sequence
MTLPHLPADL AQRLLAYYDE YGRDLPWRQQ QDLYRIWLSE IMLQQTGVKT VMPYYEKFLS 
HFPSITQLAA ASQEQVLAQW QGLGYYRRAR MLHQAAQQVV QQHGGLFPEE ITQVQALPGI
GPSTAAAILA IGRNQAHTIL DGNVMRVLAR LLTLELPVDS TPGKQRLWQV ARQLTSQQRP
GDYAQAIMDL GATLCTRSQP ACSRCPWGGA CAARQHGSWA EYPKKREKKP KPHHYQCMWV
LLDTQQRIFL RKRPLEGLLG GLWEPLGEPL LETPPLGNLV QRASHHLTAL GIQGQPLLEA
QPVDHIFTHF RLTVYPILVV AASGAPILND ANWWPLAQLD QRPIATLHRK VNENAVGLVE
LSLNG