Gene Msil_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1600 
Symbol 
ID7090957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1724866 
End bp1725945 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content69% 
IMG OID643464926 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_002361911 
Protein GI217977764 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.05241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGACG CCGCAATCGC AGACGCCGCA ATCCCAGACG CAGTTCTTGC CTGGTACGAC 
CGTCATCGCC GCGTTCTGCC CTGGCGCGCG CCGCCCGGCG CGGCGGCCGA CCCTTACGCC
GTCTGGCTCT CGGAAATCAT GCTGCAGCAG ACGACGGTCG CGGCGGTCAA ATCCTATTTC
TCCGCGTTTC TGGCGCGCTG GCCCAACGTC GACGCCCTCG CGCGGGCGCC GGCCGAGGAG
GTGATGCGGC AGTGGGCCGG GCTTGGCTAT TATTCGCGGG CGCGCAATTT GCACGCCTGC
GCCAAGACCG TGTCCGCAAA ATTTGGCGGA CAATTTCCGG ACGAGGAGGC GGCCCTGCGC
GCCCTGCCAG GTCTTGGCCC TTATACCGCC GCGGCGGTCG CCGCGATCGC CTTTTGCCGC
AAGGCCGCCG TCGTCGACGG CAATGTCGAG CGCGTCCTGT CGCGCCTCTA CGCGATCGAG
GCGCCACCGC CGGCGGGAAA ACGCCTGATC TACGCACGGG CCGAAGCGCT GACGCCGGCG
GAGCGTCCCG GCGATTATGC GCAGGCGATG ATGGATCTTG GCGCGACGAT CTGCACGCCG
AAAAGCCCCG CCTGCGCGAT CTGCCCCCTG AACGGAGCCT GCGCCGCGTT CAGGATCGGC
GATCCAGCGC GTTTTCCGGT GAAGGCCGCG AAACCGGAGC GGCCGCTCCG GCGAGGCGCC
GCCTTTTATG TGGCGCGGCC CGACGGCGCG GTCCTGGTGC GAACGCGCCC GCCGAAAGGG
CTGCTCGGCG GCATGACGGA GATCCCGGGC TCACCCTGGA CCGAGGATTT CGACGAGGCC
GGCGCGCCGC GCCATGCCCC GGTCGAGGCG CGCTATCGCC GGCTGGCGCG CCCGGTCGAG
CACAGTTTCA CGCATTTTGC CTTGCAGCTT TCGGTGTATG TGGGGGAGGC TGGGGCAAAC
ATGCCGGCGC CCGACGGTTG CCGCTGGGCG GCGGCCGATC TTGAGAATGA GGCGCTGCCG
ACTCTCATGC GCAAACTCGT CAGCGCGGCG AGGCGGCGGG AATTTGGGGG AGATCTGTGA
 
Protein sequence
MCDAAIADAA IPDAVLAWYD RHRRVLPWRA PPGAAADPYA VWLSEIMLQQ TTVAAVKSYF 
SAFLARWPNV DALARAPAEE VMRQWAGLGY YSRARNLHAC AKTVSAKFGG QFPDEEAALR
ALPGLGPYTA AAVAAIAFCR KAAVVDGNVE RVLSRLYAIE APPPAGKRLI YARAEALTPA
ERPGDYAQAM MDLGATICTP KSPACAICPL NGACAAFRIG DPARFPVKAA KPERPLRRGA
AFYVARPDGA VLVRTRPPKG LLGGMTEIPG SPWTEDFDEA GAPRHAPVEA RYRRLARPVE
HSFTHFALQL SVYVGEAGAN MPAPDGCRWA AADLENEALP TLMRKLVSAA RRREFGGDL