Gene Mlg_2695 details

Gene Information

Locus tag: Mlg_2695
Symbol:
ID: 4269938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3056996
End bp: 3058081
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 73%
IMG OID: 638127455
Product: A/G-specific DNA-adenine glycosylase
Protein accession: YP_743525
Protein GI: 114321842
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
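
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention explicitly.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3056996
end_bp = 3058081

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1086 361
```

Under those assumptions the computed values agree with the listed Gene Length (1086 bp) and Protein Length (361 aa).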


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.375064
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACC GCCCCGGCGC ACCGGGCACT GAGGAGGAGG CGGTCAATCG GACCCGGGCG 
GCGATCCTGG CCTGGTTCGA CCGCCACGGC CGCCACGACC TGCCCTGGCA GCATCCGGCC
ACCCCCTACC GGGTGTGGGT CTCGGAGGTG ATGCTGCAAC AGACCCAGGT GGCCACCGTG
GTGCCCTACT TCCACCGTTT CATGCGCCGA TTCCCCAGCC CGCGTGCGCT GGCGGACGCA
CCACAGGAGG AGGTGCTGGC GCTCTGGGCC GGGCTGGGTT ACTACGCCCG CGCCCGCAAC
CTGCACCGGG CCGCGCAACA CATCCGCGAT CAATACGGCG GGGAACTGCC CGCAGACCTG
GACGCCCTGG AGGCCCTCCC CGGCATCGGC CGCTCCACCG CCGGCGCCAT CCACTCCCTC
GGCCAGGGGC GCCGGGCGGT CATCCTGGAT GGCAACGTCA AGCGGGTGCT GGCCCGCTGG
CATGCGGTGG ACGGCTGGCC CGGCCGGACC GCCGTCGCCC GCCGGCTGTG GGCGCTCGCC
GAGCACTACA CCCCGGCCCA CCGCTGCGCC GACTACAACC AGGCCATGAT GGACCTGGGC
GCTACCGTCT GCACCCGGCG CACCCCCCGC TGCCATGAGT GCCCACTGCA GGCCCGATGC
GCCGGCCACG CCAGCGGCCG GCCGGAGGCC TGGCCCACCC CGAAACCCAA GCGCCGGCGC
CCGCTGCGCC AGACCCGCAT GCTCATTCTC CAGCACGGCG ACCGGGTGCT GCTGCAGCGC
CGCCCCCCGA GCGGCGTCTG GGGCGGCCTC TGGAGCTTGC CCGAGGCGGC CGTGGACGCC
GACCCGAAGA GCGCGGCGGC CGCGCTCGGC CTCAAGGTCG ACCAGGCCGG CCACTGGCCG
CCCCTGCGCC ACGCCTTCAG CCACTTTGAA CTGGACATCC ACCCGATTCA CCTGCGGGTT
TCCGGGGCGG GCCAAGCGGT GAAGGAGAGT GATACACTTT GGCAATCCAT TCATGACACC
GGCGCCCGGG CGGTGGCCGC CCCGGTGGCC CGGTTACTGG AACGACTCAG GGAGTACACA
CCATGA
 
Protein sequence
MADRPGAPGT EEEAVNRTRA AILAWFDRHG RHDLPWQHPA TPYRVWVSEV MLQQTQVATV 
VPYFHRFMRR FPSPRALADA PQEEVLALWA GLGYYARARN LHRAAQHIRD QYGGELPADL
DALEALPGIG RSTAGAIHSL GQGRRAVILD GNVKRVLARW HAVDGWPGRT AVARRLWALA
EHYTPAHRCA DYNQAMMDLG ATVCTRRTPR CHECPLQARC AGHASGRPEA WPTPKPKRRR
PLRQTRMLIL QHGDRVLLQR RPPSGVWGGL WSLPEAAVDA DPKSAAAALG LKVDQAGHWP
PLRHAFSHFE LDIHPIHLRV SGAGQAVKES DTLWQSIHDT GARAVAAPVA RLLERLREYT
P
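
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own pipeline. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, used on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons. Only short excerpts of the sequence are used here.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpts copied from the gene sequence: the first three blocks and the final block.
first_blocks = "ATGGCGGACC GCCCCGGCGC ACCGGGCACT"
last_block = "CCATGA"

print(translate(first_blocks))  # MADRPGAPGT
print(translate(last_block))    # P
```

The first excerpt reproduces the opening ten residues of the protein sequence, and the final block gives the terminal P followed by the TGA stop codon. Applying `gc_fraction` to the full gene sequence would allow a comparison with the listed GC content.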