Gene Sfum_2733 details


Gene Information

Locus tag: Sfum_2733
Symbol:
ID: 4458927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 3373646
End bp: 3374812
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 61%
IMG OID: 639703504
Product: A/G-specific adenine glycosylase
Protein accession: YP_846846
Protein GI: 116750159
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR00586] mutator mutT protein
            [TIGR01084] A/G-specific adenine glycosylase
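
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields above.
START_BP = 3373646
END_BP = 3374812
GENE_LENGTH_BP = 1167
PROTEIN_LENGTH_AA = 388


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the interval spans 1167 bp, which is 389 codons, or 388 amino acids plus a stop codon.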


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.610696
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGCAGC CAAAAACCCG TCTCCAAATT CAAACGCTCC TGCTTTCCTG GTTCGATGAG 
AACCAGAGGC CTCTTCCATG GCGGGAGAAA TACCGTCCAT ACGAGATTTG GATCTCCGAA
ATCATGCTCC AGCAAACGCA GGTCAAGACG ATGCTTCCCT ACTTCCGCCG CTGGATGGAG
CGTTTTCCGG ACGTGCAGTC GATAGCGGAC GCTCGGGAGG ATGAGGTCCT GAAGCACTGG
GAGGGACTGG GGTACTATTC TCGGGCCGTC AACATCCGCC GGACCGCCGA AATCATCGTC
AGGCACCACG GCGGGACTTT TCCGAAAGCG CACAGCACCA TTCTCGGCAT GCCGGGCATC
GGCCCTTACA CGGCGGGCGC CATTTCCAGC ATTGCGTTCA ACGAAGACCG CCCATTGGTT
GACGGCAACG TGGAGCGAAT CCTTGCCCGT CTTTTCAACC TCGATACACC CGTCGAGGAA
AAGAATACGC GGAAGTTCAT ATGGAACACG GCCGAAGAGC TCATTCCCGC AGGCCGGGCG
CGGCAATTCA ACCAGGCGTT GATGGACCTG GGAGCGACCG TGTGCCTGCC CCGCCGGCCC
GCCTGCGAAA AATGTCCCTT GAACGGCCTT TGTGAGAGCC GTCGCATGGG GACGGCGGAT
CGACGGCCTG TCACCAACAG GCGCAAGGAT ATCGCCTCCA TCGAAGTCGC CGTCGGAATC
CTGCACCACA GGGGGAGAGT GCTCATCCAG AAGCGGCCCG CCTCGGGACT GATGCCCAAC
CTGTGGGAGT TTCCGGGAGG CAAGATTCAC CCGGGCGAAT CTCCCGAACA GGCGCTGATC
CGGGAATTCC GGGAGGAATT GGAGCTGGAG GTCCGTTGCC GCGAGCGGCT CGCTTCGATC
AGGCACAACT ACACGTCCTT CAGGGTTCTC CTGCACGCAT TCCTGTGCAG GCCGGCGGAT
TCCCGTCCGA GACCCGTTCT TCGCAGTGCG GTCGAAGCGC GATGGGTCGT GGTGGAGGAA
CTCGACCAAT ACGCCTTTCC CGCGGCCAAT CGAAAGCTGA TCGACCTGGT GTCCGGGAGA
AAGCCGGCGG CGGCCCGAAA TCATCGTCCG GAGCCTCGGA ACGACGACGC AGAGGACGGA
GCGAACGTCG TCCTGCTGAA GAAATGA
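
A GC content figure like the one reported for this gene is typically the fraction of G and C bases in the sequence. The sketch below shows one way to compute it from a sequence printed in space-separated blocks of ten, as above; the function name `gc_content` is illustrative, and the record does not state how its own value was calculated or rounded.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    # Remove all whitespace so blocked, multi-line sequences can be pasted in.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence above.
    print(gc_content("GTGCTGCAGC CAAAAACCCG"))
```

Pasting the full gene sequence into `gc_content` gives a value that can be compared against the GC content field.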
 
Protein sequence
MLQPKTRLQI QTLLLSWFDE NQRPLPWREK YRPYEIWISE IMLQQTQVKT MLPYFRRWME 
RFPDVQSIAD AREDEVLKHW EGLGYYSRAV NIRRTAEIIV RHHGGTFPKA HSTILGMPGI
GPYTAGAISS IAFNEDRPLV DGNVERILAR LFNLDTPVEE KNTRKFIWNT AEELIPAGRA
RQFNQALMDL GATVCLPRRP ACEKCPLNGL CESRRMGTAD RRPVTNRRKD IASIEVAVGI
LHHRGRVLIQ KRPASGLMPN LWEFPGGKIH PGESPEQALI REFREELELE VRCRERLASI
RHNYTSFRVL LHAFLCRPAD SRPRPVLRSA VEARWVVVEE LDQYAFPAAN RKLIDLVSGR
KPAAARNHRP EPRNDDAEDG ANVVLLKK
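
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the standard codon assignments (which translation table 11 shares for non-start positions) and assumes that an initial ATG, GTG, or TTG is read as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of bacterial start codons, read as Met in first position.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an alternative start codon still encodes Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence above.
    print(translate("GTGCTGCAGC CAAAAACCCG TCTCCAAATT"))
```

The gene begins with GTG rather than ATG, which is why the start-codon rule matters here: the first ten codons translate to MLQPKTRLQI, matching the start of the protein sequence.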