Gene Sfum_0798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0798 
Symbol 
ID4460451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp985190 
End bp986650 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content61% 
IMG OID639701560 
ProductSmr protein/MutS2 
Protein accessionYP_844931 
Protein GI116748244 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.475596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAA ACTACGATCA TCCTACCATA GCATGGTACA TAACGCCACA TGGGTTCGGA 
CACGCCGTGC GCTCCCTGGA AGTGATCCGC CGCTTCATGT CGCGGGTTCC CGAAGTGCGG
CTCACCATCG TGTCGGATCT TCCCGATTTT CTCATCGAGC AGCATCTCGG GATGTTGCCT
TCCGTGAGGC GAAAACGACT CGATATCGGC ATGGTTCAAA AAGACAGCCT GCGTTTCGAT
CTGGACGCCA CTCGAACGAC GCTCGAGAAG CTGCGGCGGT CGCACGACCG CCTGGTGGAC
GAGGAGAAAA GGTTTCTTCT GGAGCTCGGA GCCGATGTCA TGGTCTCCGA TATTGCGTTC
ATTCCGTTTT ATGCGGCCGA TCATGCCGGC ATTCCCGGCA TTGGAATGGG GAATTTCACC
TGGGACTGGA TATACGCATC GTACAGAGAC ACGGACGAGC GCTGGGACCC GTTGATCGAG
TGGTGCAGGG GCGGCTACCG CTTGTGCGAC CTGCTGCTTC GCCTCCCCAT GCACGGGGAC
TGCTCATCCT GTCCCCGGAT CACGGACGTG CCCCTGGTCG CCCGAAAGCC CTCCCGCACG
AGGCGCGAAA GCAGGGCGAT CCTTGGCTGC GAGAAAAACC GGAAAGCCTA CCTGATTTCC
TTTTCCGAGC TGCACCTGGA TGAGTCCGCC CTGCGCCGTC TGGAAAGAAT GAACAACGCG
CTTTTCTTTT TCAAGCGACC GCTCCGCTTC GATTTTGCCA ACGGGCGGTC CATCGACGGC
CTGGAACTTT CATACGTCGA TGTCGTGGGG GCAATGGACG CGGTGATCAC CAAACCCGGG
TACGGGGTCG TATCGGACTG CCTGGCCACC GGGACGCCGA TGATCTACAC CGACCGGGGC
CCGTTCCCTG AATACGACAT TCTTGTCCGG GAGATGAAGC GACACCTGAA CACGGTTTAT
CTCAGCTCCG AGGACTTCTG CCGGGGCGCG TGGGAATCCG CCGTGGAACA GATCGAAGGG
ATGCCGCCGC GCACGGTTTC CATGCGTCTG GACGGCGCAG ACGTCTGCGT CGGCTTGATA
CTGGACTGCC TTGCCGGGCG CGAAGACGTG AAGGCGGGGA CTGTCGGGAG CCGGGATGGA
GCCGGGCGGC GGCATGCCGG AATGGAACCG GACTTGGAAT CCCCGGTGGT GGTTCCCGTT
GAGGATTCCA TTGATTTGCA CACCTACAGA CCCGCCGACA TCCGGGACCT CCTGGACGAC
TACCTCGAAG CGGCACGGGA AAAAGGGTTC GAACAAGTAA GGATCATTCA CGGAAAAGGA
ACGGGGGCGC TTCGGGCGAT GGTTCAATCC ATCCTCCGAA GACACCCCCT GGTTTTGTCT
TTCCGGGAAG CCGACGGTCC TGGCGGCGGG TGGGGCGCAA CCCTGGCAAC GCTCCGCCCC
GCCGAACGGA ACGGGTGCTA G
 
Protein sequence
MSENYDHPTI AWYITPHGFG HAVRSLEVIR RFMSRVPEVR LTIVSDLPDF LIEQHLGMLP 
SVRRKRLDIG MVQKDSLRFD LDATRTTLEK LRRSHDRLVD EEKRFLLELG ADVMVSDIAF
IPFYAADHAG IPGIGMGNFT WDWIYASYRD TDERWDPLIE WCRGGYRLCD LLLRLPMHGD
CSSCPRITDV PLVARKPSRT RRESRAILGC EKNRKAYLIS FSELHLDESA LRRLERMNNA
LFFFKRPLRF DFANGRSIDG LELSYVDVVG AMDAVITKPG YGVVSDCLAT GTPMIYTDRG
PFPEYDILVR EMKRHLNTVY LSSEDFCRGA WESAVEQIEG MPPRTVSMRL DGADVCVGLI
LDCLAGREDV KAGTVGSRDG AGRRHAGMEP DLESPVVVPV EDSIDLHTYR PADIRDLLDD
YLEAAREKGF EQVRIIHGKG TGALRAMVQS ILRRHPLVLS FREADGPGGG WGATLATLRP
AERNGC