Gene Sfum_1919 details


Gene Information

Locus tag: Sfum_1919
Symbol:
ID: 4459756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 2338674
End bp: 2340686
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 69%
IMG OID: 639702686
Product: DNA mismatch repair protein MutL
Protein accession: YP_846039
Protein GI: 116749352
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
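
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; neither assumption is stated in the record itself. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(2338674, 2340686)
residues = protein_length(length)
print(length, residues)  # 2013 670
```

Under these assumptions the computed values agree with the reported gene length (2013 bp) and protein length (670 aa).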


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.770906
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00107591
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCAGAA TCACGATTTT GCCCGATATC CTGTGCAACC AGATCGCCGC CGGAGAAGTG 
GTGGAGCGGC CGGCCGCCGT CGCCAAGGAG CTCCTCGAGA ACAGCATCGA CGCCGGCGCC
CGAAGGATCT CCCTGTCCAT CGCCGACGGA GGCCGAAAAG AGATCCGGGT GGTGGACAAC
GGCTCGGGCA TGCACCCCGA CGATGCGCTC CTGGCCCTGG AACGCCACGC CACCAGCAAG
ATCCGGTCCA TCGAGGACCT GCAGGCGATC GGTTCCCTGG GGTTTCGCGG CGAAGCCCTT
CCCAGTATCG CCGCGGTGAG CCGCTTCGAA CTGGTGACGC GCGAACCCGA TGCCGTCGCG
GGGACGTTCA TCCGGGTCGA AGGCGGCGTG GTGCGCGAGG TCCGCGAAAC CGGGTCCCCC
GCGGGAACCA GGATCACCGT TCGCGACCTT TTCTACAACG TGCCCGCTCG ACGCAAATTC
CTGCGCGCCG CGGACACCGA AACCGCATAC ATCTGCGACC AGTTCCAGCG GCTGGCCATG
GCTCACCACG CCGTCCATTT TCAGCTCATC AACCGGGAAC GCACCCAATA CGACTTCCCC
GGCGCGGCCT CGCCCGAAGA GCGGGCCGGG CAGGTTCTCG GCGCCGAGAC CCTCAAGCGC
GCCATCCCCT TTTGCGTGGA AAACGCGTCC GCCAGGCTCC GAGGCATGGT CGGCACACCC
GACCTGCAGC GGGCCAACAG CCATTCCCTC TTCGTTTTCG TGAACGGCCG GCCGGTCTGG
GACCGCGCCG TCAACCGGGC GATCCTCGCG GCCTTCGAGA GCCTCATCCC GCGGGGCAAG
TTCCCCGTCG CGGTGCTCTT CCTCGAGCTC GATCCCCTCC ATGTGGACGT CAACGTTCAC
CCCACCAAGC GCGAAGTCCG GTTCAAGCAC CCCGGAGGCG TCATCGACAC CGTGCGCGGG
GCCATCCGCG ACGCTCTGTG CCACCTCAGG CCGCTCCACG GCTCCGCCGC TGCCGCACCC
CGTCCCTTCT CCGAAACGGC GGACCAGCGG GCTTTCCGCG ATTCCCTGGT GAGGGAAGGC
CAATTGTCCT TCGACCGCGG CCGTCCCCTC TCGCGCCCGC CAGGCTTCCC GTCCGAGCGT
TGGCGCGAAA GGCACCGGCC CGACGCCGAA CCGCCGTACC CGCTCTTGCG CGAGCCGGCG
CCGACGGAGA ATCCCCGCCG CGAGGCCGGA TCTCCGCCCG CAGCCCCCGC CGATTCACTC
TTCGACGAAG GCGCGGCGCC GCAGCCCGAC AATCCCGACA CCGACTTTTT TGCCGAACCG
AAGCGGGCGG CCGGCGGGCC GGCCTCGACC CATGCGCCCG TCACGGTCGA TACGGCGGCC
TTCGCGGACG CCTTCCAGGC CTTCGAAGCC GCGACACACC TCCATGCCGG CGATGTCCCG
GCTCTTGCCG AGCTTCCCGT CATCGGCCAG CTCGCCAACA CCTACATCCT GCTCGAAGCC
CCCGACGGGC TGATCCTCAT CGACCAGCAC GCGGCTCACG AGCGCATCAT CTTCGACGCC
CTCTCCTTTC CGGCCGGCGG TCCGGCCCGG CAGAGGCTGA TACGCCCGGC CGTCATCGAT
CTCCCCCCGC GCGATGCGGC CATGCTCCGC CGCTGGCTGC CGCTGCTCGA GGAAATCGGC
GTCGAAATCG AATCCTTCGG CGGCGACTCC TTCGTCGTGC ACGCCGTCCC GGCACCCCTT
GGCGAATGCC CGCCCGAGGG GCTGGTCCGC GAGTTGCTCG CCTCGGCCAT CGAAGGCGAT
GACGCCCCGC GCTGGAACGT CCTCGGCCGC CTGGCCAAGA CCGCCGCCTG CCACCGCGCC
GTGAGGGCGG GCCAGCGGCT GAGACCCGAG GAAATCCGGC TCCTCCTGGA AGGGCTCGAC
CGTACCCGGT TCGCTTCCAC CTGCCCGCAC GGCCGCCCGG TCTGGTACAA GATGACCCTC
TCCGACGTCG CCAGGCTCTT CCAGCGCACA TGA
 
Protein sequence
MARITILPDI LCNQIAAGEV VERPAAVAKE LLENSIDAGA RRISLSIADG GRKEIRVVDN 
GSGMHPDDAL LALERHATSK IRSIEDLQAI GSLGFRGEAL PSIAAVSRFE LVTREPDAVA
GTFIRVEGGV VREVRETGSP AGTRITVRDL FYNVPARRKF LRAADTETAY ICDQFQRLAM
AHHAVHFQLI NRERTQYDFP GAASPEERAG QVLGAETLKR AIPFCVENAS ARLRGMVGTP
DLQRANSHSL FVFVNGRPVW DRAVNRAILA AFESLIPRGK FPVAVLFLEL DPLHVDVNVH
PTKREVRFKH PGGVIDTVRG AIRDALCHLR PLHGSAAAAP RPFSETADQR AFRDSLVREG
QLSFDRGRPL SRPPGFPSER WRERHRPDAE PPYPLLREPA PTENPRREAG SPPAAPADSL
FDEGAAPQPD NPDTDFFAEP KRAAGGPAST HAPVTVDTAA FADAFQAFEA ATHLHAGDVP
ALAELPVIGQ LANTYILLEA PDGLILIDQH AAHERIIFDA LSFPAGGPAR QRLIRPAVID
LPPRDAAMLR RWLPLLEEIG VEIESFGGDS FVVHAVPAPL GECPPEGLVR ELLASAIEGD
DAPRWNVLGR LAKTAACHRA VRAGQRLRPE EIRLLLEGLD RTRFASTCPH GRPVWYKMTL
SDVARLFQRT
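
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a block could be cleaned up and how a GC percentage like the one reported under Gene Information might be recomputed, the Python below strips the whitespace and counts G and C bases. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the gene sequence is used here as sample input; the exact method the database used to compute its GC figure is not stated in this record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence from this record, used as sample input.
first_row = "ATGGCCAGAA TCACGATTTT GCCCGATATC CTGTGCAACC AGATCGCCGC CGGAGAAGTG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions would give a figure to compare against the reported GC content.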