Gene Sfum_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3872 
Symbol 
ID4457797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4724187 
End bp4725848 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content63% 
IMG OID639704645 
Productphosphoesterase domain-containing protein 
Protein accessionYP_847976 
Protein GI116751289 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0907106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGAC CGCAATGGGT GATCTCCGCG CAGGAAAAAC CGGAATCGGT CCGTGACGTG 
GTGCGGATTC TGCTGGCCAA CCGGGGAGCC GGGCCGGCAT TTCTCGACGG GACCCTCAAA
GAGCTGGAGA CCTGTCTTGC CATTCGAGGC ATGGATGCCG GGGCCGAGCT CATGGCCGAA
CACCTCGCCG CCGGAACCAG GATCGTGCTC GTGGGCGACT ACGATTGCGA CGGGATCACC
TCTCTGGCCC AGGTTTCCCT GTTCCTGCGA GAGATCGGCT ACGGGAATTT CGAAACGGTG
GTCCCCCGGC GCTCGGAAGG CTACGGGATT CCGGAGAGGG CGATTCTCGA CCACCCCGAC
GCGGGGCTCT ACGTCGCCAT GGACTGCGGG ACTCTGGATC GCAAGGCCGT GGGAATGGCC
CGCGAGAGGG GGGCCGATTG CATCGTGATC GATCATCATG AAGTGCCCGA TCAGGGATTG
GCCCCGGCGA CGGTACTCAT CAACCCGAAA CACTCGGAAT GTGAATCGAC CTTCAAGGAG
TTCTGCTCAT CGGGCCTCAC GCTCCTTTTC CTGGCAAGGC TTCGCCGGGC ATTGAGGGGT
TTCCCCAAGC CTGCGCTGGG CGGAAGATTT CTCATTCTTG CCGCCATAGC GACGGTTGCC
GATATGGTGC CCCTGGTGGA AGGCAATCGA ATCCTCGTAC GCAGCGGCCT CAACTGCGCG
AACCAGGTTG ACTATCCACC CCTGGACCAA CTGGTTCAAA AAGCGGGGCT TTCAGGGAAG
TCCATCACGG CCGGGCACCT GGGGTATTAT ATCGGCCCGA GAATCAATGC GGCCGGGCGA
ATGGCGGACG CTCGCACCGC CTACGAACTG CTGGTCGAGG ACGATCCTTC GGCTGCCGCT
CGACTGGCCG AGCAGCTCAA TCGGTACAAT ACGCAGAGGC AGAACCAGGA AGACGCCGTA
GTGGGCGAAA TCAGGCAGCG ACTCGCGGAC CGGAAGGTTT TCGGGAGGAC GCTGGTGATG
GCCGATGCCG GCTGGCCGGC GGGAATCATC GGGATCGTCG CATCGAGAGT CCAGCAGGAA
TTTCACTACG GTCCGGTGGT CGTGTTTTCC GTCGATGAGG CGGAGGGGAT CGCTCGGGGC
TCCGCCAGGA GTATTCCGGG GTTCGATATC CACTCGGCGT TGGCTTCCTG TGACGACGTG
ATGCTGCGTT GGGGAGGCCA TAAGATGGCG GCGGGCATGA CCATCGCCCT GGACCGAATG
GACGAATTCG CACACCGGTT CGAGGAGAGC GCCCTGCATT GGCCGGCCCA CGTCTTCCAG
CCCCGCGGCA GGGTGGACGC GGAGCTCGAC ACGGGTTTGA TTTCCGTCGA CCTCTACCGG
GAACTGACGA AACTGGAACC GCACGGTCTG GGCAACCCGA CGCCGACCTT CGCTTCACGG
GGAGTCAGAG TGCAGGTCAA GAAGACGTTC GGCCGGGACG CAAGCCACCT GCGCCTCCAG
CTCGACCAGC GCATCGGCGG AGTGTTCTGG CGCGGGGCAA GGCATTTCCG ATCGGCTGGA
TTGAGAGACG GCGAAACGAT GGATGTGGTC TACCAGTTGG ATTGGGACGA CTACGCCGGT
CGCCCCGTCA TGCAGGTGAG GGATGCCGGG CGCCTGTTCT GA
 
Protein sequence
MIRPQWVISA QEKPESVRDV VRILLANRGA GPAFLDGTLK ELETCLAIRG MDAGAELMAE 
HLAAGTRIVL VGDYDCDGIT SLAQVSLFLR EIGYGNFETV VPRRSEGYGI PERAILDHPD
AGLYVAMDCG TLDRKAVGMA RERGADCIVI DHHEVPDQGL APATVLINPK HSECESTFKE
FCSSGLTLLF LARLRRALRG FPKPALGGRF LILAAIATVA DMVPLVEGNR ILVRSGLNCA
NQVDYPPLDQ LVQKAGLSGK SITAGHLGYY IGPRINAAGR MADARTAYEL LVEDDPSAAA
RLAEQLNRYN TQRQNQEDAV VGEIRQRLAD RKVFGRTLVM ADAGWPAGII GIVASRVQQE
FHYGPVVVFS VDEAEGIARG SARSIPGFDI HSALASCDDV MLRWGGHKMA AGMTIALDRM
DEFAHRFEES ALHWPAHVFQ PRGRVDAELD TGLISVDLYR ELTKLEPHGL GNPTPTFASR
GVRVQVKKTF GRDASHLRLQ LDQRIGGVFW RGARHFRSAG LRDGETMDVV YQLDWDDYAG
RPVMQVRDAG RLF