Gene Sfum_0659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0659 
Symbol 
ID4460420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp792815 
End bp793795 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content61% 
IMG OID639701415 
Productapurinic endonuclease Apn1 
Protein accessionYP_844793 
Protein GI116748106 
COG category[L] Replication, recombination and repair 
COG ID[COG0648] Endonuclease IV 
TIGRFAM ID[TIGR00587] apurinic endonuclease (APN1) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCTGC TCGGCGCTCA CATGTCCATC GCCGGAGGAC TGCACAACGC TTTTCAACGC 
CTGCTGCAGG TACGGGGAGA GGCCCTGCAG GTGTTTCTGA AGAATCAGAG GCAGTGGCAC
CCCCCACCGT TGAGCGCCCA TGCCGTCCGC CTGTTTCAGG AGGAACGGGA CAATTTCGCT
CATGTCCCCG TCGCCGCTCA CGACAGCTAT CTCATCAACC TGGCCGCCCC CGATCCGGAC
ATCGCGGAAA AATCGGTCGG TGCATTCGCG AGCGAGCTCG ACGCGTGTGC GCAACTGGGC
ATCGAATTCC TGATCCTCCA TCCGGGGTTT CACCGCGGCG CGGGAATATC CACGGGCATC
GCCCGCTTCG CGAAAAACCT GGACCGTGCC TGTGCCCTTG CGAAAGCGCA CTCCGTGACC
GTGTTGATCG AAACCACCGC GGGCCAGGGT TCCGGAATCG GGTCGAAATT TGAGGAAATT
GCGGACATGT TGATGAAGTC GAAGGCCGGC CGGCCCCTCG GAGTCTGCTT CGACACGTGC
CACGCGTTTG CCGCGGGATA CGACCTTCGA GACGAACGCT CCTACGAACG CACGTTTGAC
CGCTTCGAAA AGACCATCGG ACTCCGTTTG CTGCGCTGGT TTCACCTCAA TGATTCCAAG
GCGGGCTTCG GTTCCCGCGT CGACCGGCAC GAACACATCG GTCGGGGAAG GATCGGGCTC
CAGGGGTTCC GCCTGCTCGT CAACGATCCC CGTTTCGAGT CACATCCCAT GGTGCTCGAA
ACTCCCAAGG GCAAGGATCT TCGCAACGAC AGGAAGAACC TGGCGACGCT CCGATCGTTG
CTGAGATCCG TTGACGATTC TGAAAGCGAA CCTTTCGAAG AAGATCGAGA GGCCGATCCT
GAAAACGGGG GCGAACGACG CCCCACGCCT CGGGATCCGG CGGGATCGCT CAAACGAGGG
AAACCTTCCG GTCACACCTG A
 
Protein sequence
MPLLGAHMSI AGGLHNAFQR LLQVRGEALQ VFLKNQRQWH PPPLSAHAVR LFQEERDNFA 
HVPVAAHDSY LINLAAPDPD IAEKSVGAFA SELDACAQLG IEFLILHPGF HRGAGISTGI
ARFAKNLDRA CALAKAHSVT VLIETTAGQG SGIGSKFEEI ADMLMKSKAG RPLGVCFDTC
HAFAAGYDLR DERSYERTFD RFEKTIGLRL LRWFHLNDSK AGFGSRVDRH EHIGRGRIGL
QGFRLLVNDP RFESHPMVLE TPKGKDLRND RKNLATLRSL LRSVDDSESE PFEEDREADP
ENGGERRPTP RDPAGSLKRG KPSGHT