Gene Sfum_3753 details


Gene Information

Locus tag: Sfum_3753
Symbol:
ID: 4457920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 4586543
End bp: 4587574
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 60%
IMG OID: 639704527
Product: peptidase U32
Protein accession: YP_847858
Protein GI: 116751171
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
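
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of a complete CDS: one codon per residue, minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record for Sfum_3753.
start_bp, end_bp = 4586543, 4587574

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1032
print(protein_length_aa(length))  # 343
```

Both results match the Gene Length (1032 bp) and Protein Length (343 aa) fields of the record.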


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGAAC TGCTGGCACC CGGCGGAAGT CTCGAGATGG TGGAAGAAGT CTTTCGGCAG 
GGCGCCGATG CCGTCTACGT AGGCTCCAAG GGATTCAGCA GGCGCAAGTG CGCGTGGGAG
CTCGAAGATT CTCAAATCCG GGACGCCGTC GCCATAGGGC GGCGGATGAA CGGCAGAATC
CGTGTTGCCG TCAATGCCGA GGTGCCGCGG GAAAAGTCCT CCATTGTGAT GCGCAAGATC
GGCAAATACG CGGAGTGGGG AATCGAGGGA GTCATCGTCA AGAGCCCTTT TATCATGGAA
ATGGTAAAGG AGGGTTTCCC GGAGCTGGTC ATCCATGCCA GCGTGGGGTG CAATATCCGG
ACGCCCGAAC AGATGTCCGA ATACAAGGCA TATGGGGCAA CCCAGGTGGT GGCTTCCACG
GAAATCGACA GCGTGACGAA GTTGAGAGCG TTCAAGGAGT CCGCCGACAG GCTCGGACTC
GGCACGGAGG TCCTGATCCA CGGCAACCGT TGCCTGGGCG GCGTGGGCAA CTGCATGTTC
CACGAGCTCA TCAGCGACTC GTACATCAAA CGTATCCACC ACGACGAAGA AGGCAACGAA
ATCGTGGAGT ACGAAGGCTG GCCCGACCGG AGCGGCAGCT GTTTCCGGCT GTGTCTTTTG
ACCGATGCGC AGCGGGAGAA GGTACTGCGG CAGCGCCGCC ATCGCGATGA GGAAATTCGG
GCGATCAACG AGCGTATCCG GCTGCACCCC AATGTCGCAT TCATGATCAA CGGCGAGGAA
CTCTGGGACT ACCTGGGGAT CGGGCTTCAC ACGGTCAAGG TCCAGGGCCG CGAGTATGCA
ACCCCCCTCA TCGGGCGGAT GATCGGAATC TACCGCAGGC TGATCGACGC TTTTGGTTCC
GGCAGGGCTT GTGCCGAACC GGAACTCGTC GCCTTGCAGC GTGAGCTGGC CGAAATCGCC
GCCGACCGGG ACCGTGCCCG CATGGAAAAA ACCCGGGAGC TGCATCGCAA CATCAAGGGC
TTGTACGCCT GA
 
Protein sequence
MNELLAPGGS LEMVEEVFRQ GADAVYVGSK GFSRRKCAWE LEDSQIRDAV AIGRRMNGRI 
RVAVNAEVPR EKSSIVMRKI GKYAEWGIEG VIVKSPFIME MVKEGFPELV IHASVGCNIR
TPEQMSEYKA YGATQVVAST EIDSVTKLRA FKESADRLGL GTEVLIHGNR CLGGVGNCMF
HELISDSYIK RIHHDEEGNE IVEYEGWPDR SGSCFRLCLL TDAQREKVLR QRRHRDEEIR
AINERIRLHP NVAFMINGEE LWDYLGIGLH TVKVQGREYA TPLIGRMIGI YRRLIDAFGS
GRACAEPELV ALQRELAEIA ADRDRARMEK TRELHRNIKG LYA
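
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be reproduced from the listed gene sequence. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code for every codon (table 11 differs only in which start codons it allows). The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# The amino acid assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, as displayed in the record.
first_line = "ATGAATGAAC TGCTGGCACC CGGCGGAAGT CTCGAGATGG TGGAAGAAGT CTTTCGGCAG"
last_line = "TTGTACGCCT GA"

print(translate(first_line))  # MNELLAPGGSLEMVEEVFRQ
print(translate(last_line))   # LYA
```

The two printed fragments agree with the start (MNELLAPGGS LEMVEEVFRQ) and the end (LYA) of the protein sequence in the record. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the rounded GC content field.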