Gene Sfum_3063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3063 
Symbol 
ID4458613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp3771064 
End bp3772380 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content57% 
IMG OID639703835 
Productcarboxyl-terminal protease 
Protein accessionYP_847172 
Protein GI116750485 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0122157 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAGA AGCGCGCCAA AGTCATCGTA TTGCTCGCTG CGTTTGCCTT TCTGCTCATG 
AGTTATTCCG GTCTGAGAGC GGCCAGGAAC AGCCCTGACA ATTCCGACAT GTATCAGTAC
TTGAAGCTCT TCAGCGACGT GTTGAACATC GTTCAGGACA ACTACGTCGA GAAGGTTGAC
ACAAAGAAAT TGATGTACGG GGCCGTCAAC GGCATGCTCC GGGAACTCGA TCCTCATTCT
TCGTTTCTGC GTCCCGAAGA CTACAAGGAA CTGCAGATAG AAACCAAAGG GAAGTTCGGC
GGCCTGGGAA TCGAAATCAC CATGCGCGAC AACGTGTTGA CCGTCGTCGC GCCCCTTGAG
GACACGCCCG CGGATCGTGC GGGGGTACTG GCCAATGACC AGATTGTCAA GATCGACGAC
CAGCCGACCC AGGACATGTC CCTCATGGAC GCCGTGCAGA AAATGCGGGG TCCCAAGGGA
ACCAAGGTCA GGCTGACCAT TATCCGCAAA GGCGAAAAGA AGCCTCTCGA ATTCGAGCTC
ACCCGGGACA TCATTGCAAT ACAGAGCGTG AAGTACCGGA CGCTGGAATC GGGTTACGGC
TATGTGAGGA TCACGAGTTT CCAAAGCGGG ACAGCCAATG ATCTGCGCAA GGCTCTGGAA
CACCTGGAAA ATGACAACCA CCCGCTTCAG GGCCTCGTGC TCGATCTGCG CAACGACCCG
GGAGGACTCC TGGATCAGGC GGTCGAGGTG AGCGACGAAT TCATCGATGA AGGCCTTATC
GTATACACGG GCGGGCGCCT GGAGAGTCAG AAGATGCGCT TTGAGGCGCA CAAGGGAACC
AAGGCGCACG GCTACCCGAT GGTCGTTCTG GTGAATTCGG GGAGCGCCAG CGCGTCGGAA
ATTGTGGCCG GGGCGCTTCA GGACCACAAA CGTGCCATCA TTCTCGGTGA ACCCACGTTC
GGGAAGGGTT CCGTTCAGAC GGTCATCCCT CTCAACGACG GCTCGGCGCT TCGCCTGACC
ACCTCGCTGT ACTACACGCC ATCGGGCAGA TCCATTCAGG CCAAGGGGAT CGAGCCTGAC
ATCGTGGTCA AGCGCGAGAC CCCGCAAAAG GGGGAAGAGC CGGTCGGCGA CGAGTTGCGC
CGCATTCGGG AAAAGGATCT GCCGCGGCAC ATGGAGAATC AGAAACCCGA TTCCGGCGGC
GTCAAGCCGG AAAGCATTCC CATGGATTCC AAGCTGATCG ATCAGGACAA CCAGCTCAGA
AGGGCGCTGG ATCTTCTGAA AAGTTACAAA ATCATGGCCC AGATGCAGTT CAATTAA
 
Protein sequence
MSKKRAKVIV LLAAFAFLLM SYSGLRAARN SPDNSDMYQY LKLFSDVLNI VQDNYVEKVD 
TKKLMYGAVN GMLRELDPHS SFLRPEDYKE LQIETKGKFG GLGIEITMRD NVLTVVAPLE
DTPADRAGVL ANDQIVKIDD QPTQDMSLMD AVQKMRGPKG TKVRLTIIRK GEKKPLEFEL
TRDIIAIQSV KYRTLESGYG YVRITSFQSG TANDLRKALE HLENDNHPLQ GLVLDLRNDP
GGLLDQAVEV SDEFIDEGLI VYTGGRLESQ KMRFEAHKGT KAHGYPMVVL VNSGSASASE
IVAGALQDHK RAIILGEPTF GKGSVQTVIP LNDGSALRLT TSLYYTPSGR SIQAKGIEPD
IVVKRETPQK GEEPVGDELR RIREKDLPRH MENQKPDSGG VKPESIPMDS KLIDQDNQLR
RALDLLKSYK IMAQMQFN