Gene Sfum_0020 details


Gene Information

Locus tag: Sfum_0020
Symbol:
ID: 4461331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 32284
End bp: 34755
Gene Length: 2472 bp
Protein Length: 823 aa
Translation table: 11
GC content: 58%
IMG OID: 639700772
Product: ATP-dependent protease, putative
Protein accession: YP_844158
Protein GI: 116747471
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID: [TIGR00764] lon-related putative ATP-dependent protease
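
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the gene record; the helper names are hypothetical.
START_BP = 32284
END_BP = 34755
GENE_LENGTH_BP = 2472
PROTEIN_LENGTH_AA = 823


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2472
    print(expected_protein_length(GENE_LENGTH_BP))  # 823
```

Under these assumptions both derived values agree with the reported gene length (2472 bp) and protein length (823 aa).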


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.140592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.225222
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGAA AATTCCCGGA ACTTTCAGTG AAAGAATTGA GAGCATACAT CGAACCGGAA 
ACACTTCCCT TTGAGACGAC CGCTTCCCTG GAGGCTCCGG GAGACCGGGT GGTGGGGCAG
GAAAGGGCCA TCGACGCCAT ACAGTTCGGT ATGGGCATGA AGACGAGCGG CTACAACATC
TTCATCGCGG GGCCCGCCAA GGCCGGTCTG ACGTTCATGG CGAAATCGTT CATCGAGGAA
CGGGCGCGCA AGGAACCGAC GCCCCCCGAC TGGTGTTATG TCTTCAATTT CAAGGAGCAG
GACAAACCCA AGTGTCTCAA GGTGACGCCC GGTCGGGGCA AGGCGCTCAA GAAAGACATG
AATGAGTTCA TCGATACGCT GCAGGCCAAA ATCCCCGAAG TTTTCGACAG CGACGATTAC
CGGGCCAAGG AAAGCGAGGT GCACCAGGCC TTTGAGAAGC AGCGCCGGGA AATCATCGAT
GAGCTCTCGC AGCAGGCAAA GGACCAGGGG TTCATTCTGC AGTTCTCCCA GGTCGGAATG
GTGATCATCC CCGCCAATAA GGAAGGGGAA CCCATGACCC AGGAGGATTT GCGGCACCTG
GGCGAGGAAG AGCGGTTGGT GCTGCGGGAA AAGAGCGACG AGCTCCACGA AAAGATGAAA
GAGGCGATCA AGCACATCCG CGAGGCCGAA GCCGAATTCA AGGATAAACA CACGAAATTG
GACAATGAGA TCGCGCTCTT CGTCGTCGGC CAGCTCATGG ATACCTACGA GGAAACGTAC
AAGGACGACG AACAGGTACT CGAATACCTG AAGATGGTCC AGGAAGACAT CCTCGAAAAC
ATCGAGGATT TCAAGAAGAA ACCCGAAGCG GCCCAGCAGC AGGGACAGCC CGGCACACCT
TTCCCGATGC CCACGCGGGA AACCGCCATG CGCAAGTACG ACGTGAATGT CCTCATCGAC
AATTCGGAAA CGGAAGGCGC TCCCGTGGTC ATCGAATCCA ACCCCGCTTA CCCGAACATT
TTCGGGTCCA TCGAACGGCA GGCGTGGTTC GGAGCGCTGT TCACCGACCA TACCATGATC
AAGGCGGGAG CGCTTCACAA GGCCAATGGC GGGTATCTCG TCATGAAGGC GCTGGACCTG
TTGAAGTGGT TCGTCACCTA CGAGGCCCTC AAACGGGCGC TCCGGGACGG TGAGGTCCGG
ATCGAGGATC TGGGCGAACT GTACGGTATT TTCAGCACCC GCACCATTCG CCCGGAGCCC
ATCCCGCTCA ACGTCAAGAT CGTCCTCACC GGCGACCCGT ACATCTACCA GTTGCTCTAC
ACCTACGATG ACCGCTTCCA GAAGCTGTTC AAGGTCAAGG CCCATATGGA CGATCAGATG
GACCGCAGCG AGGACAAGCT CCTCCAGTGC GCCGGGATGA TCAGCCGCTA TTGCGAGGAC
CACAAACTGC GCCATCTCGA CCGGACCGGC GTATCCCGCG TGCTCGAATA CAGCGTGGAG
CTCACGGAAG ACCGCGACAA GCTCACTTTG GAGTTCGGCA CCATCGGCGA TCTGCTCAAG
GAAGCCAATT ACTTCGCCGG GATCGAGGAC TGCGAATTCG TCAAGCGCGA GCATGTGGAG
CAGGCGATCA AGAAACGGAT CTACCGCTCC AACCTCATCG AGGAGCGCAT GAAGGAGATG
ATGCTCAAGG ACATCTTCTG GGTGGAAACC ACGGGCGGCA AGGTCGGCCA GGTGAACGGC
CTTTCCGTGC TCATGGCGGG GGATCACGTG TTCGGAAGGC CGAACCGCAT CACGTGTTCG
GTGTCGGTGG GCCGCGAGGG CATGATTTCC ATCGATCGGG AATCCAAGAT GAGCGGGCCG
ACCCACACCA AGGGCCTCAT CATCCTGAGC AGCATCCTCA AAGACCGCTT CGCCCACAAC
AAGCCGATCT CGCTCAACGC GTCCCTGTGT TTCGAGCAGA GCTACGGCAT GGTGGACGGC
GACAGCGCGT CGAGCACCGA GCTGTACGTG CTCCTGAGCG CCATTGCCGA CGTCCCCATC
AAGCAGGGAA TCGCCGTCAC CGGGTCGGTG AGCCAGAAAG GCGAGATTCA GCCCATCGGC
GGGGTGAACC ACAAGGTCAA GGGATTCTTC GACATCTGCA AACACAAGGG ATTGACCGGA
AAACAGGGCG TCATGATCCC GTCCAAGAAT GTCCGCAACC TGATGCTCGA CCAGGAAGTG
ATCGATGCCG CCAAGGAAGG CAAGTTCCAT ATCTGGCCGG TTTCGACCAT CGAGGAAGGG
ATCGAGCTCC TCACGGGAAT GAAAGCCGGC AAACTGCAGC CGGACGGAAC CTACCCCGAG
GGCACCCTGT TCCGCAAGGT CGATGACCGC CTCAGAGAGA TTGCGGAGAT CGTCAAGACC
TTCGGAAAGG ACTCCGACAA CGGCAGAAAG TCCTCCGAGG AGGAGGGCGG CTGCTCCGGC
TGCGGAGCCT GA
 
Protein sequence
MARKFPELSV KELRAYIEPE TLPFETTASL EAPGDRVVGQ ERAIDAIQFG MGMKTSGYNI 
FIAGPAKAGL TFMAKSFIEE RARKEPTPPD WCYVFNFKEQ DKPKCLKVTP GRGKALKKDM
NEFIDTLQAK IPEVFDSDDY RAKESEVHQA FEKQRREIID ELSQQAKDQG FILQFSQVGM
VIIPANKEGE PMTQEDLRHL GEEERLVLRE KSDELHEKMK EAIKHIREAE AEFKDKHTKL
DNEIALFVVG QLMDTYEETY KDDEQVLEYL KMVQEDILEN IEDFKKKPEA AQQQGQPGTP
FPMPTRETAM RKYDVNVLID NSETEGAPVV IESNPAYPNI FGSIERQAWF GALFTDHTMI
KAGALHKANG GYLVMKALDL LKWFVTYEAL KRALRDGEVR IEDLGELYGI FSTRTIRPEP
IPLNVKIVLT GDPYIYQLLY TYDDRFQKLF KVKAHMDDQM DRSEDKLLQC AGMISRYCED
HKLRHLDRTG VSRVLEYSVE LTEDRDKLTL EFGTIGDLLK EANYFAGIED CEFVKREHVE
QAIKKRIYRS NLIEERMKEM MLKDIFWVET TGGKVGQVNG LSVLMAGDHV FGRPNRITCS
VSVGREGMIS IDRESKMSGP THTKGLIILS SILKDRFAHN KPISLNASLC FEQSYGMVDG
DSASSTELYV LLSAIADVPI KQGIAVTGSV SQKGEIQPIG GVNHKVKGFF DICKHKGLTG
KQGVMIPSKN VRNLMLDQEV IDAAKEGKFH IWPVSTIEEG IELLTGMKAG KLQPDGTYPE
GTLFRKVDDR LREIAEIVKT FGKDSDNGRK SSEEEGGCSG CGA
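
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own pipeline. It assumes the usual codon-to-amino-acid assignments that NCBI translation table 11 shares with the standard code, and it does not handle table 11's alternative start codons, which does not matter here because the gene starts with ATG. `translate`, `GENE_START`, and `GENE_END` are hypothetical names. The two fragments are copied from the beginning and the last line of the gene sequence.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
# Amino-acid assignments assumed identical for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 12 bases of the gene sequence in this record.
GENE_START = "ATGGCAAGAA AATTCCCGGA ACTTTCAGTG"
GENE_END = "TGCGGAGCCT GA"

if __name__ == "__main__":
    print(translate(GENE_START))  # MARKFPELSV
    print(translate(GENE_END))    # CGA
```

The two outputs match the first ten and last three residues of the protein sequence listed above, and the final codon TGA is a stop codon.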