Gene Sfum_1036 details


Gene Information

Locus tag: Sfum_1036
Symbol:
ID: 4460939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 1279202
End bp: 1281127
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 61%
IMG OID: 639701800
Product: chaperone protein DnaK
Protein accession: YP_845165
Protein GI: 116748478
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
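
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length. The function and variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1279202
end_bp = 1281127
reported_gene_length = 1926     # bp
reported_protein_length = 641   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # 1926 641
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.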


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000619516
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000149965
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCAAAG TGATCGGTAT CGATCTTGGA ACCACGAACT CGGTGGTTGC AATCATGGAG 
GGCAAGGACC CCAAAGTACT TCCGAACGCC GAAGGCGGTC GCACCACACC CTCCATGGTG
GCCTTTACGG ATAGTGGAGA GCGCCTGGTG GGCCAAGTGG CCAAACGACA GGCGATCACG
AACCCGGAAA ACACGATTTT CGCCGTGAAG CGGCTCATCG GGCGAAAGTA CGAGTCCCGC
GAGGTCCAGC AGGACGTGAA GATCCTGCCT TACAAGATCG CGAAGGCGAA GAACGACGAC
GCGCACATTT CCATCCGGGG CAGGGAGTAC AGCCCCACGG AGATATCCGC TTTCATCCTG
ATGAAAATGA AGCAGACCGC CGAAGATTAC CTGGGCGAGA AGGTGACCGA AGCGGTGATC
ACGGTTCCCG CCTACTTCAA CGACAGCCAG AGGCAGGCCA CCAAGGACGC GGGCAAGATC
GCCGGTCTCG AGGTGCTGCG GATCATCAAC GAACCGACGG CGGCATCCCT CGCCTACGGC
CTCGACCGCA AGAAGGACGA GAAGATCGCC GTGTTCGATC TCGGCGGCGG CACTTACGAT
ATTTCCATCC TCGAGATCGG GGAGGGGGTG TTCGAAGTCA AGTCGACCAA CGGGAACACC
CACCTGGGCG GGGAAGACTT CGACCAGCGC ATCATCGACT GGCTCGCCAA CGAATTCAAG
AAAGACTATG GAATCGACCT TCGCAGCGAC AAGATGGCGC TGCAACGGCT GAAAGAAGCG
GCGGAAAAGG CCAAGATCGA GCTGTCGTCC ACGATGGAGA CGGAGATCAA CCTTCCCTTC
ATCACGGCCG ACGCCGCCGG CCCCAAGCAC ATGACCATCA AGCTCTCGCG GGCGAAGCTC
GAATCCCTCG TGGAAGACCT CATCGACCAA CTGCTCCCGC CGATGGAGCA GGCCCTGCGC
GACGCCGGGC TCAGCCGCAC CGCCATTGAC GAGGTGATCC TGGTGGGCGG CATGACCCGG
ATGCCCCGGG TCCAGCAGAA GGTGCAGGAA TTCTTCGGCA AGGAACCTCA CAAGGGGGTC
AACCCGGATG AGGTCGTTGC AATCGGGGCG GCGATCCAGG CCGGGGTGCT CAAGGGCGAC
GTCAAGGACG TTCTGCTGTT GGACGTCACG CCTCTGTCCC TGGGCATCGA GACCCTGGGC
GGGGTCATGA CCCGGCTGAT CGAGCGCAAC ACGACCATTC CGACCCGCAA GAGTCAGATC
TTCTCCACCG CCACGGACAA CCAGACGGCG GTCTCCATCC ACGTTCTTCA GGGGGAGCGG
GAGATGGCGA GCAACAACAA GACTCTCGGC CGCTTCGAAC TGGTCGGGAT CCCGTCCGCG
CCTCGCGGCA TCCCGCAGAT CGAGGTGACT TTCGATATCG ACGCCAACGG CATCGTGCAC
GTCTCGGCAA AGGATCTGGC TACGCAGAAG GAGCAGTCCA TTCAGATCAC CGCTTCCAGC
GGCCTGAACA AGGACGAAAT ACAGAACCTG GTGAAGGAAG CCGAAGTGCA CGCCGAGGAA
GACAAGAAGA AGCGTGAGCT GGTGGAAACC AGGAACCAGG CGGATACGCT CATCTACAGC
ACCGAGCGGA CGATGCGGGA CATGGGGGAC AAGATCGACG CCCAGACGCG TCAGAATATC
GAGGAGCAGA TCGGAAAGCT GCGCAAGACC ATGGAGGATG GCGACAAGGA CGCGATCCAG
AAAGACATGG ACCAGTTGAT GCAGATTTCC CACAAGGTGG CCGAAGAGGC TTACAAGCGG
GCGGCGGAAC AACAGGGTCC CCAGGCCGGA GCGGAGGAGC AGGCAGCGCG CGAGTCCGCC
GGTGCGCAGA AAAAGCCCGA CGACGACGTC GTCGACGCCG ACTTCCAGGA AGTGAAGGAC
AAGTAG
 
Protein sequence
MSKVIGIDLG TTNSVVAIME GKDPKVLPNA EGGRTTPSMV AFTDSGERLV GQVAKRQAIT 
NPENTIFAVK RLIGRKYESR EVQQDVKILP YKIAKAKNDD AHISIRGREY SPTEISAFIL
MKMKQTAEDY LGEKVTEAVI TVPAYFNDSQ RQATKDAGKI AGLEVLRIIN EPTAASLAYG
LDRKKDEKIA VFDLGGGTYD ISILEIGEGV FEVKSTNGNT HLGGEDFDQR IIDWLANEFK
KDYGIDLRSD KMALQRLKEA AEKAKIELSS TMETEINLPF ITADAAGPKH MTIKLSRAKL
ESLVEDLIDQ LLPPMEQALR DAGLSRTAID EVILVGGMTR MPRVQQKVQE FFGKEPHKGV
NPDEVVAIGA AIQAGVLKGD VKDVLLLDVT PLSLGIETLG GVMTRLIERN TTIPTRKSQI
FSTATDNQTA VSIHVLQGER EMASNNKTLG RFELVGIPSA PRGIPQIEVT FDIDANGIVH
VSAKDLATQK EQSIQITASS GLNKDEIQNL VKEAEVHAEE DKKKRELVET RNQADTLIYS
TERTMRDMGD KIDAQTRQNI EEQIGKLRKT MEDGDKDAIQ KDMDQLMQIS HKVAEEAYKR
AAEQQGPQAG AEEQAARESA GAQKKPDDDV VDADFQEVKD K
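
As a spot check that the gene and protein sequences correspond, the sketch below translates the first 60 nucleotides and the last 6 nucleotides of the gene sequence. It is a minimal illustration, not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. Alternative start codons are not handled here, which is acceptable for this gene because it begins with ATG. The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Amino-acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and its final six bases, copied from the record.
first_60_nt = "ATGAGCAAAG TGATCGGTAT CGATCTTGGA ACCACGAACT CGGTGGTTGC AATCATGGAG"
last_6_nt = "AAGTAG"

print(translate(first_60_nt))  # MSKVIGIDLGTTNSVVAIME
print(translate(last_6_nt))    # K
```

The two results match the first 20 residues and the final residue of the protein sequence above, with TAG serving as the stop codon.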