Gene Sfum_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3803 
Symbol 
ID4457868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4650070 
End bp4654659 
Gene Length4590 bp 
Protein Length1529 aa 
Translation table11 
GC content61% 
IMG OID639704576 
ProductSPP1 family phage head morphogenesis protein 
Protein accessionYP_847907 
Protein GI116751220 
COG category 
COG ID 
TIGRFAM ID[TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTCGG ACCTGAAGCA GCGCATCCAG GCGGCCACCC TGAAGAGTCT GACGGCCCGC 
AACCGCTACA ACGACCAGGT CACGGCCCAG CTCACCCAGG CGCTGAAACA GGCCGAAGAC
GAGGTCGCCC GCGCCATTCT CCAGTACCGC TCCCTCGGTT CCCTGCCGGA CAACAAGCTC
GCCGCCCTCA AGGGGCTGGA AAAGCTTCAG CTCGAACTCG ACGACACCAT GAAGCGGCTC
AAGCGGGAGC AGACCCTGGT CTTTCGCAAG ACGACCAAGG ACTCCTTCAA ACTCGGCATC
CAGCAGGGAA TCGGAGAGCT CGCCGACGCG GCGCTGCCGT TCTACGCCGA CCTCAAACCC
GAAGGCATCG ACAAGTTGGC CACCAAGGTG TTCACCATCG TCGACACCAA CGCCCTCGAC
TTCATGGCGC AGTACAACCT CACGCTCGCC GGTGACGTCC ACCGCGAACT CGCAGACGGC
ATCAAGCGCA CCATCCTGAA CGGCGTCGCC ACGGGCAAGG GAGCCGACGA CATCGTCCGG
GACATGGGCA AGGTGATCAT CGACAAGGAT TCCTTTCGCC AGGCCGGAAG CCGGGTGTTC
AGCAAGGCCC AGTACCGCAT GGAGATGATC GCCCGCACCG AGGTCCTCCG CGCCCACAAC
ATGGGCAGGC TCAAGTTCCA CGAGCGGGTC GGCATCCAGA AACTGGAATG GCTGGCCATG
GAGGACGAGC GCATGTGCCC GGTCTGCGGC GGCCTGGACG GCAAGACCTT TCCCATCGAC
AAGTTCCCCC AGCAACCCGC GCATCCGCAC TGCCGCTGCA CCAACATCGT GGCGTGGCCG
ATGACCGTCT GCGGCAGCGA GATGGCCGCC AAGGCCGCCG CCCAGGCATC GCAGGGGGAC
GCCTGCATTC TCCCGCCCCA CGTGCTGGAA GGCTTGGCCG ACGCCCAGGC CAAGGAGAAC
GCCAAGCTCA AGAGCGCCTT TGAAAACGGC GACATTGCCG ACCTCGGCTC GCTGACGGTC
AAACAGCTTC AGACTCTGGC GAAACAGAAC GGCGTGGCCA TCGCCCGGAC CAAGGCCGAT
TTCATCAAGC TGCTCGATCT GGCTGAACCG GGGATCGATC ACGGTGACCT GGCCGGAGCG
GCGCTCAGCG CCAAGCTCAA GGAACACAAG ATCGGCCTGC TGCGGACCAA GGAAGAACTG
GTCGAGCTGC TCGGACTGAA GCAGGCGGAA CTCAAACAGG CCAAGCTGCT CGCCGCCCAG
ATGGCGAAGA TCCCTCCCAC CGAGGGGCTG GAAGGCATGA CCGCCCAGCA GCTCAAGGAG
ATGGCAAAGG AGAACGGCAT CTCCCTCAAC ATGACCAAGC AGGAGACCAT CGAGCTGCTG
GACAAGCTGG AACCCGGCGT GGACCACAGC GGCCTGATGG GCAAGGAACT CGCGGCGGCC
AAACAGAAGC ACGGCATCGG CATCCTCAAG AACAAGCAGC AGCTCGTCGA GGCGCTGCAG
AAAAAGGCCG GTGCCGACAT GGCCGAGTCG GTCAAGAAAA AGGCGGTCGA CGAGGCCAAA
CAAAAGCTGA TCCTGAAACA GAAAACGGCC CTCGAAGACG CCGCCAAGGC CGTTGTCGTT
CCCGACACGC CGACCGGCTA TAAGGATTTC CTCGACGCGA TTGCCAAGGC GGAACAGGCG
GTTTCCGGCG GCACCGATCT GCCCCAGGAA CTGCTTGCGG CCCACAGCAA GGAAATCGCC
CTCAAAAAAC AGCTCTTCCA GGATCAGGTC GGCAAACTGA AATCGGCGGA GCTCAAGACG
CTCGCCAAGG AGACCAAGGT CCAGTATTGG CAGTGGGCCA ACAAAGACGA GCTGACCACG
CTCTTCACCG AGACCGATCC CGCGAAAATC AAGGCGGTTC AGGCCAGCAT CGACACCAAG
CACGCGGCCT GGGCCGAAAA GCATGGAGGC AAGAAGAAAA CCGTACCGGC CAAGCCCGCC
ACGCCGAAGA CGGAACTGCC GAAACCGGCT CCGCAGCCGA GCCCGGTCAA GCCGCCCGAG
CCCAAGATCG GCAAGAAAGG CGCGGAGTTC GCCACCGTCG ATGCCTCGTG GCAGCAGAAG
GGTCTGCCTT CGAAGTTCAA GAAAACCGGC AAGGCCGCTG TCGGCGGCGC ACATGAAAAG
GAGTTCTGGA CTGATGAAAA CGGCGACAAA TGGCTGTTCA AGCCTCATGG CCGCAAGGAC
GATGAGTTCA TCGCCTTCGG AGAGGAAGCG GTCTACAAGA TCGGCCGCCT GATCGACCCC
CATTCCATCG AGGTGCGCAC CATCCAATTG AACGGCCGCA CCGGCTCCAT CCAGAAATGG
CGCACCGATC TGCGGGAAGA CTTCGATTTT CGCAACATCC TGCCCCAGGA TCTGACCACC
ATCGAACTGG AGCAGATCCA GCGCGAGCAT GTGGTCGACT GGCTGATCGC CAATCACGAC
GGACATTCCA AGCAGTTCAT CCGTGCCCGG GACGGTCGCG TCTACGGAAT CGACAAAGGC
CAGGCCTTCA AGTTTCTGGG CCAGGACAAG CTCTCGCTCG ACTATCACCC CAACGGCGTC
TGCGGCGAGG AGGAGCCTTT TTACAACAAG GTCTTCCGGG CGGCCAAGGA AGGGAAGGTA
CGGGTCGATC CGAACGCGAC CCTGCGCTAC ATCCAGGAAG TCGAAAAGAT CGCCGACGAG
GATTACCTCG ATCTGTTGCG ACCCTACGCC GAGGGACGGT TCGCCAAGGA CCCGGCCGGG
CTGCGGCATT TCTACGATCA GGTCCTGGAG CGGAAACACA ATCTCCGGCG GGACTTCGAG
GGCTATTACG CAGATGTGCT GGGGGATCGG GGCTTCCGTT TCGACAAGCT GAGTACCGCC
ACCGGCAAGA AAAAGCTGCT CTCCTCAACA GAGGAAGCCC TGGTCGAGGA GGCCCGCAAG
CTCGGCTGGC AGGGCAAGAC GCTGCCCTTC GACAGCGGCG ACGTGGAAGA CCAGAACGCG
CTGATCTTCA CCGAGAGCTT CAAGGGGAAG AAGCGCACCG TGGTCAAGAT GAAGATCCGG
CCGGACACCG ACCGCCGCAT CGACGAGGTG CTGCGCAGGT ATGTGCAGAC GGCGGCCTGT
GAAAAGGGAC AACCGCTGGC CGAGGACAGC TTCTTTCCGA CGATTCTGGA CGCCGTCAAA
AACGTCAACT TCCACGTGGG CGACGGCAAG TACAACCGGA CCAAGATCGA CAAGGCCCTG
CGCCTGCGGA AGAAACTGGA AACCCTGCAA AAGAGCGCCG ACCCCAAGGT CAAGGAGATG
GCGGACCACT ATTTGAAATG GGTCAAGGAG ATCGAAGAGT CCGTCGACTG GGACCGGGCC
ACCAACGGCG TATTCGAGCA GTACCTGCCC AAGCTCGACG CGCAGAAACC CAAGGAGAAA
CCGCCGTTCA AGGTGGAACG CGGCAAGGTG ACCCACACCA AGCGCAGGAT CGGGTCCGGC
ACCATTACCG TCGAGGCCGA CGACATCGAC AACCGGACGC TGTTCAATCA CAACTCCCGC
ATGCAGGACG GGCACCAGTA CACCGTCACC TTCGAGGACG GCACCCGGGT CCGCTATCGC
CCCTGGTCCG ACACCAACCT CTATGCCCAG CGCGGCGAGC TGGAAATGAT CCTGGATGGC
GACGCCACCC CCGGACGGGT CGAGGCGATG CTGGAAAAGC TCGAACAGCT TGGGATCGAC
ACCCGGGTGG CCACGGCGGA AAACGCGGAG CAGATGTACC TCGAAAAGCT CGCCTACATC
CGCAAGAGCG ACAAAAGCGC CGATTTCAAA CGGCTGCAGA AATCCCTCGA TGACCGCAAC
GCCACCACCA CCGAGCGGGT CCAGGCTCTG CGCGGCTATT GGCAAAAGGA ACTGGGTGTC
CAGGACATCA CCCAGCTTTC CGGATACAAC CCGCTGGGCG AATACCAGGC GGGCTTTCTG
GACCGCGACG CCAAGGGCGG ATACCGGCAC CAGTTCCGGT TCGACATCAC CGAGGAGGAC
CTGGAAAAAC AGATGAAGGG CTACTCGCTG GTCCACGATC TGACCAACGG CGAAAGCATG
TCCGGCTTTA TCGACTTGAT CATGGAAAAC AACGGAGCCA TGGTCAGCAC GGTCGAGAAA
ATGCGCATGG GCGTGGCTCC GGGAGGAATG TCCCCGGTGG CCGACATGCA GACCGGCGGC
GCGAGCTATT TCTTCACCCG AATCAAGAAG CAACCGGCCA GCGACGCCTC ACCGGCCCTC
TACTTCAAGA AACAGATGCT GCGGCGCATG GACGCCATCA GCTATGACCA TGACGCCTAC
GGCAAGGTGA TCGACGACTA CGTGCAGCGC AACCGGGGAG CCAGTATCGA TGATTGGAAG
CGGTTCTCGC AGCGCCATGG CAACGAAACC ATCTTCAAAT ACTCGGTGAC GCTGCTGGAC
AACATCGAGT TCATCGTGGC CAGAAGCGAC AACGAACGCC GGGAGATCAT CCAGAGTTTC
ACCCGGCGTG GCATCAAGAA ACTGCCCGAC GGGCGCAAGG TGGAGGACAT CGTCCATACC
TCGCAAAGCT GGAGCAAACG CAAACAATGA
 
Protein sequence
MPSDLKQRIQ AATLKSLTAR NRYNDQVTAQ LTQALKQAED EVARAILQYR SLGSLPDNKL 
AALKGLEKLQ LELDDTMKRL KREQTLVFRK TTKDSFKLGI QQGIGELADA ALPFYADLKP
EGIDKLATKV FTIVDTNALD FMAQYNLTLA GDVHRELADG IKRTILNGVA TGKGADDIVR
DMGKVIIDKD SFRQAGSRVF SKAQYRMEMI ARTEVLRAHN MGRLKFHERV GIQKLEWLAM
EDERMCPVCG GLDGKTFPID KFPQQPAHPH CRCTNIVAWP MTVCGSEMAA KAAAQASQGD
ACILPPHVLE GLADAQAKEN AKLKSAFENG DIADLGSLTV KQLQTLAKQN GVAIARTKAD
FIKLLDLAEP GIDHGDLAGA ALSAKLKEHK IGLLRTKEEL VELLGLKQAE LKQAKLLAAQ
MAKIPPTEGL EGMTAQQLKE MAKENGISLN MTKQETIELL DKLEPGVDHS GLMGKELAAA
KQKHGIGILK NKQQLVEALQ KKAGADMAES VKKKAVDEAK QKLILKQKTA LEDAAKAVVV
PDTPTGYKDF LDAIAKAEQA VSGGTDLPQE LLAAHSKEIA LKKQLFQDQV GKLKSAELKT
LAKETKVQYW QWANKDELTT LFTETDPAKI KAVQASIDTK HAAWAEKHGG KKKTVPAKPA
TPKTELPKPA PQPSPVKPPE PKIGKKGAEF ATVDASWQQK GLPSKFKKTG KAAVGGAHEK
EFWTDENGDK WLFKPHGRKD DEFIAFGEEA VYKIGRLIDP HSIEVRTIQL NGRTGSIQKW
RTDLREDFDF RNILPQDLTT IELEQIQREH VVDWLIANHD GHSKQFIRAR DGRVYGIDKG
QAFKFLGQDK LSLDYHPNGV CGEEEPFYNK VFRAAKEGKV RVDPNATLRY IQEVEKIADE
DYLDLLRPYA EGRFAKDPAG LRHFYDQVLE RKHNLRRDFE GYYADVLGDR GFRFDKLSTA
TGKKKLLSST EEALVEEARK LGWQGKTLPF DSGDVEDQNA LIFTESFKGK KRTVVKMKIR
PDTDRRIDEV LRRYVQTAAC EKGQPLAEDS FFPTILDAVK NVNFHVGDGK YNRTKIDKAL
RLRKKLETLQ KSADPKVKEM ADHYLKWVKE IEESVDWDRA TNGVFEQYLP KLDAQKPKEK
PPFKVERGKV THTKRRIGSG TITVEADDID NRTLFNHNSR MQDGHQYTVT FEDGTRVRYR
PWSDTNLYAQ RGELEMILDG DATPGRVEAM LEKLEQLGID TRVATAENAE QMYLEKLAYI
RKSDKSADFK RLQKSLDDRN ATTTERVQAL RGYWQKELGV QDITQLSGYN PLGEYQAGFL
DRDAKGGYRH QFRFDITEED LEKQMKGYSL VHDLTNGESM SGFIDLIMEN NGAMVSTVEK
MRMGVAPGGM SPVADMQTGG ASYFFTRIKK QPASDASPAL YFKKQMLRRM DAISYDHDAY
GKVIDDYVQR NRGASIDDWK RFSQRHGNET IFKYSVTLLD NIEFIVARSD NERREIIQSF
TRRGIKKLPD GRKVEDIVHT SQSWSKRKQ