Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_3803 |
Symbol | |
ID | 4457868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 4650070 |
End bp | 4654659 |
Gene Length | 4590 bp |
Protein Length | 1529 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639704576 |
Product | SPP1 family phage head morphogenesis protein |
Protein accession | YP_847907 |
Protein GI | 116751220 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTCGG ACCTGAAGCA GCGCATCCAG GCGGCCACCC TGAAGAGTCT GACGGCCCGC AACCGCTACA ACGACCAGGT CACGGCCCAG CTCACCCAGG CGCTGAAACA GGCCGAAGAC GAGGTCGCCC GCGCCATTCT CCAGTACCGC TCCCTCGGTT CCCTGCCGGA CAACAAGCTC GCCGCCCTCA AGGGGCTGGA AAAGCTTCAG CTCGAACTCG ACGACACCAT GAAGCGGCTC AAGCGGGAGC AGACCCTGGT CTTTCGCAAG ACGACCAAGG ACTCCTTCAA ACTCGGCATC CAGCAGGGAA TCGGAGAGCT CGCCGACGCG GCGCTGCCGT TCTACGCCGA CCTCAAACCC GAAGGCATCG ACAAGTTGGC CACCAAGGTG TTCACCATCG TCGACACCAA CGCCCTCGAC TTCATGGCGC AGTACAACCT CACGCTCGCC GGTGACGTCC ACCGCGAACT CGCAGACGGC ATCAAGCGCA CCATCCTGAA CGGCGTCGCC ACGGGCAAGG GAGCCGACGA CATCGTCCGG GACATGGGCA AGGTGATCAT CGACAAGGAT TCCTTTCGCC AGGCCGGAAG CCGGGTGTTC AGCAAGGCCC AGTACCGCAT GGAGATGATC GCCCGCACCG AGGTCCTCCG CGCCCACAAC ATGGGCAGGC TCAAGTTCCA CGAGCGGGTC GGCATCCAGA AACTGGAATG GCTGGCCATG GAGGACGAGC GCATGTGCCC GGTCTGCGGC GGCCTGGACG GCAAGACCTT TCCCATCGAC AAGTTCCCCC AGCAACCCGC GCATCCGCAC TGCCGCTGCA CCAACATCGT GGCGTGGCCG ATGACCGTCT GCGGCAGCGA GATGGCCGCC AAGGCCGCCG CCCAGGCATC GCAGGGGGAC GCCTGCATTC TCCCGCCCCA CGTGCTGGAA GGCTTGGCCG ACGCCCAGGC CAAGGAGAAC GCCAAGCTCA AGAGCGCCTT TGAAAACGGC GACATTGCCG ACCTCGGCTC GCTGACGGTC AAACAGCTTC AGACTCTGGC GAAACAGAAC GGCGTGGCCA TCGCCCGGAC CAAGGCCGAT TTCATCAAGC TGCTCGATCT GGCTGAACCG GGGATCGATC ACGGTGACCT GGCCGGAGCG GCGCTCAGCG CCAAGCTCAA GGAACACAAG ATCGGCCTGC TGCGGACCAA GGAAGAACTG GTCGAGCTGC TCGGACTGAA GCAGGCGGAA CTCAAACAGG CCAAGCTGCT CGCCGCCCAG ATGGCGAAGA TCCCTCCCAC CGAGGGGCTG GAAGGCATGA CCGCCCAGCA GCTCAAGGAG ATGGCAAAGG AGAACGGCAT CTCCCTCAAC ATGACCAAGC AGGAGACCAT CGAGCTGCTG GACAAGCTGG AACCCGGCGT GGACCACAGC GGCCTGATGG GCAAGGAACT CGCGGCGGCC AAACAGAAGC ACGGCATCGG CATCCTCAAG AACAAGCAGC AGCTCGTCGA GGCGCTGCAG AAAAAGGCCG GTGCCGACAT GGCCGAGTCG GTCAAGAAAA AGGCGGTCGA CGAGGCCAAA CAAAAGCTGA TCCTGAAACA GAAAACGGCC CTCGAAGACG CCGCCAAGGC CGTTGTCGTT CCCGACACGC CGACCGGCTA TAAGGATTTC CTCGACGCGA TTGCCAAGGC GGAACAGGCG GTTTCCGGCG GCACCGATCT GCCCCAGGAA CTGCTTGCGG CCCACAGCAA GGAAATCGCC CTCAAAAAAC AGCTCTTCCA GGATCAGGTC GGCAAACTGA AATCGGCGGA GCTCAAGACG CTCGCCAAGG AGACCAAGGT CCAGTATTGG CAGTGGGCCA ACAAAGACGA GCTGACCACG CTCTTCACCG AGACCGATCC CGCGAAAATC AAGGCGGTTC AGGCCAGCAT CGACACCAAG CACGCGGCCT GGGCCGAAAA GCATGGAGGC AAGAAGAAAA CCGTACCGGC CAAGCCCGCC ACGCCGAAGA CGGAACTGCC GAAACCGGCT CCGCAGCCGA GCCCGGTCAA GCCGCCCGAG CCCAAGATCG GCAAGAAAGG CGCGGAGTTC GCCACCGTCG ATGCCTCGTG GCAGCAGAAG GGTCTGCCTT CGAAGTTCAA GAAAACCGGC AAGGCCGCTG TCGGCGGCGC ACATGAAAAG GAGTTCTGGA CTGATGAAAA CGGCGACAAA TGGCTGTTCA AGCCTCATGG CCGCAAGGAC GATGAGTTCA TCGCCTTCGG AGAGGAAGCG GTCTACAAGA TCGGCCGCCT GATCGACCCC CATTCCATCG AGGTGCGCAC CATCCAATTG AACGGCCGCA CCGGCTCCAT CCAGAAATGG CGCACCGATC TGCGGGAAGA CTTCGATTTT CGCAACATCC TGCCCCAGGA TCTGACCACC ATCGAACTGG AGCAGATCCA GCGCGAGCAT GTGGTCGACT GGCTGATCGC CAATCACGAC GGACATTCCA AGCAGTTCAT CCGTGCCCGG GACGGTCGCG TCTACGGAAT CGACAAAGGC CAGGCCTTCA AGTTTCTGGG CCAGGACAAG CTCTCGCTCG ACTATCACCC CAACGGCGTC TGCGGCGAGG AGGAGCCTTT TTACAACAAG GTCTTCCGGG CGGCCAAGGA AGGGAAGGTA CGGGTCGATC CGAACGCGAC CCTGCGCTAC ATCCAGGAAG TCGAAAAGAT CGCCGACGAG GATTACCTCG ATCTGTTGCG ACCCTACGCC GAGGGACGGT TCGCCAAGGA CCCGGCCGGG CTGCGGCATT TCTACGATCA GGTCCTGGAG CGGAAACACA ATCTCCGGCG GGACTTCGAG GGCTATTACG CAGATGTGCT GGGGGATCGG GGCTTCCGTT TCGACAAGCT GAGTACCGCC ACCGGCAAGA AAAAGCTGCT CTCCTCAACA GAGGAAGCCC TGGTCGAGGA GGCCCGCAAG CTCGGCTGGC AGGGCAAGAC GCTGCCCTTC GACAGCGGCG ACGTGGAAGA CCAGAACGCG CTGATCTTCA CCGAGAGCTT CAAGGGGAAG AAGCGCACCG TGGTCAAGAT GAAGATCCGG CCGGACACCG ACCGCCGCAT CGACGAGGTG CTGCGCAGGT ATGTGCAGAC GGCGGCCTGT GAAAAGGGAC AACCGCTGGC CGAGGACAGC TTCTTTCCGA CGATTCTGGA CGCCGTCAAA AACGTCAACT TCCACGTGGG CGACGGCAAG TACAACCGGA CCAAGATCGA CAAGGCCCTG CGCCTGCGGA AGAAACTGGA AACCCTGCAA AAGAGCGCCG ACCCCAAGGT CAAGGAGATG GCGGACCACT ATTTGAAATG GGTCAAGGAG ATCGAAGAGT CCGTCGACTG GGACCGGGCC ACCAACGGCG TATTCGAGCA GTACCTGCCC AAGCTCGACG CGCAGAAACC CAAGGAGAAA CCGCCGTTCA AGGTGGAACG CGGCAAGGTG ACCCACACCA AGCGCAGGAT CGGGTCCGGC ACCATTACCG TCGAGGCCGA CGACATCGAC AACCGGACGC TGTTCAATCA CAACTCCCGC ATGCAGGACG GGCACCAGTA CACCGTCACC TTCGAGGACG GCACCCGGGT CCGCTATCGC CCCTGGTCCG ACACCAACCT CTATGCCCAG CGCGGCGAGC TGGAAATGAT CCTGGATGGC GACGCCACCC CCGGACGGGT CGAGGCGATG CTGGAAAAGC TCGAACAGCT TGGGATCGAC ACCCGGGTGG CCACGGCGGA AAACGCGGAG CAGATGTACC TCGAAAAGCT CGCCTACATC CGCAAGAGCG ACAAAAGCGC CGATTTCAAA CGGCTGCAGA AATCCCTCGA TGACCGCAAC GCCACCACCA CCGAGCGGGT CCAGGCTCTG CGCGGCTATT GGCAAAAGGA ACTGGGTGTC CAGGACATCA CCCAGCTTTC CGGATACAAC CCGCTGGGCG AATACCAGGC GGGCTTTCTG GACCGCGACG CCAAGGGCGG ATACCGGCAC CAGTTCCGGT TCGACATCAC CGAGGAGGAC CTGGAAAAAC AGATGAAGGG CTACTCGCTG GTCCACGATC TGACCAACGG CGAAAGCATG TCCGGCTTTA TCGACTTGAT CATGGAAAAC AACGGAGCCA TGGTCAGCAC GGTCGAGAAA ATGCGCATGG GCGTGGCTCC GGGAGGAATG TCCCCGGTGG CCGACATGCA GACCGGCGGC GCGAGCTATT TCTTCACCCG AATCAAGAAG CAACCGGCCA GCGACGCCTC ACCGGCCCTC TACTTCAAGA AACAGATGCT GCGGCGCATG GACGCCATCA GCTATGACCA TGACGCCTAC GGCAAGGTGA TCGACGACTA CGTGCAGCGC AACCGGGGAG CCAGTATCGA TGATTGGAAG CGGTTCTCGC AGCGCCATGG CAACGAAACC ATCTTCAAAT ACTCGGTGAC GCTGCTGGAC AACATCGAGT TCATCGTGGC CAGAAGCGAC AACGAACGCC GGGAGATCAT CCAGAGTTTC ACCCGGCGTG GCATCAAGAA ACTGCCCGAC GGGCGCAAGG TGGAGGACAT CGTCCATACC TCGCAAAGCT GGAGCAAACG CAAACAATGA
|
Protein sequence | MPSDLKQRIQ AATLKSLTAR NRYNDQVTAQ LTQALKQAED EVARAILQYR SLGSLPDNKL AALKGLEKLQ LELDDTMKRL KREQTLVFRK TTKDSFKLGI QQGIGELADA ALPFYADLKP EGIDKLATKV FTIVDTNALD FMAQYNLTLA GDVHRELADG IKRTILNGVA TGKGADDIVR DMGKVIIDKD SFRQAGSRVF SKAQYRMEMI ARTEVLRAHN MGRLKFHERV GIQKLEWLAM EDERMCPVCG GLDGKTFPID KFPQQPAHPH CRCTNIVAWP MTVCGSEMAA KAAAQASQGD ACILPPHVLE GLADAQAKEN AKLKSAFENG DIADLGSLTV KQLQTLAKQN GVAIARTKAD FIKLLDLAEP GIDHGDLAGA ALSAKLKEHK IGLLRTKEEL VELLGLKQAE LKQAKLLAAQ MAKIPPTEGL EGMTAQQLKE MAKENGISLN MTKQETIELL DKLEPGVDHS GLMGKELAAA KQKHGIGILK NKQQLVEALQ KKAGADMAES VKKKAVDEAK QKLILKQKTA LEDAAKAVVV PDTPTGYKDF LDAIAKAEQA VSGGTDLPQE LLAAHSKEIA LKKQLFQDQV GKLKSAELKT LAKETKVQYW QWANKDELTT LFTETDPAKI KAVQASIDTK HAAWAEKHGG KKKTVPAKPA TPKTELPKPA PQPSPVKPPE PKIGKKGAEF ATVDASWQQK GLPSKFKKTG KAAVGGAHEK EFWTDENGDK WLFKPHGRKD DEFIAFGEEA VYKIGRLIDP HSIEVRTIQL NGRTGSIQKW RTDLREDFDF RNILPQDLTT IELEQIQREH VVDWLIANHD GHSKQFIRAR DGRVYGIDKG QAFKFLGQDK LSLDYHPNGV CGEEEPFYNK VFRAAKEGKV RVDPNATLRY IQEVEKIADE DYLDLLRPYA EGRFAKDPAG LRHFYDQVLE RKHNLRRDFE GYYADVLGDR GFRFDKLSTA TGKKKLLSST EEALVEEARK LGWQGKTLPF DSGDVEDQNA LIFTESFKGK KRTVVKMKIR PDTDRRIDEV LRRYVQTAAC EKGQPLAEDS FFPTILDAVK NVNFHVGDGK YNRTKIDKAL RLRKKLETLQ KSADPKVKEM ADHYLKWVKE IEESVDWDRA TNGVFEQYLP KLDAQKPKEK PPFKVERGKV THTKRRIGSG TITVEADDID NRTLFNHNSR MQDGHQYTVT FEDGTRVRYR PWSDTNLYAQ RGELEMILDG DATPGRVEAM LEKLEQLGID TRVATAENAE QMYLEKLAYI RKSDKSADFK RLQKSLDDRN ATTTERVQAL RGYWQKELGV QDITQLSGYN PLGEYQAGFL DRDAKGGYRH QFRFDITEED LEKQMKGYSL VHDLTNGESM SGFIDLIMEN NGAMVSTVEK MRMGVAPGGM SPVADMQTGG ASYFFTRIKK QPASDASPAL YFKKQMLRRM DAISYDHDAY GKVIDDYVQR NRGASIDDWK RFSQRHGNET IFKYSVTLLD NIEFIVARSD NERREIIQSF TRRGIKKLPD GRKVEDIVHT SQSWSKRKQ
|
| |