Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_1878 |
Symbol | |
ID | 4459819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 2295895 |
End bp | 2297271 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639702645 |
Product | SPP1 family phage head morphogenesis protein |
Protein accession | YP_845998 |
Protein GI | 116749311 |
COG category | [S] Function unknown |
COG ID | [COG2369] Uncharacterized protein, homolog of phage Mu protein gp30 |
TIGRFAM ID | [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.524046 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCTT TCCCGGAACT GCTCCCGTTC ACGGAAGCCA TCGAATTCTT CAACGCCAAG AACATCGTGG TCTCCCCCGA TTCCTGGCGG GACGTCTGGG CGTCCGAACA CGTTCACGCC TTTACCGTGG CCCGGGTCAC CGCGATCGAT GTCCTCGAGG ACATCCGCAA GGCGGTGGGC AAAGCCGTCG CGGACGGCAC TTCCATTCAG CAGTTCAAAT CCGGGCTGTC CCGGCTGCTG GCCGCAAAAG GCTGGTTCAG CGAAAAACGG GACAGGCCGC CGGGGGCTCT GACCGGCCCG CGCCTTGAAA CCATCTACCG CACGAACCTG CAGTCAGCCT ACCAGGCCGG GCGCTTCAAA CAACTCGTCG AGACGGCCCA CGTGCGTCCC TACTGGATGT ACGATGCCGT CGGCGACCAT CGCACGAGGC CGCTCCACGC CGCTCTCAAC GGCAAGGTAT ATCGTTTCGA CCACCCGTTC TGGAACGCCT GGTACCCCCC GAACGGGTTC AACTGCAGGT GTACCGTGCG AAGCCTTTCG GAAGACCGGT TCGAACGGAT GAAGCTGCCG CTCGAAAACG AGGCGCCACC CCACGGTCCG GACCCGGGAT TCGATTTCAA CCCCGGAATG GTCCGCTGGC AGCCGGATCT TGCGCGGTAC GGCCCGGAAG CTCGCGGGCT CATCACCCGG GACATGATCG GGCGCCCCTG GGACATCCCC TCCCTCGAGG AGGATCTCAC GCGGTTGCGC GACGGATTCG CCCAGACCGG GATCACGGCC TCAAGAAACC CGCTTACTAT TGCGCCGCTC CCCGGAGAGG GCGTAACGGG GTGGAGCAAC CCCAGGACGG GGGAAATCGC TCTTCCCGAA GGCGTCTATG ATCGAATCCG CTGGATCCTC AATATCGGCG CGATCCGGAC GGAAGCGGAA ATCGACGCGT TCCGCCTCCT GATTCACGAA TTCGGACATC ACCTCGGACA TCCCGTCGAC ATCAAGCGGT ACAACTCCGA TCCGGACTAC TCGGCGATCA AGGAAGCCGT AAACGACCTG TGGGCCAGGC ACTATCTGGC CGAAGCCGCC CGGGCGCTCA ACCTGGAATG CAGGCTCTCG CCCTTCACCG AGTCGCGCTC GCGGGCACCG GGCACCTACT CCGCCTGGGT CGAACACCTG CGGGCCGTTC TGCGCAAGCT CGACATCGAC GAAACCGATG AGAAGAAGCT CATCACGGAG CTGAACCTCT CCGAGAACGC CGAAAACCTC TCCGAGCGCT TCTGGGCCAT GGTTCACGAG AAGAAGCCGG ACATCGCCGC GGACCGGCCG TTCGGGGAGT TGATCCGGCT GTTCTGGCTG TGGGAGCAGC TGATGGACGA GCTGTGA
|
Protein sequence | MKAFPELLPF TEAIEFFNAK NIVVSPDSWR DVWASEHVHA FTVARVTAID VLEDIRKAVG KAVADGTSIQ QFKSGLSRLL AAKGWFSEKR DRPPGALTGP RLETIYRTNL QSAYQAGRFK QLVETAHVRP YWMYDAVGDH RTRPLHAALN GKVYRFDHPF WNAWYPPNGF NCRCTVRSLS EDRFERMKLP LENEAPPHGP DPGFDFNPGM VRWQPDLARY GPEARGLITR DMIGRPWDIP SLEEDLTRLR DGFAQTGITA SRNPLTIAPL PGEGVTGWSN PRTGEIALPE GVYDRIRWIL NIGAIRTEAE IDAFRLLIHE FGHHLGHPVD IKRYNSDPDY SAIKEAVNDL WARHYLAEAA RALNLECRLS PFTESRSRAP GTYSAWVEHL RAVLRKLDID ETDEKKLITE LNLSENAENL SERFWAMVHE KKPDIAADRP FGELIRLFWL WEQLMDEL
|
| |