Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_1889 |
Symbol | |
ID | 4459795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 2303861 |
End bp | 2306881 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639702656 |
Product | hypothetical protein |
Protein accession | YP_846009 |
Protein GI | 116749322 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.405118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.121372 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAAA ACAGGATCCA GATTGTGATC GTTGCCGATT CCAGTAAGGC CGTGAAGAAC ATACGCATGG TCTCCGACGA ATTCGATCGG TTCAAATCCT CCGCCTGCAA CTCCATTGCG GGCACAAATA GCGCGATTTC GTCGATGGGC GCCGGCCTCG CCTCGGCCAT CAGCCTTTTG CAGGGCTTCG CTGCCGCTAT GGCGGCGGCG AAAGCCGCTG AGATCGGTCT TGGCTTCAAC AAAGAGGTCG AGGATGCGCG CCTCGGCATA GCGACCCTCC TGGCGGCACA AGGCCAATTT GTCGATCAAC AGGGCCAACA ATTGCAGGGG CAGCAAAGGA TAAACGCGGC CATGGCCTAT TCGGCCGATC TCACCAGGCA GCTCCAGGTC GACAACCTCA AGACGACGGC CACGTTTCAG CAATTGCTCA AGGCGTTTCA GCAGACGCTC TCGCCGGGTT TGTCCGAGGG ACTCTCCGTC GACCAGGTGC GACAATACAC CCTGGCGATG GTGCAGGCCG CCTCCGCCAT GCAAATCCCT CTCGACATGA TGGCGGAAGA GACCCGGTCC CTGCTCAAGG GGACGATCAC TCCGCGCAAT ACTCTTATCG CGACCGCCCT CGGCATCACA CCGGAGGATA TTCGAAAACT CCAGGGGGAC GCGGAAGGCC TGTTTACCTT CATCATGGGC AAGCTGCAGG CGTTCCAGGA TTTCGGCCAG GTCACCCAGA CGACTTATTC GGGGCTCCTC AGCAATGCCC AGGATGCCCT GGCAAACGTT CTGGGCAAAG CCACCGAACC ATTTTTCGAG AGCCTCAAAG CGAGCCTCAA GCGATTCACC GACTATGCGC TCCAGGTCGA TGCGGTCACC GGGAATTTGA AACTCAACCC GGAACTGGTG AACGCCTTCG AGCTCCTGAA CGACACTCTG CGCATGGCCA TTGCCCTTGC GGAAAGCCTT GCAAAAACGC TCGGGGCCGT GGGCATCGCA TACAAGGCGA TGAAGGAAGC TCAAAAGGCG GCCGCCCTTC AGAATACTCC AGCCGGATTT GCCGGTCCGG AGCTGGCTGA TTTTACCGAC GTGCAGCGCT TGGAGCGCGT GAAGCAGCTC ATGGCGAAAA TCGTCGAAGA CCAGCAAACC ATAGCCAACT GGAAAGCCGG AGGCATTGGC GGGTCGATAT TGGGCGCCCC TCAAATCGGC CTGGCGAATC TTGATATCTC GAATTCGGTC ACCTCCCTTG AGAACTTCCG CGCTCAAATG GTTGCGGCCG CCCAGAACAC CGAGGAGCTC GACGCCTTCC TCAAAAGCCT CAAGCAGTCT TCCGACGCGG TTGCCGCGGC ACAGGAAACG GCAGCCGGGA ACATCGGCAG CGTGAGATAT GAGGCGGCCA GAGCCGCAAT GGACCTGGAC TCGTTCAAGA AGAAGCTCGA AGAAGTCGCC CGCATAAATC TCACCGGCCT GAGCGACGAT CTCAGCAAGT CCATTGAGGG ATTGAAGGCT AAGATCGCGG TCGCTCTTCA AGGCGGGGAC CCCAGGCAGC AGGCGATCGC CGCCGCGTAT ACGGAGAGAA TTCGAAAACA GGCGACAGCC TGGGAACTGG CTCTCAAGGA AAATGCCGCA CCGGAGGTTT TTTCCGCCAT ATCGGCACAA AAACAACAGG CCGAAGAAGA GGCGTATTGG GGCAGATATC TGGCCGGCGC CGAAGCCGTC CGCAAAACCA CGGCCCGTGG CGCCGCCTCC GGCGCATCCC CCATCGACAT CGACAAGATC GACAAGGACG TGCTTCAGTT CAGGGAGCGC ATGCAAAAGC TCTACTCCGA GCTCGAAGAC CTCGACTCCG AGTACCGCAT CGCACAGCTC GAACAATCCG GCCGCAACTA CGACGCCGAA GCCGAGCGGA TCGACCGCCA GGCCGCAAAG CGCAAGGAAG CCTTCGCCAA GGAGGTGGCC GATGCAGAGC AGGCCTATCT CGAAATGGAG CAAAAACTCT CCGGCAGCCG GGGCGGAACT GCCGAGGCGT GGGCGCAGCT TGCGGATCTC AAGACGAAAT ACGACGCCTT GAAGAAAGCC GCCCAGGAAT ACGGCGACAA GGTCGACCGC AATCTGCAAC TCACCAAGGA CATGAAGAAG GCCGAAGACG ACGCGAAACG CGCGGAGGAT CTCGCCCAAC TGAACCTCGA ATACGGCAAG CTGACCGGAA CCCTCCAGGA GCAGCTCGCC CTCCAGATTC TCCTGCTCCG GGCCGAAAAG GACCGGAAGA TCCTCGCGGC CGACCCGGAG CTCCGGGCGG CCTACGAGCG CCTCTATGCC GAGCAGGAAC GGCTGCTCCG CCTCCAGCGC GACGGCTCCT TTATGGACGG CCTGTCCGAA GGACTCAAGA AATGGCAGCG CGAAATGCCG ACGGCGTTCG GCCAGGGCCT TGAGGCAATC GAAATGCTGA AGCGCGGCAT CGATTCCGCG GCCGACGCCC TGGCCGAATT CACCATGACC GGCAAGATGG ACTTTTCGAG CTTTGCCGAC TCGATCATCA GGGACATCCT CCGCATGCAG TACAAGGCAC TGCTCACGCA AATGTTCGGC GGTGAAGGCG GTCTCTTCGA TTGGCTAAAG GGACTCTTTG GCGGCGGCGG TCCGACCACC GGCGCCTACG GGGTCGAGAC CTACGCCGGG ATCGGGCACC GGGGAGGACC GGCCGAAAGC CTGCCGGACC ACCGATACGT GGCCTCGTAT CTGTTCGCCC GCGCCCCGCG CCTCCACGAC GGGCTCGCCC CGGATGAGTT CCCGGCCGTC CTGCAGCGGG GCGAGCGGGT GCTCTCGCGC CGGGAAACCC GCGAATACGG CGCCGCAAAC CGCGCCCCGG AGGTCGTCGT CAACGTCCAA AACAAGACGA ACACGCCGGT CACGGCCGAT AAGACCCGCG CGGCCTTCGA CGGCAAACGC TACGTGGTCG ACGTGATCCT CGACGACTAC AGCCGCGGGG GAGACATCTG GAAGATGATA AGGGGGAACC GCAATGGCTG A
|
Protein sequence | MAENRIQIVI VADSSKAVKN IRMVSDEFDR FKSSACNSIA GTNSAISSMG AGLASAISLL QGFAAAMAAA KAAEIGLGFN KEVEDARLGI ATLLAAQGQF VDQQGQQLQG QQRINAAMAY SADLTRQLQV DNLKTTATFQ QLLKAFQQTL SPGLSEGLSV DQVRQYTLAM VQAASAMQIP LDMMAEETRS LLKGTITPRN TLIATALGIT PEDIRKLQGD AEGLFTFIMG KLQAFQDFGQ VTQTTYSGLL SNAQDALANV LGKATEPFFE SLKASLKRFT DYALQVDAVT GNLKLNPELV NAFELLNDTL RMAIALAESL AKTLGAVGIA YKAMKEAQKA AALQNTPAGF AGPELADFTD VQRLERVKQL MAKIVEDQQT IANWKAGGIG GSILGAPQIG LANLDISNSV TSLENFRAQM VAAAQNTEEL DAFLKSLKQS SDAVAAAQET AAGNIGSVRY EAARAAMDLD SFKKKLEEVA RINLTGLSDD LSKSIEGLKA KIAVALQGGD PRQQAIAAAY TERIRKQATA WELALKENAA PEVFSAISAQ KQQAEEEAYW GRYLAGAEAV RKTTARGAAS GASPIDIDKI DKDVLQFRER MQKLYSELED LDSEYRIAQL EQSGRNYDAE AERIDRQAAK RKEAFAKEVA DAEQAYLEME QKLSGSRGGT AEAWAQLADL KTKYDALKKA AQEYGDKVDR NLQLTKDMKK AEDDAKRAED LAQLNLEYGK LTGTLQEQLA LQILLLRAEK DRKILAADPE LRAAYERLYA EQERLLRLQR DGSFMDGLSE GLKKWQREMP TAFGQGLEAI EMLKRGIDSA ADALAEFTMT GKMDFSSFAD SIIRDILRMQ YKALLTQMFG GEGGLFDWLK GLFGGGGPTT GAYGVETYAG IGHRGGPAES LPDHRYVASY LFARAPRLHD GLAPDEFPAV LQRGERVLSR RETREYGAAN RAPEVVVNVQ NKTNTPVTAD KTRAAFDGKR YVVDVILDDY SRGGDIWKMI RGNRNG
|
| |