Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_3750 |
Symbol | |
ID | 4457917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 4581978 |
End bp | 4583927 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639704524 |
Product | hypothetical protein |
Protein accession | YP_847855 |
Protein GI | 116751168 |
COG category | [R] General function prediction only [S] Function unknown |
COG ID | [COG0673] Predicted dehydrogenases and related proteins [COG3494] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.162114 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATT TTGACCTGAA GACCGGTGAT CCGATTCGTG AAAGAATCGG ATTGATCGCG GGGAGCGGGC AGTTTCCGCT CCTTTTTGCT CACGCGGCGC GGCAGGCCGG AGTGGAGGTC GTGGCCCTCG GGTTCCAGGG GGAAACCGAT CCCGCATTGT CCAAGTATGT CAATGAGTTT CACATGTTGA AACTTGGGCA GCTGAGCCGG ATGATCAACG CTTTTCGCAG GGCGGGAATC ACCCGGGCCG CCATGGCGGG CGCGATCAAC AAAACGAAAC TCTACACGCG CATAAGGCCT GACTGGCGTG CGGTGAAGTT TCTGAACAGC CTGAGGAACA AGAAGGACGA CTCCCTGCTC AGAGCGTTTG CGGACGAGCT CGAAAGCGAA GGAATAAAGA TCGAGCCTTC CACCATGTTC CTGCCGTCTC TGCTCGCTCC CGAAGGCATC CTGACCCGGC GCAAGCCCAA TCATCGGGAG CAGGTCGACA TCGTTTTCGG CTGGAAGATG GCCAAGGTGA TCGGTGGGCT GGACATCGGG CAGTGCCTGG TCGTCAAGAA CCAGGCGGTC CTGGCGGTGG AAGGGATCGA CGGGACCGAT TCCACCATCC TGAGGGGCGG GCGGCTGTGC CGGGAAGGCG CCATCATCGT GAAGGTGAGC AAGCCGATCC AGGATCTGCG TTTCGACGTG CCGGCCGTCG GGTATGACAC CATCGAGACG ATGAAGCGGG TGAAAGCCCG TGTTCTGGCC GTCGAGGCGG GCAAGACGCT CATGTTCGAC CGGGAGAAGA TGATCGATGC GGCGGACGCG GCGGGGATTT CGATCCTGGT GCAGCCCGAC GAGACGCTTC CCGAAGTCCA TGAGCGGCCG TCCAAGATCG GACTGCCGGA TTTCATGGGC CGGGGTTCCG CGCAATCGAT TCTTCGAACG GATCAGACCC CGGTCCTGAT CCGGCAGTCG AGGCCCGATG CACTGCGGGT GGGTGTTGTG GGCGTGGGGT ATCTGGGGAG ATTTCATGCC CACAAGTATG CCCGGTTGCC CAAAGCGCGA CTCATGGGCG TGGTCGACAT CAACGCCGAA CAGGCGGCCA GGGTGGCCGC CGAGGTGGAC GCCGCCCCCC TGACCGACTA TCGCGAACTG ATGGGGAAAG TGGACTGCGT CAGCGTCGTA ACGCCCACTC CCAATCACTT CGCCATCGCC CGGGATTTTC TCGCCGCCGG AGTCCACGTG CTGCTCGAAA AACCCATGAC CACGACGATC GAGGAAGCGG ACCGGTTGAT CGCACTGGCG GAACAAAAGG GCTGCGTGCT GCAGGTGGGA CACCTCGAAC GGTTCAATTC CGCTTTCACG GCCATCCTTC CCAGGCTCAA GAACCCCATG TTCCTGGAGT CCACTCGGCT GGCCCTCTTC AATGAACGCG GCCTCGAAGT GGATGTCATC CTCGACCTCA TGATCCACGA CATCGACATC GCGCTTCACA TCGTCCAGTC TCCCCTGAAG CAGATCCGTG CCTCGGGGGT TTCCGTCCTC ACGCAACTGC CCGATATCGC CAACGTGCGC ATGGAATTTG CCAATGGGGC CGTGGCGAAT CTCATTGCGA GCCGAATCTC GCTGAAGAAC CTGCGCAAGC TCCGCATCTT CCAGGAAGAC TGCTATGTCG CGGCGGACTA CGGCCGGAAG CGGGCCTACG CGGTGTACCG GGAAGAGGAG GCCGATGATT CGGGCTATCC TGAAATGTCC ATGGAAGAGA TCGAGATCGA TGAACGCGAC GCGCTCGAAG AGGAAGTGAA CGCTTTCCTG CAAGCGGTCA AAACCGGCGG CAGGCCCCGA GTCGACGGGA ACGACGGGCG CCGTGCGCTG GACGTGGCAT TGAGCATTTC GCGGCAAATC GACGAGCAGA TGCGGGAGAA ATGGGAGCTT TCGCGCAACG TCCTCCAAGG GTCTTTCTGA
|
Protein sequence | MADFDLKTGD PIRERIGLIA GSGQFPLLFA HAARQAGVEV VALGFQGETD PALSKYVNEF HMLKLGQLSR MINAFRRAGI TRAAMAGAIN KTKLYTRIRP DWRAVKFLNS LRNKKDDSLL RAFADELESE GIKIEPSTMF LPSLLAPEGI LTRRKPNHRE QVDIVFGWKM AKVIGGLDIG QCLVVKNQAV LAVEGIDGTD STILRGGRLC REGAIIVKVS KPIQDLRFDV PAVGYDTIET MKRVKARVLA VEAGKTLMFD REKMIDAADA AGISILVQPD ETLPEVHERP SKIGLPDFMG RGSAQSILRT DQTPVLIRQS RPDALRVGVV GVGYLGRFHA HKYARLPKAR LMGVVDINAE QAARVAAEVD AAPLTDYREL MGKVDCVSVV TPTPNHFAIA RDFLAAGVHV LLEKPMTTTI EEADRLIALA EQKGCVLQVG HLERFNSAFT AILPRLKNPM FLESTRLALF NERGLEVDVI LDLMIHDIDI ALHIVQSPLK QIRASGVSVL TQLPDIANVR MEFANGAVAN LIASRISLKN LRKLRIFQED CYVAADYGRK RAYAVYREEE ADDSGYPEMS MEEIEIDERD ALEEEVNAFL QAVKTGGRPR VDGNDGRRAL DVALSISRQI DEQMREKWEL SRNVLQGSF
|
| |