Gene Information |
Locus tag | Sfum_3972 |
Symbol | |
ID | 4457686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 4825168 |
End bp | 4826802 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639704743 |
Product | hypothetical protein |
Protein accession | YP_848074 |
Protein GI | 116751387 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
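The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the record does not state either convention, although both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4825168
END_BP = 4826802
GENE_LENGTH_BP = 1635
PROTEIN_LENGTH_AA = 544


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_bp = span_length(START_BP, END_BP)
computed_aa = expected_protein_length(computed_bp)

print("gene length matches:", computed_bp == GENE_LENGTH_BP)
print("protein length matches:", computed_aa == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4826802 - 4825168 + 1 gives 1635 bp, and 1635 / 3 - 1 gives 544 aa, in agreement with the table.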
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.220987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.634668 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAGT ACCAGCCGGA ACTGACCGAT GAATACTTCG ATTACCTGGC CCAGCGTTTC CCGGTGATGT GCGCCAGCGA CGAATTCCAC TTTCTTCCCC GGGCGGTGGC GGCGAGCCGG CACTACGACC GGCTGGACGA CTTCGACCGC GCCGGAATCG AGGAATGCAT CGCCACGATC GAGGAGTATC GGCGCAAATT CGGCCGTCTG GCCGCCGTCG AGAGCGATCC GGAAAAGCTC ACCGACCTCG AGATGCTCGA TTCCAGTATT GCCGGCATCC TGATCGAACT GGACTCCAAG GAGGTGTGGC GGCACGATCC CCTCCTCTAC CTCAAAATCG CCTTCATCGG GCTCGATCAT TCGTTGACCA AACCTACCCC GAGCCACGAT GAACGGCTGC AAAGAACCCG GGCGCGCCTC CAGGCGATTC CCCGCCTGCT CCAACAAGCT GCGGAAAATA TCGACCGGGT GGCCCACAGC ACACTCCAGG CCGCGCTCGC CATGCTCGTC GATTGCCGGA GCTATCTTTC CGAAACCGCC GATACGTATG TGCCCCAGGA TTCCGGACGA TTCACGGCCG CTTTGGAGAA GACCCGATCC GCCCTGGACG CGTTCGGGAC ATTTCTTGGC GCGGTCAGGT CCGTTCCGGA CCGCGAGTTT GCCTTCCGGA GCCTGGAGGT CACTCTGCGG GACAGATTCC GATCGCATCG GAGCCTGTCC GAAGTGGATG AGATCGCCGT CGAGGAATGG CGCGAGAACC TTTGCCGCCT GGAACGCCTG CAAAGGAAGA TCGACCCCGG GAAATCCTGG AGAGAGCTGT ATCATGGATA CTTGCCGGAA GACGCGGCCG GTCGTGACAC CCTGGTGCTC TATGACCGCG AGATGAAAAG GCTGAAGCGT TTTTTCGAAT CGCACGGCTT CAGCGAGGTC ATGCCGGCGG GGTGGCCGGT GGTCTGTCGG ACCCCCACTT ACCTGCAATC CGTGCGGGGC TCGGCATCCT TCAGCGCCGC CTTTTCGAGG GACGGGACCG AAGAGGATTA CTTCTACATT ACGACCCAGG CCTCCGGACA GGGGCGCGAC CGATCAGCGG AGCTTCTCAG GAAGCGGTTT CACCGGGAGT ATAAATTCCT CGCCGCCCAT GAGACCTTCC CGGGGCACTA CCTGCTGGAT TCCACCCGGA GAATGCTCGA CAATCCAGTG CGAAGCCAGA TCGAATCGGC ACTTTTCTAC GAGGGCTGGG CCTATTACGT GGAATCACTC CTGACCGAAT ACGGGTACGC CGATCATCCC CTGGACCTGC TGGTCGACTG CAAACGGAGA CTCTGGAGGG CCGCGCGTTG CCGGATCGAC ATCGGGCTTC ATTCTGGGAG ACTGAGTCCC GACGACGCCC TGGGTCTGTT GACAACCGCC GGGTTCGGCA TGGAGGAAGC CTGCGGACAA ATCAGTCGCT TTCGGCTGAA TCCCGGCTAT CAACTCTGCT ACAGCCTGGG TCGATTCGAG ATCATGCGAC TCAGAGAGAC TTACGCGGGC GGAATGGGGC ACGACGGGTT TCACCGACTG ATGCTCGAAG GCGGTGAACC GCCGTTTCAT GCGATCGAGA AGCGGCTCCG GAATGCGGTG AATGCCACCC CATAG
|
Protein sequence | MPEYQPELTD EYFDYLAQRF PVMCASDEFH FLPRAVAASR HYDRLDDFDR AGIEECIATI EEYRRKFGRL AAVESDPEKL TDLEMLDSSI AGILIELDSK EVWRHDPLLY LKIAFIGLDH SLTKPTPSHD ERLQRTRARL QAIPRLLQQA AENIDRVAHS TLQAALAMLV DCRSYLSETA DTYVPQDSGR FTAALEKTRS ALDAFGTFLG AVRSVPDREF AFRSLEVTLR DRFRSHRSLS EVDEIAVEEW RENLCRLERL QRKIDPGKSW RELYHGYLPE DAAGRDTLVL YDREMKRLKR FFESHGFSEV MPAGWPVVCR TPTYLQSVRG SASFSAAFSR DGTEEDYFYI TTQASGQGRD RSAELLRKRF HREYKFLAAH ETFPGHYLLD STRRMLDNPV RSQIESALFY EGWAYYVESL LTEYGYADHP LDLLVDCKRR LWRAARCRID IGLHSGRLSP DDALGLLTTA GFGMEEACGQ ISRFRLNPGY QLCYSLGRFE IMRLRETYAG GMGHDGFHRL MLEGGEPPFH AIEKRLRNAV NATP
|
| |
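The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon assignments of translation table 11 (the same amino acid assignments as the standard code) and translates the first and last codons of the gene sequence as listed, which is already in coding orientation even though the gene lies on the minus strand. The helper name `translate` is hypothetical, and only short excerpts of the sequence are used here.

```python
from itertools import product

# Codon assignments for translation table 11, in the conventional
# T, C, A, G ordering of first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    # The record prints the sequence in blocks of 10, so drop whitespace first.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 15 bases of the gene sequence in the record.
first_block = "ATGCCGGAGT ACCAGCCGGA ACTGACCGAT"
last_block = "AATGCCACCC CATAG"

print(translate(first_block))
print(translate(last_block))
```

The two excerpts correspond to the start (`MPEYQPELTD`) and end (`NATP`, followed by the stop codon) of the listed protein sequence.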