Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_0221 |
Symbol | |
ID | 4461297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 261500 |
End bp | 263107 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639700976 |
Product | hypothetical protein |
Protein accession | YP_844357 |
Protein GI | 116747670 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00277701 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0427708 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAC CGAGAAACAA CGAGGACGGT TTGTTGGCGG CACATCTTGC GAAACGACTC ACCGCATACT CTGCGGCCGC CGGCCTGGTG CTGGCGATGA GCCCTCAGGC GGAATCGGCC ATCCATTGGT TCCATCCCTC CACACCTTTT GAGGTCGATC ATTCGCATAC CAAGACCATC GACATGGACA ACGATGGAAC CGGCGATATT TCTTTTGGCT TGCAGCCGAT GGGGAAGACG TTCTGGAGTC TGGCCGTCAA CCGAAAGAAT GCGGCCGGCC GCTTCGACGG CACGGGCCGC CTGGTGCGAA GGCTGGAGAA AGGCAGCATG GTGACCCCCT CCCTGGATGC CGATGCGGCC GTATTGATGA ACGAGCGCTA TTTCACTACA TGCTATCCCG CCAATGGCAC GAACACGTCC ATCATTTGCA GATGGTGGAC TGTCTGGGGG GACGGGGCGT TCAACAAGGA GGCGAAGGGC TATGTCGGAG TGAAATTCCG GGGAGGGAGC GGCAAGGACT ACAACGGCTG GGTCCACTTC ACCGGAGGCA ATGATCCGGC GGACCTCTCC GGAACGATCG ACGAGTGGGC CTACGAGGAT TCCGGCGGGG CCATCAAGGC GGGAGCCGTC ACCAGCCAGG CGGCGCACGC CTTTCTGTCC GATCTCAGCG GCAACGGCTC CCCCGAGCTT GCACTGCTCC AGGTGACGCA GGGTTCCTCC TTCGGGGGCT CCGTACTGAT CCGGGACATC GCCACCGGAG ATGAAATCAG GCAGGTGCAG TGCCTTGGGC CCCTGTTTCG TCCCGTGAGC ATGGCCAGAA TCAGTAACGT CGACGGCAAT GGCAACCCCG CCCTGGCGAT TCTGGGCGTG CGGGAAAATG CCGCGAAAGA AATCCTTGCG GTCCGGGTTG AGCTGAGAAA CCCCGCCAGC GGTGAGCTCA TAAGAAACAT CGGCTTTTCC AAGGGCTGCA GACCCGTGGC TCTGAGGTGG CTCAACAGGG ATATGAACGG CAGCGGCGTC GGGGAGCTGG CGGTGCTCGG CACCCATCCG GACGGCGGCA GATCGATGGT GGAGATCAGG GACCTGCTCA CGGGCGCCCT GGTGAAGAGA ATAACCGTGG GGCCGGACAG TACGGTCGAG TCCATCGGGC TGTCGTACTC GAACGATCTG AACGGCAACG GCTCCCAGGA GCTGGTGGTG CTTCAGAGGA AGGCCGCCAC GCAGGTCAGC CACCTTAAAG TGGTGGACAC CGGGACCGGC AGGACGATCA AGACCATCCC ATGCCTCAGT GAATCGTGGA TTCCCGTCGA CATCGCATCG ATCGGGGATC TGAACGGGAA CGGTAGCCTC GAACAGGGCG TGCTTGCGAG GAGCGCAACC GGGGAGAAAG TGGTGGTGAA GATCATGGAC CTGCGCGCGG GCAAAGCGCT CCGAAACGTA TCGTTCAACA GCGCGTTTTT GCCCCAAAAG CTTGCCGCGG GCGACATCAA CGGTGACGGC GTCTATGATC TGGGCGTTCT CGGGATCAAT CTGCTCGATG CAACGACGGG AATCGAAATC CGGGATCCCG TCACCAGGAG TCTGATAAAG ACGATCCTGA TACCTTGA
|
Protein sequence | MTKPRNNEDG LLAAHLAKRL TAYSAAAGLV LAMSPQAESA IHWFHPSTPF EVDHSHTKTI DMDNDGTGDI SFGLQPMGKT FWSLAVNRKN AAGRFDGTGR LVRRLEKGSM VTPSLDADAA VLMNERYFTT CYPANGTNTS IICRWWTVWG DGAFNKEAKG YVGVKFRGGS GKDYNGWVHF TGGNDPADLS GTIDEWAYED SGGAIKAGAV TSQAAHAFLS DLSGNGSPEL ALLQVTQGSS FGGSVLIRDI ATGDEIRQVQ CLGPLFRPVS MARISNVDGN GNPALAILGV RENAAKEILA VRVELRNPAS GELIRNIGFS KGCRPVALRW LNRDMNGSGV GELAVLGTHP DGGRSMVEIR DLLTGALVKR ITVGPDSTVE SIGLSYSNDL NGNGSQELVV LQRKAATQVS HLKVVDTGTG RTIKTIPCLS ESWIPVDIAS IGDLNGNGSL EQGVLARSAT GEKVVVKIMD LRAGKALRNV SFNSAFLPQK LAAGDINGDG VYDLGVLGIN LLDATTGIEI RDPVTRSLIK TILIP
|
| |