Gene Information |
Locus tag | Sfum_3414 |
Symbol | |
ID | 4458285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 4179355 |
End bp | 4180455 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639704186 |
Product | peptidase M42 family protein |
Protein accession | YP_847522 |
Protein GI | 116750835 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
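The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 4179355
END_BP = 4180455


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1101
print(protein_length(length_bp))  # 366
```

Both values agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields of the record.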
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.774179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0330149 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAACG ATCCTGTAAC AGCCGCCGCA ATTCACTTGC TGAAGAGCCT CGCCGAAGCT CCCGGCGCGC CGGGTCATGA AGACGCCGTG CGCCGTATCT TCCGGACGGA AGTGGGGGGG GACACCACCA CGGACAAAAC CGGGAGCATC ATTTACACGA AAAAAGGAAC TTCCGAAACC CCTCGGATCA TGCTTGCCGC GCACATGGAT GAAGTCGGGT TCGTGGTGCA GAGCGTCACG CGGGAAGGAC TGATCCGGTT CCTCCCGTTG GGCGGCTGGT GGCCGCATAC GATCCTGGCC AAACGGGTGA GGATAATCAC CCGCAACAAC ACGGAGATCA TCGGCGTCGT GGGGGCCAAG CCACCCCATT TCCTGACCGA CGCCGAACGT GAAAAAGTGA TGAAAATCGA AGATATGTTC ATCGATGTGG GAGCGCGCGA CGCGGTGGAT GTCCGGGATC GTTTCGGGAT CGAGGTGGGA GACAGCATCG TTCCCGACAG CAGCTTCACC GTGCTGCACG ATCCCGACGT CTTCCTGTGC AAGGCGTTCG ACAACCGGGT GGGGATGGCC GTGGTCATAC ATGCCGCCGC CATGCTGATG TCGATGACGC ACCACAACAC GGTTTGCGCG GTGGGAACGG TCCAGGAGGA AGTCGGGGTG AGGGGCGCCC AGACCGCGGC GCACGCCGTC AATCCCGACG CGGCCATCAT CCTGGAAGGC ACGCCGGCGG ACGATCTGCC AGGAACGACG GAAGAGGAGC GACAGGGAAA ACTGCGGGGA GGCGTCCAGA TCAGGCTCAT GGACCCGTCG GCGATCATGA ATCGCAAGTT CAGCCGATAT GCCGTCGAAC TGGCCCGAGA ACACGGAATC GCGCACCAGG TCGCAGTCCG CCGAAGCGGA TCGACCGACG CCCGTGCCGT TCATCTGACG CGGGAAGGCG TGCCGACCAT CGTTCTCGGC GTACCCTCCC GTTACATCCA TACCCACAAC GGGCTCGTTC ACATGGAGGA CTACCTGAGC GCACTCGACC TGGTCATGAA ATTGCTCGAA CGGCTCGACG AGGATGCCGT TCGGTCGTTT GTTACCTACG ACGACAAGTA G
|
Protein sequence | MLNDPVTAAA IHLLKSLAEA PGAPGHEDAV RRIFRTEVGG DTTTDKTGSI IYTKKGTSET PRIMLAAHMD EVGFVVQSVT REGLIRFLPL GGWWPHTILA KRVRIITRNN TEIIGVVGAK PPHFLTDAER EKVMKIEDMF IDVGARDAVD VRDRFGIEVG DSIVPDSSFT VLHDPDVFLC KAFDNRVGMA VVIHAAAMLM SMTHHNTVCA VGTVQEEVGV RGAQTAAHAV NPDAAIILEG TPADDLPGTT EEERQGKLRG GVQIRLMDPS AIMNRKFSRY AVELAREHGI AHQVAVRRSG STDARAVHLT REGVPTIVLG VPSRYIHTHN GLVHMEDYLS ALDLVMKLLE RLDEDAVRSF VTYDDK
|
| |
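The gene sequence above is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for sense and stop codons. Only the first 30 and last 12 bases of the record are used, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 12 bases, copied from the Gene sequence field above.
first_30 = "ATGCTGAACG ATCCTGTAAC AGCCGCCGCA"
last_12 = "GACGACAAGT AG"

print(translate(first_30))  # MLNDPVTAAA
print(translate(last_12))   # DDK*
```

The two results match the first ten residues (MLNDPVTAAA) and the last three residues (DDK) of the protein sequence above, with the trailing `*` corresponding to the terminal TAG stop codon.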