Gene Information |
Locus tag | Sfum_0664 |
Symbol | |
ID | 4460198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 798509 |
End bp | 800599 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639701420 |
Product | peptidase U32 |
Protein accession | YP_844798 |
Protein GI | 116748111 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
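The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 798509
end_bp = 800599

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)      # 2091, matching "Gene Length"
print(remainder)        # 0, the length is a whole number of codons
print(protein_length)   # 696, matching "Protein Length"
```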
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0317421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.243634 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCGG GCAAGAGGAA CGACAAGCTC CCCGAGCTTC TCGCTCCCGC GGGGCACATG GAGGGTTTCT TCGCGGCGGT GGATAACGGC GCGGATTCGA TTTATCTCGG GCTCAAACAG TTGAGTGCAC GGGCATCGGC CGTGAATTTC TCGCTCGATG AGCTTGCGCG GCTCGTTCCA TATGCCCACG CGAGACGGGT CTCGGTAATC GTCGCCCTGA ACAGCCTGGT GACCGCTTCC GAATTGTCCG GAATACCGGA TGTGCTGCAA TCGCTGTCGG ATCTTCAGGT GGATGCGCTG ATCGTCCAGG ATCCGGGGGT CATTCACCTG GCGCGCAGGC TGTTCCCGTC CTTGAGACTG CACGCCAGCA CGCTCACGAC GATTCACAAC AGCGCCGGGG TGCGTCAAAT GCAGCGGATG GGAGTCTCGA GGGTGGTCCT GGCCCGCGAA TTGACCCTGG ACGAGATCGC CCGGATTCGC GCTTCGACCG GCGCGGAACT GGAAGTCTTC GTCCACGGGG CGTTGTGTTT TTCGTACTCC GGGATGTGCC TGACGAGCAG TTTCCGAGGC GGTCACAGCG GACTGCGGGG GCGGTGCGTT CAGCCTTGCC GGCTGCAGTT CAGGCAGGGC AGGAAAACCG GCTACTTTCT CTCCTGCAAT GACTTCAGCG CGCTGCCTTT CGTGCCCGAG CTGAAGAAAC TGGGACTCGC GGCTTTAAAG ATAGAAGGAC GCATGAAGCC CGCGGATTAC GTCAGCCGGG TCGTCAAGGC CTACAGGCTT GTCCTCGACG CCGACGAAGC TCACACATCG GAGGCGGTCC GGGAGGGGAT GCAGTGGATC GCCGATGCTC CGTCGAGACG GCTGGTTTCG GGCTTCTTGG AAAAAAACTT CAACGAGACC GTTTTGACTC CTCACAGATC GGGGTCCAGC GGTCTCTGGA TCGGCACGGT AAAGAAGATC TCGCAGGATG GAGCGCATGT TTCGCTGAGG CGCGGGCTGA AGGCGGGCGA TCGGTTGCGC CCGGAATCCA AGGAAGGGAA AGAAGAGCCC ATCTTCAGAG TGACGGGGAT TGCCGCGCTC GCCGGCACGC GGCTGGCTGC GGCTGAGCCG GGAGATGTCG TGGTCGTCAG CGGCAGGGGA GGGCTCAAGG CAGGCGACCG GTTGTTTCGA ATCGCCTCAG AAAGCGCCGG GGCTCACACG ACTGTCCCGA ACCGGCTCAA GAATGTGAAG CCCCTGACCT ACCGAAAGAC TTTTTCCGAC CCGGCCGGGA GGGCGGGAGC TCTCGGCGAA CGCCCGGAAG CCGGTGGTGT CGAACGACAG GAGGAGCGTC TCACCATCAA GACGGGAACA CTCCATCACA TGGCCGAAGC ATTCAAGAGC AACCCCTGGC GGGTCCTGCT CACGGCTACC AGGACGAATC TCGAGCGCAT GGCGAGGCAG CGGCTGCCGA AGGCTCAAAA GAGCCGCTTC GTGTGGTCGT TGCCGCCGTT GATTTCCGAA AAGTCCGACC TCGAATACTA CGCGCGCGCC GTCAACTGGT TTGTCGACAA GGGCTTTCTG CTCTGGGAAT TGAACAACTG GGGTCACTTC GATCTGTTCG CCGAAAGGCA GGGTCCGGGA TTCATCGCCG GTCCCCGGTT CAACCTGCGC AACGGGGCGG CGCTGGCTGC CATGGCGGAG GAAGGTTGCC GCTGGAGTGT GCTCTCCCCC GAAATCACCT TCAAGGAATT GCAGCTCCTG GGCCGCATGC CCCTCCCCTC GCAGCCGATC GTCAGTGTGT ATTCATGGCC GCCGCTCTTC 
ACTTCAAGGC TGACTCCGAA ACTCGAAGAA GGCAAACCCT TCTCGACCCT GCGCGGGGAC GTCTACATGA TGGAAAGGAA ATCCGACGGC ACCCGCGTCT ACGCGGATCA TCCCGTTTGC TGGTTCGACA GGTTGTCCCG TTTGCGCGAG GCGGGGCACC GCCATTTTCT CATCGATATC AGTGAAGCAC CGGACGAGAA GGCCCTGGAG TTTCACAAGC TCATTCTCGG GTTCAGACGA TCCAGATTCG AAGGGACCTG CCATCCGTTC AACTTCGACC GCGAACCCTG A
|
Protein sequence | MESGKRNDKL PELLAPAGHM EGFFAAVDNG ADSIYLGLKQ LSARASAVNF SLDELARLVP YAHARRVSVI VALNSLVTAS ELSGIPDVLQ SLSDLQVDAL IVQDPGVIHL ARRLFPSLRL HASTLTTIHN SAGVRQMQRM GVSRVVLARE LTLDEIARIR ASTGAELEVF VHGALCFSYS GMCLTSSFRG GHSGLRGRCV QPCRLQFRQG RKTGYFLSCN DFSALPFVPE LKKLGLAALK IEGRMKPADY VSRVVKAYRL VLDADEAHTS EAVREGMQWI ADAPSRRLVS GFLEKNFNET VLTPHRSGSS GLWIGTVKKI SQDGAHVSLR RGLKAGDRLR PESKEGKEEP IFRVTGIAAL AGTRLAAAEP GDVVVVSGRG GLKAGDRLFR IASESAGAHT TVPNRLKNVK PLTYRKTFSD PAGRAGALGE RPEAGGVERQ EERLTIKTGT LHHMAEAFKS NPWRVLLTAT RTNLERMARQ RLPKAQKSRF VWSLPPLISE KSDLEYYARA VNWFVDKGFL LWELNNWGHF DLFAERQGPG FIAGPRFNLR NGAALAAMAE EGCRWSVLSP EITFKELQLL GRMPLPSQPI VSVYSWPPLF TSRLTPKLEE GKPFSTLRGD VYMMERKSDG TRVYADHPVC WFDRLSRLRE AGHRHFLIDI SEAPDEKALE FHKLILGFRR SRFEGTCHPF NFDREP
|
| |
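The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own pipeline: it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first 30 and last 21 bases of the gene sequence are used here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 21 bases of the gene sequence shown in the record.
head = "ATGGAATCGG GCAAGAGGAA CGACAAGCTC"
tail = "AACTTCGACC GCGAACCCTG A"

print(translate(head))  # MESGKRNDKL, the first ten residues of the protein
print(translate(tail))  # NFDREP, the last six residues (TGA is the stop codon)
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.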