Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_1716 |
Symbol | |
ID | 4459968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 2096942 |
End bp | 2099482 |
Gene Length | 2541 bp |
Protein Length | 846 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639702485 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_845838 |
Protein GI | 116749151 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.116279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.782454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGTG AGCCGGTCCG CCCGACCCGC CCCCTGTTGT GGCTCACGGT GTTCTTTGCG CTGGGCATCG CCGCGGAGCG TGTCTGCCCC CTTTCGTTGC CCTCGGCGTT CATTATCGGT TTCCCCGCCT TGCTTCTCCT CTGTCTTGCC TTTCTCTGTT CGAAATCCCG GGAGCGCTCA CGCGCATCGG TGCCGGTCTC CCTGGTGTTG TTTCTTGCCC TGGGTTTCGC CCTCGCGCGC GTCGCCGCCC CCCTTGTCCC CTGTCCGCCG GGATTGGAGC GGGTACTGGA TCGTCCCCAT ACGTTGTTCA TCGCCGACAT TGCATCGCCT CCCGATTTTT TCCCGGACAA GATCCGGCTC ACCCTGAGGC TGCGCTCGGC CTTGCTCGAT GATGAAAACG TTCCGCTCGA CGCAGGAGTG CTCCTCTCCG TGGCTCGAAC CGGCGTGGAA CGCGCGGCAT GGGTCCCGGG CGATCGCGTG CTGGCGCGCC TCACGCTCAG GCGCTTTCAC AATTTCAACA ATCCCGGCGG GTATGACTAT GTCATGAGGC AGGCGGAGCG CGGAATCCAC GCGCGGGCCG GGTCCCCTGA CGATCGTTTC CTGGTTCGGC TTGCGCCCGG GAACGGGCTC CCGGGCTGCT CCGTCTTCCG GGCGGTAAGA AGCACGGTGG ACCGGTTCAG GCAGGAAAGC CTGTTCTGGC TTCGCAAGCA CTTCGATCCC GACACGGCCG CGTTCTATGC GGCCCTGCTG CTCGGTTACC AGCAGTTGAT TTCCGCTGAT TGGAAAGAAG ATTTGAACCG GGTGGGGATC ACTCACCTCC TTTCGATTTC GGGAATGCAT CTCGGTCTGG TCAGCATGTT CACCTTCTGG ATTTGCCGGA AGTTTATCCG GCGTCTTTGC CCCGCCGCGC TTCACCGTCT GAGCGACAAA CAACTCGCCC TCTGGCCCGC CCTGGCGGCG GCTTTGCTTT ACGCTTTCCT CGCGGGATTC GGCGTGCCCC CGATCTGGCG CTCCCTCTTG ATGCTCACAC TCGGTCTGTG GGCCTCCTTT TGCTACCGGC ATGCGGATTC TCTCACCATC CTGGCCGCGA CGGCACTGAT CATCCTGGTC ATCGACCCCG CCGTTCTCTG GCAGGTTTCC TTCCAACTCA CCTATGCGTG CATGGTGGCG CTGTTTGTCA TCTACCCGAG ACTGCAGCGC TGCCGGCTGG CCGCGATCCA CCCCGTGTTC GGCGGCGACC GCATGGCCGG AAAGATCACA CGACCCTTCG AGGAGGCCTT CCACGTTTCG GTGGCGGTCA ACATCCTGGT TCTCCCGCTC ACGGTCTTTT ACTTCCAGGG TTTTTCCCTG GCCGGATTCA TTGCCAACAT CATCCTGGTG CCCCTGGTCG GGTTCCTGGT GCTCCCGTCC GGCCTGCTCG GGCTGGGACT GCTGGCGTTC AACGAATCCC TCGCCGCGCT CCTGCTGCAA TTCGGGGCCT GGTGGGTGAC CCTGGCGCAT CACCTCATCC GATGGTTCAG CGACCTTTCC TGGGCGTACT TCCATGTGGG CCCTTTTTCG TTGCTCGGCA TGGCCGCGTG TTACCTGGCA CTGTTCGTTC TGCTGAGCCC GTGGCACCGG AAGCGGAAGG GCGCCGCGTT GTGCGCCCTC GCGCTTTTTA TGGCGGCGGA TTCCGCCGTT GCCCATTGGC GTACCGCCGA GGATCAGCGG GATCATCTGC TGGTGGATTT CATCGATGTG GGACAGGGCA CCTCCACCCT GGTTCGATTT CCCGACGGTG CAGCCATGCT CGTCGACGGT GGCGGTTTCT TCGACGATTC CTACGACATC GGCCGCGCGG TGGTGGCCCC CTTCCTGTGG CGGAGCGGCA TCCGCAGACT GGACTACGTG GTGCTGTCCC ACGATCATCC GGACCATCGC AACGGGCTTC GCTTCGTCAT GCAGCGGTTC GAGGTGGGCT GTCTGTGGAC CGGCGCCCTC GCGTGCCGGC CGGGCGACCG GGAATCCATC GAGACGATCG CCGCGTTGCG CGGCATCCCC GTCCGGCTCA CGCACGAAAT CCCCGGGGAA TGCGCCATCG GGCGCTGCCG CGTCAAGCTC CTTCACCCCA GTCAAAAATA TCTTGAAACT GAGTGGAATG GGGATATAAA CAATGCTTCG CTCGTATTGA AGATCGACTA TGGCGAAACG GGAGTCATTT TGCCCGGAGA CATCGGTCAG TCGGTGGAAC GGGTGATTTT CGGGACCGGC ACCGCCTGGG GGAACGTGGT ACTGGGATCT CCGCACCACG GCAGCGACCG TTCGAACAGT CCCTTCATGG TGGAACGATT GAAACCTCGG GCGGTCGTCG TTTCGTGCGG GGCCGACAAC CGGTTCGGAT TTCCCTCCCC CGCGGTTCTC GAGACGTACC GGAAGCACGG AGTGGCCGTC TACCGGACCG ACCGGCACGG CGCGGTGCAT GCGGTTTCGG ACGGAACCCG CTGGGAATTC TCGACGTTCA TGGGCAGTTC CGGAGGATCG TCGCTCTCCG GCGGATACTG A
|
Protein sequence | MSREPVRPTR PLLWLTVFFA LGIAAERVCP LSLPSAFIIG FPALLLLCLA FLCSKSRERS RASVPVSLVL FLALGFALAR VAAPLVPCPP GLERVLDRPH TLFIADIASP PDFFPDKIRL TLRLRSALLD DENVPLDAGV LLSVARTGVE RAAWVPGDRV LARLTLRRFH NFNNPGGYDY VMRQAERGIH ARAGSPDDRF LVRLAPGNGL PGCSVFRAVR STVDRFRQES LFWLRKHFDP DTAAFYAALL LGYQQLISAD WKEDLNRVGI THLLSISGMH LGLVSMFTFW ICRKFIRRLC PAALHRLSDK QLALWPALAA ALLYAFLAGF GVPPIWRSLL MLTLGLWASF CYRHADSLTI LAATALIILV IDPAVLWQVS FQLTYACMVA LFVIYPRLQR CRLAAIHPVF GGDRMAGKIT RPFEEAFHVS VAVNILVLPL TVFYFQGFSL AGFIANIILV PLVGFLVLPS GLLGLGLLAF NESLAALLLQ FGAWWVTLAH HLIRWFSDLS WAYFHVGPFS LLGMAACYLA LFVLLSPWHR KRKGAALCAL ALFMAADSAV AHWRTAEDQR DHLLVDFIDV GQGTSTLVRF PDGAAMLVDG GGFFDDSYDI GRAVVAPFLW RSGIRRLDYV VLSHDHPDHR NGLRFVMQRF EVGCLWTGAL ACRPGDRESI ETIAALRGIP VRLTHEIPGE CAIGRCRVKL LHPSQKYLET EWNGDINNAS LVLKIDYGET GVILPGDIGQ SVERVIFGTG TAWGNVVLGS PHHGSDRSNS PFMVERLKPR AVVVSCGADN RFGFPSPAVL ETYRKHGVAV YRTDRHGAVH AVSDGTRWEF STFMGSSGGS SLSGGY
|
| |