Gene Information |
Locus tag | Sfum_1438 |
Symbol | |
ID | 4460490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 1782620 |
End bp | 1785853 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639702207 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_845564 |
Protein GI | 116748877 |
COG category | [F] Nucleotide transport and metabolism; [G] Carbohydrate transport and metabolism; [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only |
COG ID | [COG0537] Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases; [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
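The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal consistency check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1782620
end_bp = 1785853
gene_length_bp = 3234
protein_length_aa = 1077

# Assumption: Start bp and End bp are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions, the 3234 bp span corresponds to 1078 codons, of which 1077 encode amino acids.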
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.977948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0058487 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGAGA AATCGTGTCC ATTCTGTTCT CCCGATGATC ATCGCGTGTT CTACGAGGGC AAGCTGGTAT TTGCACTGTG GGATGGCTTC CCGGTTGCTC CGGGGCATGC GCTCCTCGTT CCCAAACGGC ATGTGGCAAC CTGGTTCGAC GCAACCCCTG AAGAGCGGGA GGAACTCCTG GCTGCTACCG AGGTGGTTCG AGACCACATC CTCTCCCACT TCGAGCCGGC TGGCTTCAAC CTGGGAATCA ATATCGGCAC GGCAGCCGGT CAAACCGTTT TCCATCTTCA CCTGCATGTG ATCCCAAGAT ACCACGGCGA CGTTGAGGAC CCCACCGGGG GTGTTCGCAA TGTCATCCCG GCAAAGGCAA ACTACCTGAA AAGACCCGTC GTGGAGATGG ATGCCATTGC GAAGAGTGCT CACCGGGGGA TAACCGGCCC GTTTCTCGAC TTGGGTCGAT CAGAGCATTT GGTTTCTGGT GGAGATGACC CATTGCTGCC TCATCTCCAG ACCCACCTCG ATGTCTCCGA ACGTGCGGAT ATGGCCGTAG CTTTCATTCT CGAAAGCGGC ATAGACCTTC TCGAGGAACA CCTGCAGTCG CTTCTGGATC GCGGGGGGCG AGTTCGGCTT CTTACCGGGG ACTACCTGGG AATCACCGAC CCCATCGCGT TGTCCCGGCT CCTGGATCTG TGCGAAGGGG CTACAGGCCA ATTCGAACTC CGGGTCTTCG AAAGTAGGGG TGTCAGCTTT CACCCCAAAG CCTATATTTT TTACATCGAC ACTGTACCAG GGGGTGGCGT TGCATATGTT GGGAGCTCCA ACCTGAGCGG GCAGGCGCTT GCGGAGGGTA TTGAATGGAA CTACCGCATA GTCCCTGCCG AAAGTCACAA AGGATTCCGG GCGGTCACCG CCGCCTTCGA AACACTGTTT TGCCACGCAC GAACCAGGCC CGTCGACCAT GTGTGGATCC GACAATACCA GGAATCGCGA AGGCCGCCCT GCATCAACGT GCCGCCCTCG AACCAGGGCG AGGTCATTTT GGAAGAGGCC CAGAGGAACT GGGGAAGGCT CCCCGTTTCG GTGCCGGCTG AAATTGTCCC CGAGGATGCC AAACCTTCGC CTGAACCCCA CAGAATTCAA AGGGAGGCTC TCGAAGCGCT GGCGAGGACC CGTGCCGAAG GCAACCAGGC CGGGCTCGTG GTCCTGGCGA CTGGCCTTGG GAAAACGTGG CTCGCAGCCT TCGACAGCGT TCGGTTTGGT GCCGGGAAAG TGTTGTTTGT TGCCCATCGG GAGGAAATCC TTCGGCAGGC GCTGAAGACG TTCCGCCGGA TTCGACCGTT TGCCGCCCTT GGGCTTTACA ACGGCACGGA AAAGCTGCCT GGCGCGGATG TGCTTTTCGC CTCGATCCAG ACGTTGGGGC GCACCAATCA TCTCCGGTCT TTTGACCGTA ATCTGTTCGA CTACGTTGTG ATCGACGAAT TCCACCATGC CTCTGCCCGG AGCTATCAAA AGCTGATCGA CCACTTTCAG CCGGCTTTCC TTCTTGGGCT TACCGCAACG CCGGAAAGGA CTGACGGCGG GGACCTCCTG GCGCTCTGCG CAGAGAATCT CGTATACCGC TGCGATCTTG TTGAGGGAAT CCGCGACGGG CTCCTCTGCC CTTTTCGGTA CTTCGGTGTC CCGGATGAAG TGGACTACAG CAACATTCCG TGGCGTAGCG GTCGCTTTAA CGAGGAAGCT CTGACCCGGG CCGTGGCCAC CAAAAGGCGT GCCGTGAACG CCCTCGAGCA GTACCGGAGC 
CGTGCCGGCA GGCGCACCCT CGCCTTCTGC TGCTCCCAGC GTCACGCCGA TTTCATGGCG GATTATTTCA CCAATCAGGG GGTTCGGGCC GCTTCCGTTC ACTCCGGACC GCAGACGGCG CCGAGAGGAG GTACCCTCAA GCGCCTTACC GACGGTGAAC TCGATGTGAT ATTCGCTGTC GACATGTTCA ATGAGGGTGT CGACCTCCCT AATGTGGACA CCGTCATGAT GCTCCGGCCA ACGGAATCTC GAATCCTGTG GCTCCAGCAG TTCGGTCGCG GGCTTCGAGT TGCCGAGGGT AAGGACTACC TCCGTGTCAT CGACTACATC GGCAATCACC GATCCTTTCT GCTCAAGCCC CGGACGCTGC TGGGTCTTGG GAGGGGTGAC AATGAACTTT CGATGGCGCT GGAGGCAGTC CAAAATGGAA CATTCGATCT TCCACCGGGA TGCGAGGTCA CCTATGAACT GGAAGCCGTA GACGTCCTGC GGGCGCTCAT CAAGATGCCG CCGGCGCACG AAGCTCTGCG ATTCTGGTAT GAGGACTTTA AAGAGAACAA CGACAGGCGG CCGACAGCTG TCGAGGCCTT TCACGAAGGC TACAACCCCC GCTCTGCCAG GAAGAACTAT GGCTCTTGGT TTCGGTTTCT CGCTGCAATG GGCGATCTTT CGGCAGACGA ACGCAATGTG TTGGAGAATT CTCGGGCAGG AGACTTCCTC GATATCCTGG AAACCACCCC CATGTCCAGG AGCTTCAAGA TGCTCACGCT CATGGCCATG TTGAACGAGG GGCAGTTGCC CGGGCAAATC AAGGTTGAAA GGCTTGTGGA AGGCTTCGCA CAGCTGGCCC TTCGCTCGGC GACACTCAGA ACTGATGTCG GAGTGGACCT CCAGGACCGA GCAGCCCTTC GCCGATACCT GGAGAGGAAT CCCATCGAGG CATGGACCGG AGGGAGGGGC ACCAAGCGTA AGTCTTTCTT TTCCTACCGT GACGGAGTTT TCGGGGCCAC TTTTACGTTA GAGGATGCTA TTCGCGCCGC GTTTCATGAA CTTGCACGGG AAATCATAGA CTGGAGGCTG GCCGAATACC TCCAGCGACC TTCTCTGAGC AACCCCGTGA CGAGCGATAT CCTATGCAGG GTCAGTCACG CAAGTGGGCG GCCGATTCTG TTTCTTCCAG ACAGAAAGAT CCATCCACGC ATTCCGCTCG GATGGACAGA AGTGCTTGTG GAAGGGGAGC CTTGTGAAGC CAACTTTGTC AAGGTTGCGG TCAATGTGTT ACGTCGAAGA GACTCGGCAA AAAACGTCCT TTCAGAGATA CTGAGGCGAT GGTTTGGTTC CGACGCTGGT CGCCCCGGCA CCAATTTTCA AGTCGTGTTC CGAGAGTCGG ATGGAAGCCT GATTCTTGAA CCCATCGAGT CGGAGCGCTT TTGA
|
Protein sequence | MTEKSCPFCS PDDHRVFYEG KLVFALWDGF PVAPGHALLV PKRHVATWFD ATPEEREELL AATEVVRDHI LSHFEPAGFN LGINIGTAAG QTVFHLHLHV IPRYHGDVED PTGGVRNVIP AKANYLKRPV VEMDAIAKSA HRGITGPFLD LGRSEHLVSG GDDPLLPHLQ THLDVSERAD MAVAFILESG IDLLEEHLQS LLDRGGRVRL LTGDYLGITD PIALSRLLDL CEGATGQFEL RVFESRGVSF HPKAYIFYID TVPGGGVAYV GSSNLSGQAL AEGIEWNYRI VPAESHKGFR AVTAAFETLF CHARTRPVDH VWIRQYQESR RPPCINVPPS NQGEVILEEA QRNWGRLPVS VPAEIVPEDA KPSPEPHRIQ REALEALART RAEGNQAGLV VLATGLGKTW LAAFDSVRFG AGKVLFVAHR EEILRQALKT FRRIRPFAAL GLYNGTEKLP GADVLFASIQ TLGRTNHLRS FDRNLFDYVV IDEFHHASAR SYQKLIDHFQ PAFLLGLTAT PERTDGGDLL ALCAENLVYR CDLVEGIRDG LLCPFRYFGV PDEVDYSNIP WRSGRFNEEA LTRAVATKRR AVNALEQYRS RAGRRTLAFC CSQRHADFMA DYFTNQGVRA ASVHSGPQTA PRGGTLKRLT DGELDVIFAV DMFNEGVDLP NVDTVMMLRP TESRILWLQQ FGRGLRVAEG KDYLRVIDYI GNHRSFLLKP RTLLGLGRGD NELSMALEAV QNGTFDLPPG CEVTYELEAV DVLRALIKMP PAHEALRFWY EDFKENNDRR PTAVEAFHEG YNPRSARKNY GSWFRFLAAM GDLSADERNV LENSRAGDFL DILETTPMSR SFKMLTLMAM LNEGQLPGQI KVERLVEGFA QLALRSATLR TDVGVDLQDR AALRRYLERN PIEAWTGGRG TKRKSFFSYR DGVFGATFTL EDAIRAAFHE LAREIIDWRL AEYLQRPSLS NPVTSDILCR VSHASGRPIL FLPDRKIHPR IPLGWTEVLV EGEPCEANFV KVAVNVLRRR DSAKNVLSEI LRRWFGSDAG RPGTNFQVVF RESDGSLILE PIESERF
|
| |
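The relationship between the Gene sequence and Protein sequence fields can be illustrated with a small translation sketch. It assumes that the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA, even though the Strand field is "-") and that translation table 11 assigns amino acids to codons exactly as the standard genetic code does. The `translate` helper is hypothetical and written only for this example; only the first and last blocks of the sequence are used.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: translation table 11 uses the same codon-to-amino-acid
# assignments as the standard code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the Gene sequence field.
# The tail is in frame because the full gene length (3234 bp) is a multiple of 3.
gene_start = "ATGACCGAGA AATCGTGTCC ATTCTGTTCT"
gene_end = "CCCATCGAGT CGGAGCGCTT TTGA"

print(translate(gene_start))  # MTEKSCPFCS
print(translate(gene_end))    # PIESERF
```

Both fragments agree with the start (MTEKSCPFCS) and end (PIESERF) of the Protein sequence field.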