Gene Information |
Locus tag | Sfum_3983 |
Symbol | |
ID | 4457697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 4834596 |
End bp | 4837445 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639704754 |
Product | TPR repeat-containing protein |
Protein accession | YP_848085 |
Protein GI | 116751398 |
COG category | [N] Cell motility; [S] Function unknown; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF; [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
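The coordinate and length fields above are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4834596, 4837445)  # Start bp, End bp from the record
    print(length, protein_length(length))   # prints: 2850 949
```

Under those assumptions the computed values agree with the Gene Length (2850 bp) and Protein Length (949 aa) fields.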
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.808044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.937608 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGAC GGGTTCCCGA CGAAGTGAAG CCCGGCATTA TAGGAAGGTT CTTCCGGGCT GTTTGCTTAT TGATCGTGTT CCTGCACGCA TTGCACGGAA CGGCCCTTGG TGATTGCGGC CCTCTGACGA AAGCCGCGGA CGACCGGTTT GCCGGGGCTG AAGATTCCTA TGCGGCGGGC AGGTATCGCG AAGCTTTGGA ATTGCTTGAC CAGGCGCTCT CAACGGACAT TTACTGCCGC CCCGAGAAGG CGCACTTGGA ACTGATCGGG CTGGCCGCCA CCTACGAGGC TCTTGGGGAC TACGCCATGG CCCTGGAAAC CCACCGCCGG GATCTCGAGC TTGCCATCCG GATCTTCGGG GCCGACAACC CCGATACGGC GGTGTCACAC AACAACCTGG GGAGAATGCA CCGCTACATG GGACAGTACC CCGAAGCACT CGCGCATCTC GAGAAGGCTT TGGTCATACT CATCCGCAGC AGTGGACCGG ACCAGCCCGA CACCGCCGTA TCTTACAATA ACGCGGCTTC GGTCTACGAA GAGGCGGGCA ATTACAAAAA GGCGCTCGAA TACTACGAGA AATCGCTGAG CATCAGGCTG AAGGTCTTCG GACCGGAACA CCCGGCCACG GCAACCGCCT TCAACAACCT GGGCGGGATC TACAAAGCCA TGGGCCAGTA CGAAAAAGCC CTCGAAAACC TCAACAAAGC CTTGCCCGTC TATATAAAAA CCTACGGTCC CGAACACTCC GGCACAGCCA TCACGTATAA CAACATAGCA TCGGTTCACA AGGCCCTGGG CCAATACCCC GAAGCTCTCG AACACTACCG AAAGGCGCTC GAAATCGATC TCAAGACCAG CGGTCCGGAT CACCCGGCCA CCGCGGTGAC TTACAACAAC ATTGCCTCCA CCCACGAGAG CATGGGGGAG TTCGAAAAGG CACTGCCGTA TTTCGACAAG GCACTGTCGA TCCAGTTGAG CAGGCTTGGC TCCGATCATC CCGCCACGGC GAGGACCTAC AACAATCTGG GGTCGGTTTA CCAGTCCACC GGCGAATGCT CCAGGGCGAT CGAATACTAC CGAAAGGCGC TGCCGGCCGC GCTGCGCTCC GGCGACGCCC AGCTTCAATG GAATGTGCTC GCCGGAATGA GCTCGACCTT TTCAAAGCTC GGCAATCCGA ACGTCGCCAT CCTCTTCGGA AAACAGGCGA TCAATGCAAT TCAGGCCATG CGGTCCTCCA TGGCGAGCAT GGACATGTCT CTCCGGACAA GCTTCATCCG CGACAAGGAA AACGTTTACA AGCGCCTTGC GACACTGTTG ACGAACATGG GCAGAATTCC TGAAGCGCAG CAGGTGATGG ACCTTCTGAA GGAAGAGGAG TTTTTCGAGT TTGTGCGAGG AATGGAGGTC CCGAAGGCCG GCGAAGGGAT CGGCTACAGC GAGGCCGAAA AACCCTGGGT TGAAAAATTC GCCATGATCA GCGATCAAAT TGCGCAGAGA GGCGCGGAGT ATGCGGAAAT CAGGAGAAAA GCCAAGAGCA GCGAGCTCAC CGAGGAGGAC AGGAAGAAGC AGAAGGAAAT CCTGTCCGAC CTGGAGACAG CGAGCCGCAC CTTCGAGGCT TTCATCGACA GGATCTCCGC GGAATTTGAG AAAACCGCCG GGGAAAAAGC CGCGGAGCTG ACGGAAAAAA GCCTGCGCGA CCTGAAGTCC TTCCAGTCCC TCCTGAGGTA CCTGGGAAAC GGGACGGTCG TCGTCCACTA TCTCGTGACC GAAAAGAAAC TGATCGTCAT ACTGACGACT 
TCGGAAACAC AACTGGCAAG AGAGACAAAC GTCAGCGCCA TCGAGCTCAA TCACATGATC TCGGATCTCA GAAAGGATCT CCTGTGCCCG TCGAAAGACC CCCGGCCCGC TTCCAGGAAG TTGTTCACGG CTGTCCTGGG CCCGATTTCC CGCGACCTGG AGCAGGCGGG CGCCACGACA CTTCTGCTTT CGCTTGATGG AGCACTCAGA TACATTCCTT TCTCCTGTCT TTACGACGGC CGGAAGTACC TGATCGAGCG CTATACGCCC GTGGTTTTTA CGCCCGCCAG CCGTGACAGG CTGAGGGATT CCCCTTCCCT ATCCTGGCAA GTTGCCGGGT TCGGGGTCGC GGACAAGGTG AGCGACCATT TCAGGGCCCT GCCTTCGGTT CGATATGAAT TGCACAGTAT TGTCGGGCAG GAAGGCAAAG GCGGCGTTCT CCCGGGTATC GTGAAACTCG ACGGCGAGTT CACGGTTGAC GCCATGCGCG AGGCGCTCCT TCGCGACTAC CCCGTGGTGC ACATTGCCAG CCATTTCGTC TTCCAACCGA CAGCCGAGGA ATCCTTTCTG TTGATGGGCG ACGGGTCTCA GCTGACTTTG GACAAAATCA AGCACCAGGG CTTCCGGTTT GACGAGGTGG ACCTGCTGAC CCTTTCGGCA TGTGAAACCG GACTGGGCGC ACGGGATGCC GACGGGCGGG AAGTCGAGGG TTTCGGAGTG CTTGCACAGC GCAGCGGGGC AAAAGGGGTT ATCGCGTCCC TATGGTCCGT TGCCGACAGG AGTACAGGAC TGCTCATGAA GAAAATGTAC GAAGCGCGCG CGGGAAACCC CTCCACAAAC AAGGCCGAGT GCCTGCGCCG GGCTCAGCTG GAATTGCTCC GGGGAGAAGA GGAGAAAGAG CCATCCAGCG AGCCGCCATC CCAGCAAGAG GCCGGGCAGT CGCGCGATTG CGGCGGGAGG AACAGCCCGT CCTTTGAAAC ACCTCCGTCG GCACCTTACG CCCACCCTTA TTTCTGGGCG CCCTTCGTTT TGATGGGGAA TTTCAGGTAG
Protein sequence | MNGRVPDEVK PGIIGRFFRA VCLLIVFLHA LHGTALGDCG PLTKAADDRF AGAEDSYAAG RYREALELLD QALSTDIYCR PEKAHLELIG LAATYEALGD YAMALETHRR DLELAIRIFG ADNPDTAVSH NNLGRMHRYM GQYPEALAHL EKALVILIRS SGPDQPDTAV SYNNAASVYE EAGNYKKALE YYEKSLSIRL KVFGPEHPAT ATAFNNLGGI YKAMGQYEKA LENLNKALPV YIKTYGPEHS GTAITYNNIA SVHKALGQYP EALEHYRKAL EIDLKTSGPD HPATAVTYNN IASTHESMGE FEKALPYFDK ALSIQLSRLG SDHPATARTY NNLGSVYQST GECSRAIEYY RKALPAALRS GDAQLQWNVL AGMSSTFSKL GNPNVAILFG KQAINAIQAM RSSMASMDMS LRTSFIRDKE NVYKRLATLL TNMGRIPEAQ QVMDLLKEEE FFEFVRGMEV PKAGEGIGYS EAEKPWVEKF AMISDQIAQR GAEYAEIRRK AKSSELTEED RKKQKEILSD LETASRTFEA FIDRISAEFE KTAGEKAAEL TEKSLRDLKS FQSLLRYLGN GTVVVHYLVT EKKLIVILTT SETQLARETN VSAIELNHMI SDLRKDLLCP SKDPRPASRK LFTAVLGPIS RDLEQAGATT LLLSLDGALR YIPFSCLYDG RKYLIERYTP VVFTPASRDR LRDSPSLSWQ VAGFGVADKV SDHFRALPSV RYELHSIVGQ EGKGGVLPGI VKLDGEFTVD AMREALLRDY PVVHIASHFV FQPTAEESFL LMGDGSQLTL DKIKHQGFRF DEVDLLTLSA CETGLGARDA DGREVEGFGV LAQRSGAKGV IASLWSVADR STGLLMKKMY EARAGNPSTN KAECLRRAQL ELLRGEEEKE PSSEPPSQQE AGQSRDCGGR NSPSFETPPS APYAHPYFWA PFVLMGNFR
|
| |
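The sequences above are shown in space-separated blocks of 10. The gene sequence appears to be in coding orientation already: it begins with ATG and ends with TAG even though the record lists the minus strand. The following Python sketch, offered under those assumptions, strips the whitespace, computes the GC fraction, and translates the nucleotides. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in the start codons it permits).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Trailing bases that do not fill a codon are ignored; codons with
    unrecognized characters are rendered as 'X'.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    head = "ATGAATGGAC GGGTTCCCGA"  # first two blocks of the gene sequence
    print(translate(head))          # prints: MNGRVP
    print(gc_fraction(head))
```

The first two blocks translate to MNGRVP, which matches the start of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.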