Gene Sfum_3983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3983 
Symbol 
ID4457697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4834596 
End bp4837445 
Gene Length2850 bp 
Protein Length949 aa 
Translation table11 
GC content58% 
IMG OID639704754 
ProductTPR repeat-containing protein 
Protein accessionYP_848085 
Protein GI116751398 
COG category[N] Cell motility
[S] Function unknown
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF
[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.808044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.937608 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGAC GGGTTCCCGA CGAAGTGAAG CCCGGCATTA TAGGAAGGTT CTTCCGGGCT 
GTTTGCTTAT TGATCGTGTT CCTGCACGCA TTGCACGGAA CGGCCCTTGG TGATTGCGGC
CCTCTGACGA AAGCCGCGGA CGACCGGTTT GCCGGGGCTG AAGATTCCTA TGCGGCGGGC
AGGTATCGCG AAGCTTTGGA ATTGCTTGAC CAGGCGCTCT CAACGGACAT TTACTGCCGC
CCCGAGAAGG CGCACTTGGA ACTGATCGGG CTGGCCGCCA CCTACGAGGC TCTTGGGGAC
TACGCCATGG CCCTGGAAAC CCACCGCCGG GATCTCGAGC TTGCCATCCG GATCTTCGGG
GCCGACAACC CCGATACGGC GGTGTCACAC AACAACCTGG GGAGAATGCA CCGCTACATG
GGACAGTACC CCGAAGCACT CGCGCATCTC GAGAAGGCTT TGGTCATACT CATCCGCAGC
AGTGGACCGG ACCAGCCCGA CACCGCCGTA TCTTACAATA ACGCGGCTTC GGTCTACGAA
GAGGCGGGCA ATTACAAAAA GGCGCTCGAA TACTACGAGA AATCGCTGAG CATCAGGCTG
AAGGTCTTCG GACCGGAACA CCCGGCCACG GCAACCGCCT TCAACAACCT GGGCGGGATC
TACAAAGCCA TGGGCCAGTA CGAAAAAGCC CTCGAAAACC TCAACAAAGC CTTGCCCGTC
TATATAAAAA CCTACGGTCC CGAACACTCC GGCACAGCCA TCACGTATAA CAACATAGCA
TCGGTTCACA AGGCCCTGGG CCAATACCCC GAAGCTCTCG AACACTACCG AAAGGCGCTC
GAAATCGATC TCAAGACCAG CGGTCCGGAT CACCCGGCCA CCGCGGTGAC TTACAACAAC
ATTGCCTCCA CCCACGAGAG CATGGGGGAG TTCGAAAAGG CACTGCCGTA TTTCGACAAG
GCACTGTCGA TCCAGTTGAG CAGGCTTGGC TCCGATCATC CCGCCACGGC GAGGACCTAC
AACAATCTGG GGTCGGTTTA CCAGTCCACC GGCGAATGCT CCAGGGCGAT CGAATACTAC
CGAAAGGCGC TGCCGGCCGC GCTGCGCTCC GGCGACGCCC AGCTTCAATG GAATGTGCTC
GCCGGAATGA GCTCGACCTT TTCAAAGCTC GGCAATCCGA ACGTCGCCAT CCTCTTCGGA
AAACAGGCGA TCAATGCAAT TCAGGCCATG CGGTCCTCCA TGGCGAGCAT GGACATGTCT
CTCCGGACAA GCTTCATCCG CGACAAGGAA AACGTTTACA AGCGCCTTGC GACACTGTTG
ACGAACATGG GCAGAATTCC TGAAGCGCAG CAGGTGATGG ACCTTCTGAA GGAAGAGGAG
TTTTTCGAGT TTGTGCGAGG AATGGAGGTC CCGAAGGCCG GCGAAGGGAT CGGCTACAGC
GAGGCCGAAA AACCCTGGGT TGAAAAATTC GCCATGATCA GCGATCAAAT TGCGCAGAGA
GGCGCGGAGT ATGCGGAAAT CAGGAGAAAA GCCAAGAGCA GCGAGCTCAC CGAGGAGGAC
AGGAAGAAGC AGAAGGAAAT CCTGTCCGAC CTGGAGACAG CGAGCCGCAC CTTCGAGGCT
TTCATCGACA GGATCTCCGC GGAATTTGAG AAAACCGCCG GGGAAAAAGC CGCGGAGCTG
ACGGAAAAAA GCCTGCGCGA CCTGAAGTCC TTCCAGTCCC TCCTGAGGTA CCTGGGAAAC
GGGACGGTCG TCGTCCACTA TCTCGTGACC GAAAAGAAAC TGATCGTCAT ACTGACGACT
TCGGAAACAC AACTGGCAAG AGAGACAAAC GTCAGCGCCA TCGAGCTCAA TCACATGATC
TCGGATCTCA GAAAGGATCT CCTGTGCCCG TCGAAAGACC CCCGGCCCGC TTCCAGGAAG
TTGTTCACGG CTGTCCTGGG CCCGATTTCC CGCGACCTGG AGCAGGCGGG CGCCACGACA
CTTCTGCTTT CGCTTGATGG AGCACTCAGA TACATTCCTT TCTCCTGTCT TTACGACGGC
CGGAAGTACC TGATCGAGCG CTATACGCCC GTGGTTTTTA CGCCCGCCAG CCGTGACAGG
CTGAGGGATT CCCCTTCCCT ATCCTGGCAA GTTGCCGGGT TCGGGGTCGC GGACAAGGTG
AGCGACCATT TCAGGGCCCT GCCTTCGGTT CGATATGAAT TGCACAGTAT TGTCGGGCAG
GAAGGCAAAG GCGGCGTTCT CCCGGGTATC GTGAAACTCG ACGGCGAGTT CACGGTTGAC
GCCATGCGCG AGGCGCTCCT TCGCGACTAC CCCGTGGTGC ACATTGCCAG CCATTTCGTC
TTCCAACCGA CAGCCGAGGA ATCCTTTCTG TTGATGGGCG ACGGGTCTCA GCTGACTTTG
GACAAAATCA AGCACCAGGG CTTCCGGTTT GACGAGGTGG ACCTGCTGAC CCTTTCGGCA
TGTGAAACCG GACTGGGCGC ACGGGATGCC GACGGGCGGG AAGTCGAGGG TTTCGGAGTG
CTTGCACAGC GCAGCGGGGC AAAAGGGGTT ATCGCGTCCC TATGGTCCGT TGCCGACAGG
AGTACAGGAC TGCTCATGAA GAAAATGTAC GAAGCGCGCG CGGGAAACCC CTCCACAAAC
AAGGCCGAGT GCCTGCGCCG GGCTCAGCTG GAATTGCTCC GGGGAGAAGA GGAGAAAGAG
CCATCCAGCG AGCCGCCATC CCAGCAAGAG GCCGGGCAGT CGCGCGATTG CGGCGGGAGG
AACAGCCCGT CCTTTGAAAC ACCTCCGTCG GCACCTTACG CCCACCCTTA TTTCTGGGCG
CCCTTCGTTT TGATGGGGAA TTTCAGGTAG
 
Protein sequence
MNGRVPDEVK PGIIGRFFRA VCLLIVFLHA LHGTALGDCG PLTKAADDRF AGAEDSYAAG 
RYREALELLD QALSTDIYCR PEKAHLELIG LAATYEALGD YAMALETHRR DLELAIRIFG
ADNPDTAVSH NNLGRMHRYM GQYPEALAHL EKALVILIRS SGPDQPDTAV SYNNAASVYE
EAGNYKKALE YYEKSLSIRL KVFGPEHPAT ATAFNNLGGI YKAMGQYEKA LENLNKALPV
YIKTYGPEHS GTAITYNNIA SVHKALGQYP EALEHYRKAL EIDLKTSGPD HPATAVTYNN
IASTHESMGE FEKALPYFDK ALSIQLSRLG SDHPATARTY NNLGSVYQST GECSRAIEYY
RKALPAALRS GDAQLQWNVL AGMSSTFSKL GNPNVAILFG KQAINAIQAM RSSMASMDMS
LRTSFIRDKE NVYKRLATLL TNMGRIPEAQ QVMDLLKEEE FFEFVRGMEV PKAGEGIGYS
EAEKPWVEKF AMISDQIAQR GAEYAEIRRK AKSSELTEED RKKQKEILSD LETASRTFEA
FIDRISAEFE KTAGEKAAEL TEKSLRDLKS FQSLLRYLGN GTVVVHYLVT EKKLIVILTT
SETQLARETN VSAIELNHMI SDLRKDLLCP SKDPRPASRK LFTAVLGPIS RDLEQAGATT
LLLSLDGALR YIPFSCLYDG RKYLIERYTP VVFTPASRDR LRDSPSLSWQ VAGFGVADKV
SDHFRALPSV RYELHSIVGQ EGKGGVLPGI VKLDGEFTVD AMREALLRDY PVVHIASHFV
FQPTAEESFL LMGDGSQLTL DKIKHQGFRF DEVDLLTLSA CETGLGARDA DGREVEGFGV
LAQRSGAKGV IASLWSVADR STGLLMKKMY EARAGNPSTN KAECLRRAQL ELLRGEEEKE
PSSEPPSQQE AGQSRDCGGR NSPSFETPPS APYAHPYFWA PFVLMGNFR