Gene Sfum_4054 details

Gene Information

Locus tag: Sfum_4054
Symbol:
ID: 4457578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 4922065
End bp: 4923681
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 64%
IMG OID: 639704825
Product: sporulation domain-containing protein
Protein accession: YP_848155
Protein GI: 116751468
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
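
The start, end, gene length, and protein length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4922065, 4923681)
print(length)                     # 1617
print(protein_length_aa(length))  # 538
```

Both results match the Gene Length and Protein Length fields of this record.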


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAGGA CAAAAAGGTT GCAGCTGTTG TGCGTCGTTG CGGTTCTGCT GTGCGTGGCG 
TGCGCCACGC CGAGTAGTAA GAGTACGCCG GGACAGTACA GCGCCGAACA ATTGCGGGTA
ATGGGAGAGA AGTTCCTGGC GGCCGGCGAT TCGATTCAAG CCGTGAAGTA TCTCACGGCC
GCGGAACAGA AGACCCCCAA GGATCCCACT CTTCTCTACT ATATCGGCGT TGCCTACAGC
GGTCGGAACA TGCACGCAGA AGCGCTCTCC TACTATCAGA AGGCACTCGC GGAGAAGCCG
GACTACCCCG AAGTCTATAA CGCCATGGGG GTGCTCTACG CCGGTCGGGG GCAGTACGAT
CAGGCCCAGG CCGCCTTCCA GAAAGTGCTC GCCAGCCCGT TCTACGAGAC GCCGCAGTTT
GCCCGGTACA ACCTGGGACT CGTCTACGAA AAGAAAGGGG ACCAGCAAGC CGCTCTCCAG
CAGTACCAGG AAGCCGCCCG GCTCCAGCCG ACACATGCCC TGTCCCATCA CCGCACGGCC
ATGATCCTTG AAGCGCAGGG TCGTGCCGGT GAAGCGCAGA AAGAGTTCGC GATGGCATTG
CAGTACTCGC CCGACCTGGC CGAGGCGCAC ATGCACTACG GCATCCTGTG TTTCGGCACG
GGGGATTTCG ACACCGCCGT CCATTCGTTT GCCCGCGTGA TCAGACTCAT GCCCAATACG
GTCGAGGCGG ACGAAGCCCG CAAATACCTG GACCGGATCG CCGCGGCCCA GGATGCCGCC
TCGCGTTCCA TGCCCTTTCA CCCTCCCGAG AGACGGGTCC GCATGGAAGT GATCCCGCAA
CCCGATGAGC GCTACTCGGA ACCGCCTCAG CCCACGTTCC GGACGCTTCC TTCCAAGCCG
TCCCCGGAGC CGTCGCGGAT GGAACCTCCC GTACGAGTCG AAACGGCTCC AGGGGTGGAT
GCGGCAGTGA GGATGGAACC GCCGGCAAGA CTCGAGCAGC CTGCCGGTCT GGAACCGCCC
CCCAGGGTTG AGCCGCCGCC GAGGGTGGAG CCCCCCGCCG ATGCCGGACC GAGCAAATCC
GAGTTGAGCG TCGTGCGTGA CGGAAATACG ATCAGAGTGG AACAGATCCC CAAAGCAGGA
CTGCCCCTCA AAGCGGAAGA ACCCGCAAGA ATGGAGCCGC CGGGAAGGAT CGACCAGGGC
GCCAGGGTTG AACCGCCTGC CATGCCGGAG CCGCCCGCGC GAGTGGAACA GCCCGTCAAG
GTGGAAGAAC CGGGCGGCAA GGAAGCGATC GCCGTCACGG ACAAGGCCCC GCCCGCGGGC
GCGGCGGACG CGCCTGCGGT CGAACCCGAC CAGGCTCCGC CACCGCCGCC CCAGTTCAAG
TTCGTGGTGC AGGTCGGGTC ATTTCCCGAA AAAGCAAATG CCGAGGAAAT GCAGGCCTCA
CTGCTCAAGA AGGGCTATGC GACGGTGCTC AGACGGGTCA AGGACCGTAC CCGGGGCATG
GTCTACGTGC TGCAGCTCAA ACCCGTCGAG TCCCTCTCCA AGGCCAGCAC GCTGGCGACC
CAACTGGAAA CCGAAGTCCA GGGCACACCG ACGGTTCTGA GGGTGCGGGC CGACTGA
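
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be joined before it can be measured. As a minimal sketch, the helpers below (hypothetical names, plain Python with no external library) strip the whitespace and compute the fraction of G and C bases, which is one way a GC content figure like the one in the Gene Information table could be reproduced. Only the first line of the sequence is used here, so the printed percentage describes that fragment and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence; paste every line to analyse the full gene.
first_line = "ATGTTAAGGA CAAAAAGGTT GCAGCTGTTG TGCGTCGTTG CGGTTCTGCT GTGCGTGGCG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this fragment
```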
 
Protein sequence
MLRTKRLQLL CVVAVLLCVA CATPSSKSTP GQYSAEQLRV MGEKFLAAGD SIQAVKYLTA 
AEQKTPKDPT LLYYIGVAYS GRNMHAEALS YYQKALAEKP DYPEVYNAMG VLYAGRGQYD
QAQAAFQKVL ASPFYETPQF ARYNLGLVYE KKGDQQAALQ QYQEAARLQP THALSHHRTA
MILEAQGRAG EAQKEFAMAL QYSPDLAEAH MHYGILCFGT GDFDTAVHSF ARVIRLMPNT
VEADEARKYL DRIAAAQDAA SRSMPFHPPE RRVRMEVIPQ PDERYSEPPQ PTFRTLPSKP
SPEPSRMEPP VRVETAPGVD AAVRMEPPAR LEQPAGLEPP PRVEPPPRVE PPADAGPSKS
ELSVVRDGNT IRVEQIPKAG LPLKAEEPAR MEPPGRIDQG ARVEPPAMPE PPARVEQPVK
VEEPGGKEAI AVTDKAPPAG AADAPAVEPD QAPPPPPQFK FVVQVGSFPE KANAEEMQAS
LLKKGYATVL RRVKDRTRGM VYVLQLKPVE SLSKASTLAT QLETEVQGTP TVLRVRAD
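
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is a self-contained illustration and not the pipeline used to produce this record. It builds the codon table from the standard amino acid assignments, which translation table 11 shares; table 11 differs from the standard code in the start codons it permits. The gene here begins with ATG, so no special handling of alternative start codons is included. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, with each codon position ordered T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGTTAAGGA CAAAAAGGTT GCAGCTGTTG"))  # MLRTKRLQLL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would translate all 538 codons before the terminal TGA.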