Gene Sfum_3666 details


Gene Information

Locus tag: Sfum_3666
Symbol:
ID: 4458023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 4474344
End bp: 4476044
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 59%
IMG OID: 639704439
Product: type II secretion system protein E
Protein accession: YP_847771
Protein GI: 116751084
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
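
The coordinate and length fields above are consistent with each other, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in one stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4474344, 4476044)
print(length, protein_length(length))  # 1701 566
```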


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAGT CTTCTTCATT GCTCAGTGCT TTCCGGCGGG AGTTCAACCT GCAGGAGGAG 
CAGGTCTGTC AGTTGCGCGA TGCCTATTTC ATGGAGGGAT TGCCTCTTGC GGACGCCTTC
GAGCAGATTT CGGTGACCGA TGAGAACCGC ATCCTCCGCG TTCTGTCCCA GCATTTCGGG
ATCCCGCTGC TGGAGCGTGA AGATTACCCG GAGAAACCAG TGCTCCTCGA GGGCGTTTCC
ATGGTTTTTC TGCGCAAGCA CGCCGTCCTC CCCATCCAGA TCGAAAGCGG CAAGGTGAAG
GTAGTGGTCA ACAATCCCTT GAACCTGCCG GTGCTGAACC TCCTGGGCGG TTACTTCGCC
GACATGAAAC TCAGTCTGTG CCTTGGGCAG CGGGAGGAGA TCCGAACGGC CATCGACCGG
CTGTACGGGA CCGCGGCAAG GGAGGCCGAA GAACAGTCCC GCTCCGGTGA AGGAGCGGTG
CTCAACGGCA GCGCTTTCGA GGAGGATCTC GAGCACCTGA GGGGATTGGC CCAGGAAGCG
CCTATCGTGA GGCTCGTCAA CGTCCTGATC TCGCGCGCGC TCGACATGCG GGCATCCGAC
ATTCATTTCG AGCCGTTCGA GCGCAGTTTC CAGGTCCGAT GCAGAGTCGA CGGTGTGCTC
TTCGACCTCG ATCAACCTCA GAAGAGCATG CAGGCGGCGA TCGTCAGCCG CCTCAAACTG
ATGGCCAACC TCAACATCGC TGAACGCAGG CTGCCCCAGG ACGGCAAGAT CAAGCTCAAG
TTCGGCAACC GCGAGGTCGA CATCCGCGTT TCGACCGTTC CGACCATATA CGGAGAGAGT
ATCGTCCTCC GGCTCCTGGC TCAGGAGGGG GTCGAATACA ATCTGGCCAA CCTGGGCATG
GACGGGGAGG ACCTCGCCTA CCTGGAACAG CTTGTCGAAC GGCCTTTCGG TATGATCCTC
GTCACCGGCC CGACCGGTTC CGGCAAGACG ACCACCCTTT ACGGGGTCCT CAAAAAACTG
AATTCCGTGG TGAGAAAAAT CATCACGGTC GAGGATCCCG TCGAATACCA GATCAACGGC
ATCAACCAGA TCCAGGTCAA GCCTCAGATA AACCTCACCT TCGCCAACGC GCTCAGGTCT
CTTGTCCGGC AAGACCCGGA CGTGCTTCTG ATCGGAGAAA TCCGGGACAA GGAAACGGCC
GACATCGCCA TCGAATCGGC TCTCACCGGT CACCTGGTGT TGTCGACCCT TCACACGAAC
GACGCGCCGG GCGCCATCAC ACGGCTCCGG GACATCGGGA TCGAGTCTTT TCTGATGGCC
GACTCCCTTC TGACGGTTCT CGCCCAGCGC CTGGTGCGTG TGCTCTGCCC TCATTGCAAG
GAACCTTACA CTGCGCGCGA AGCCGATTGG AACCGTCTCA ACGAGATCAT TCCCGACCTG
CCTCGCTCCC TTACGCTTTG CCAGAACAAG GGCTGCGATC GATGCGGCTA CACCGGATTC
AGGGGCAGAC AGGGAATCTT CGAGGTGCTC AAGGTGAATG AGAGCGTCCG ATCGGCGATC
GTCGCCGGGA AGGATGCCGG GAGCATTGCC CCGATCGCCT TCGAATCGGG CTACCGACCG
CTTTTGCACC ATGGCTATCG GAAGGTGATC GAAGGCATTA CGACGCTCTC GGAGGTGCTG
AGGGTGACAA GCCTGTCATG A
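
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before computing properties such as the GC content reported under Gene Information. The following is a minimal sketch, assuming the block contains only A, C, G and T plus whitespace; `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence shown above.
first_row = "ATGTCCGAGT CTTCTTCATT GCTCAGTGCT TTCCGGCGGG AGTTCAACCT GCAGGAGGAG"
seq = clean_sequence(first_row)
print(len(seq), round(100 * gc_content(seq)))
```

Passing the full gene sequence block instead of `first_row` gives the percentage for the whole gene, which can then be compared with the reported value.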
 
Protein sequence
MSESSSLLSA FRREFNLQEE QVCQLRDAYF MEGLPLADAF EQISVTDENR ILRVLSQHFG 
IPLLEREDYP EKPVLLEGVS MVFLRKHAVL PIQIESGKVK VVVNNPLNLP VLNLLGGYFA
DMKLSLCLGQ REEIRTAIDR LYGTAAREAE EQSRSGEGAV LNGSAFEEDL EHLRGLAQEA
PIVRLVNVLI SRALDMRASD IHFEPFERSF QVRCRVDGVL FDLDQPQKSM QAAIVSRLKL
MANLNIAERR LPQDGKIKLK FGNREVDIRV STVPTIYGES IVLRLLAQEG VEYNLANLGM
DGEDLAYLEQ LVERPFGMIL VTGPTGSGKT TTLYGVLKKL NSVVRKIITV EDPVEYQING
INQIQVKPQI NLTFANALRS LVRQDPDVLL IGEIRDKETA DIAIESALTG HLVLSTLHTN
DAPGAITRLR DIGIESFLMA DSLLTVLAQR LVRVLCPHCK EPYTAREADW NRLNEIIPDL
PRSLTLCQNK GCDRCGYTGF RGRQGIFEVL KVNESVRSAI VAGKDAGSIA PIAFESGYRP
LLHHGYRKVI EGITTLSEVL RVTSLS
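
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of translation table 11, which match the standard code (the tables differ only in the permitted start codons), and it stops at the first stop codon. `CODON_TABLE` and `translate` are hypothetical names; the amino acid string follows the conventional NCBI layout with bases ordered T, C, A, G.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a sequence block in frame 1, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence; the result matches the start of the protein.
print(translate("ATGTCCGAGT CTTCTTCATT GCTCAGTGCT TTCCGGCGGG AGTTCAACCT GCAGGAGGAG"))
# MSESSSLLSAFRREFNLQEE
```

A gene that begins with an alternative start codon such as GTG or TTG would need its first residue set to M explicitly; this gene starts with ATG, so that case does not arise here.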