Gene EcSMS35_0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0441 
SymbolsecF 
ID6144917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp453427 
End bp454398 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content52% 
IMG OID641615337 
Productpreprotein translocase subunit SecF 
Protein accessionYP_001742544 
Protein GI170684249 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0341] Preprotein translocase subunit SecF 
TIGRFAM ID[TIGR00916] protein-export membrane protein, SecD/SecF family
[TIGR00966] protein-export membrane protein SecF 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACAGG AATATACTGT TGAACAACTA AACCACGGCC GTAAAGTCTA TGACTTTATG 
CGCTGGGACT ACTGGGCTTT CGGCATCTCT GGTCTGCTGT TAATCGCCGC TATCGTTATT
ATGGGCGTGC GCGGCTTTAA CTGGGGGCTG GATTTCACCG GTGGTACGGT TATTGAAATT
ACGCTCGAAA AACCGGCTGA AATTGACGTA ATGCGTGATG CATTGCAAAA AGCCGGTTTT
GAAGAGCCGA TGCTGCAAAA CTTTGGTAGC AGCCACGACA TCATGGTCCG TATGCCGCCT
GCTGAAGGCG AAACCGGCGG TCAGGTGCTG GGCAGCCAGG TTCTGAAGGT GATTAACGAA
TCCACCAATC AGAATGCGGC AGTGAAGCGT ATTGAGTTCG TCGGGCCGAG CGTGGGGGCA
GACCTTGCGC AAACCGGTGC GATGGCGTTG ATGGCGGCGC TGCTGTCTAT CCTCGTGTAC
GTAGGTTTCC GCTTTGAGTG GCGACTGGCG GCAGGGGTTG TTATTGCGCT GGCGCACGAC
GTTATCATTA CGCTGGGTAT TTTGTCGTTA TTCCATATCG AGATTGACCT GACCATTGTG
GCATCGTTGA TGTCAGTTAT CGGTTACTCG CTTAACGACA GTATCGTGGT ATCGGACCGT
ATTCGTGAAA ACTTCCGCAA GATCCGTCGC GGTACGCCTT ACGAAATCTT TAACGTGTCC
TTGACCCAGA CGCTGCACCG TACCTTGATC ACATCCGGTA CTACCTTGAT GGTTATCCTG
ATGCTGTACC TCTTCGGTGG TCCGGTACTG GAAGGCTTCT CGCTGACCAT GCTTATCGGT
GTTTCCATCG GTACTGCATC TTCCATCTAC GTAGCATCGG CGCTGGCTCT GAAACTGGGT
ATGAAGCGCG AACACATGCT GCAGCAGAAA GTGGAAAAAG AAGGGGCGGA TCAGCCGTCA
ATTCTGCCTT AA
 
Protein sequence
MAQEYTVEQL NHGRKVYDFM RWDYWAFGIS GLLLIAAIVI MGVRGFNWGL DFTGGTVIEI 
TLEKPAEIDV MRDALQKAGF EEPMLQNFGS SHDIMVRMPP AEGETGGQVL GSQVLKVINE
STNQNAAVKR IEFVGPSVGA DLAQTGAMAL MAALLSILVY VGFRFEWRLA AGVVIALAHD
VIITLGILSL FHIEIDLTIV ASLMSVIGYS LNDSIVVSDR IRENFRKIRR GTPYEIFNVS
LTQTLHRTLI TSGTTLMVIL MLYLFGGPVL EGFSLTMLIG VSIGTASSIY VASALALKLG
MKREHMLQQK VEKEGADQPS ILP