Gene ECH74115_B0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0005 
SymbolgspF 
ID6966458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp85684 
End bp86907 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content57% 
IMG OID643384021 
Productgeneral secretion pathway protein F 
Protein accessionYP_002268500 
Protein GI209395613 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID[TIGR02120] general secretion pathway protein F 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.138499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTGT TTCATTATCA GGCATCAGAT ATTCACGGTA GAAAACGCAG TGGCATTCTG 
GAGGCGGATT CGGCCCGACA TGCCCGTCAG TTGCTGCGTG AGCAGGCGCT TATTCCAGTC
AGGTTGGATG AGAAACAGGT CCATCACAAG CACTCACTGC GGAGTATCCT GAGGTTTCGT
CCGCGCGGGG GGAGCAGCGC CGAACTCGCG TTACTGACAC GGCAGTTAGC CACACTAGTG
GCCGCATCTC TGCCGCTTGA AGAGGCGCTG GATGCGTTGT TGAGGCAGAG TGAAAAACCG
CGTCAGCGGA ACTTAATCGC TGCCGTGCGC ACCAAGGTTC TCGAGGGACA TTCCTTGGCC
GCGGCGATGG GCATGTTTCC AGGTACGTTT GAGCGTCTTT ATTGTGCGAT GGTGGCGGCG
GGGGAAACAT CCGGTCGTTT GGACGTGGTG CTCAGTCGTC TGGCTGACTA TACCGAGCAG
CGCCAGATCA TGCGAAACCG TCTACTCCAG GCATTGCTCT ATCCTTGTGT TCTGACGTTG
GTGGCGGTTG GGGTTATTGC CATTCTGCTT ACTGCGGTTG TGCCGAAAGT GGTTGAGCAG
TTTATTCATA TGAAACAGAC CCTTCCCTTA TCTACCCGCG TACTGATGGG GGCTGCTGAG
GTGAGCCAGA CTTGGGGCCC GTGGCTGTTA CTGGCCGCAG CGCTGGGCGG GATAGCGGGG
CGCATGATAT TACATCAGCC CTCCCAGCGT CTGGCTTTTC ATCATCTGCT GTTGCGGCTG
CCGGTCGTGG GGCGCATATC GCGCGGTCTG AATACCGCAC GTTATGCCCG CACGCTGAGC
ATTCTTAATG CCAGTGCGGT ACCGCTGCTC CAGGCGATGC ACATCAGTGG TGACGTACTG
AGTAATGACT GGGCCCGTCA TCAGTTAGCT ACCGCGGCCG AGTTGGTTAG GGAAGGGGTC
AGCCTGCATC AGGCACTGGA GCAGACTTCG CTGTTTCCGC CTATGATGCG GCACATGATT
GCCAGCGGTG AAAATAGTGG CGAACTCGAC AGCATGTTGG AACGGGCCGC CGACAATCAG
GATCGTGAGT TCAGCACACA GATGCAACTG GCGCTGGGAT TGTTTGAACC GCTGTTGGTG
GTGGGTATGG CCGGGGTCGT TTTGTTTATT GTTCTGGCAA TTCTGCAGCC GCTCCTGCAG
CTCAACAACA TGATGAATAT GTGA
 
Protein sequence
MALFHYQASD IHGRKRSGIL EADSARHARQ LLREQALIPV RLDEKQVHHK HSLRSILRFR 
PRGGSSAELA LLTRQLATLV AASLPLEEAL DALLRQSEKP RQRNLIAAVR TKVLEGHSLA
AAMGMFPGTF ERLYCAMVAA GETSGRLDVV LSRLADYTEQ RQIMRNRLLQ ALLYPCVLTL
VAVGVIAILL TAVVPKVVEQ FIHMKQTLPL STRVLMGAAE VSQTWGPWLL LAAALGGIAG
RMILHQPSQR LAFHHLLLRL PVVGRISRGL NTARYARTLS ILNASAVPLL QAMHISGDVL
SNDWARHQLA TAAELVREGV SLHQALEQTS LFPPMMRHMI ASGENSGELD SMLERAADNQ
DREFSTQMQL ALGLFEPLLV VGMAGVVLFI VLAILQPLLQ LNNMMNM