Gene Information |
Locus tag | ECH74115_B0005 |
Symbol | gspF |
ID | 6966458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 85684 |
End bp | 86907 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384021 |
Product | general secretion pathway protein F |
Protein accession | YP_002268500 |
Protein GI | 209395613 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | [TIGR02120] general secretion pathway protein F |
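The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical helpers written for this illustration, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 85684
END_BP = 86907
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 407


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace (e.g. 10-base grouping) is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence field from the Sequence table to `gc_percent` would give a value to compare against the reported GC content. The spaces between 10-base groups do not need to be removed first.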
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.138499 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTTGT TTCATTATCA GGCATCAGAT ATTCACGGTA GAAAACGCAG TGGCATTCTG GAGGCGGATT CGGCCCGACA TGCCCGTCAG TTGCTGCGTG AGCAGGCGCT TATTCCAGTC AGGTTGGATG AGAAACAGGT CCATCACAAG CACTCACTGC GGAGTATCCT GAGGTTTCGT CCGCGCGGGG GGAGCAGCGC CGAACTCGCG TTACTGACAC GGCAGTTAGC CACACTAGTG GCCGCATCTC TGCCGCTTGA AGAGGCGCTG GATGCGTTGT TGAGGCAGAG TGAAAAACCG CGTCAGCGGA ACTTAATCGC TGCCGTGCGC ACCAAGGTTC TCGAGGGACA TTCCTTGGCC GCGGCGATGG GCATGTTTCC AGGTACGTTT GAGCGTCTTT ATTGTGCGAT GGTGGCGGCG GGGGAAACAT CCGGTCGTTT GGACGTGGTG CTCAGTCGTC TGGCTGACTA TACCGAGCAG CGCCAGATCA TGCGAAACCG TCTACTCCAG GCATTGCTCT ATCCTTGTGT TCTGACGTTG GTGGCGGTTG GGGTTATTGC CATTCTGCTT ACTGCGGTTG TGCCGAAAGT GGTTGAGCAG TTTATTCATA TGAAACAGAC CCTTCCCTTA TCTACCCGCG TACTGATGGG GGCTGCTGAG GTGAGCCAGA CTTGGGGCCC GTGGCTGTTA CTGGCCGCAG CGCTGGGCGG GATAGCGGGG CGCATGATAT TACATCAGCC CTCCCAGCGT CTGGCTTTTC ATCATCTGCT GTTGCGGCTG CCGGTCGTGG GGCGCATATC GCGCGGTCTG AATACCGCAC GTTATGCCCG CACGCTGAGC ATTCTTAATG CCAGTGCGGT ACCGCTGCTC CAGGCGATGC ACATCAGTGG TGACGTACTG AGTAATGACT GGGCCCGTCA TCAGTTAGCT ACCGCGGCCG AGTTGGTTAG GGAAGGGGTC AGCCTGCATC AGGCACTGGA GCAGACTTCG CTGTTTCCGC CTATGATGCG GCACATGATT GCCAGCGGTG AAAATAGTGG CGAACTCGAC AGCATGTTGG AACGGGCCGC CGACAATCAG GATCGTGAGT TCAGCACACA GATGCAACTG GCGCTGGGAT TGTTTGAACC GCTGTTGGTG GTGGGTATGG CCGGGGTCGT TTTGTTTATT GTTCTGGCAA TTCTGCAGCC GCTCCTGCAG CTCAACAACA TGATGAATAT GTGA
Protein sequence | MALFHYQASD IHGRKRSGIL EADSARHARQ LLREQALIPV RLDEKQVHHK HSLRSILRFR PRGGSSAELA LLTRQLATLV AASLPLEEAL DALLRQSEKP RQRNLIAAVR TKVLEGHSLA AAMGMFPGTF ERLYCAMVAA GETSGRLDVV LSRLADYTEQ RQIMRNRLLQ ALLYPCVLTL VAVGVIAILL TAVVPKVVEQ FIHMKQTLPL STRVLMGAAE VSQTWGPWLL LAAALGGIAG RMILHQPSQR LAFHHLLLRL PVVGRISRGL NTARYARTLS ILNASAVPLL QAMHISGDVL SNDWARHQLA TAAELVREGV SLHQALEQTS LFPPMMRHMI ASGENSGELD SMLERAADNQ DREFSTQMQL ALGLFEPLLV VGMAGVVLFI VLAILQPLLQ LNNMMNM