Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3245 |
Symbol | gspF |
ID | 6145182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3315401 |
End bp | 3316624 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641618075 |
Product | general secretion pathway protein GspF |
Protein accession | YP_001745225 |
Protein GI | 170681469 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | [TIGR02120] general secretion pathway protein F |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.750263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTGT TTTACTATCA GGCGCTGGAG CGTAATGGTC GCAAAACCAA AGGCATGATT GAGGCGGATT CCGCGCGTCA TGCCCGTCAA TTGTTACGCG GTAAAGACCT CATTCCCGTG CACATTGAAG CCCGGATGAA TGCATCGGCA GGGGGAATGT TGCAGCGTCG GCGGCACGCA CATCGTCGCG TGGCGGCGGC AGATCTGGCG CTGTTCACTC GTCAACTGGC AACGCTGGTG CAGGCAGCAA TGCCGCTGGA AACCTGCTTA CAGGCGGTCA GTGAGCAAAG CGAAAAACTG CATGTAAAAA GCCTCGGAAT GGCGCTGCGC AGCCGGATTC AGGAAGGTTA TACCCTGTCG GACAGCCTGC GCGAACATCC CCGCGTCTTT GACTCCCTGT TTTGTTCGAT GGTCGCCGCC GGAGAAAAAT CCGGGCATCT CGACGTGGTG CTCAATCGCC TGGCGGATTA CACCGAACAG CGGCAGCGTC TGAAATCACG CCTGCTGCAG GCCATGCTCT ATCCGCTGGT TCTGCTGGTG GTGGCAACGG GCGTAGTCAC TATTTTGCTG ACGGCAGTGG TGCCGAAAAT TATCGAACAG TTTGATCATC TCGGACACGC GCTACCCGCC TCCACCCGAA TGCTCATCGC TATGAGCGAC GCGTTACAGG CCAGCGGCGT GTACTGGCTG GCGGGTTTGC TGGGGCTTCT GGTGCTGGGG CAACGGTTAC TCAAAAATCC TGCGATGCGC CTGCGCTGGG ATAAAACCTT GCTGCGCCTG CCCGTGACGG GGCGTGTTGC GCGCGGACTG AATACGGCGC GTTTTTCCCG CACGTTAAGC ATCCTCACCG CCAGCAGTGT TCCGCTGCTG GAAGGCATTC AGACCGCCGC CGCCGTGTCG GCAAATCGTT ATGTCGAGCA ACAACTGCTG CTGGCGGCAG ATCGCGTCCG CGAAGGAAGC AGCCTGCGCG CCGCGCTGGC GGATCTGCGC CTGTTCCCGC CGATGATGCT GTACATGATC GCCTCCGGCG AACAGAGCGG CGAGCTGGAA ACCATGCTTG AACAGGCCGC GATCAACCAG GAACGGGAAT TTGATACCCA GGTGGGTCTG GCGTTAGGGC TGTTTGAGCC GGCGCTGGTG GTGGTGATGG CGGGCGTGGT GCTGTTTATC GTCATCGCCA TCCTCGAACC GATGCTGCAA CTGAACAATA TGGTTGGAAT GTAA
|
Protein sequence | MALFYYQALE RNGRKTKGMI EADSARHARQ LLRGKDLIPV HIEARMNASA GGMLQRRRHA HRRVAAADLA LFTRQLATLV QAAMPLETCL QAVSEQSEKL HVKSLGMALR SRIQEGYTLS DSLREHPRVF DSLFCSMVAA GEKSGHLDVV LNRLADYTEQ RQRLKSRLLQ AMLYPLVLLV VATGVVTILL TAVVPKIIEQ FDHLGHALPA STRMLIAMSD ALQASGVYWL AGLLGLLVLG QRLLKNPAMR LRWDKTLLRL PVTGRVARGL NTARFSRTLS ILTASSVPLL EGIQTAAAVS ANRYVEQQLL LAADRVREGS SLRAALADLR LFPPMMLYMI ASGEQSGELE TMLEQAAINQ EREFDTQVGL ALGLFEPALV VVMAGVVLFI VIAILEPMLQ LNNMVGM
|
| |