Gene Information |
Locus tag | EcHS_A3136 |
Symbol | gspF2 |
ID | 5595255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3144205 |
End bp | 3145428 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640922255 |
Product | general secretion pathway protein F |
Protein accession | YP_001459754 |
Protein GI | 157162436 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | [TIGR02120] general secretion pathway protein F |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 83 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACTGT TTTACTATCA GGCGCTGGAG CGTAATGGTC GCAAAACCAA AGGTATGATT GAGGCGGATT CCGCGCGTCA TGCCCGCCAG TTGTTGCGCG GTAAAGAGCT TATCCCCGTG CATATTGAAG CCCGGATGAA TACTTCGTCA GGGGGGATGT TGCAGCGTCG GCGGCACGCA CATCGTCGCG TGGCGGCGGC AGATCTTGCG CTGTTCACGC GCCAACTGGC AACGCTGGTA CAGGCAGCAA TGCCGCTGGA AACCTGCTTA CAGGCGGTCA GTGAGCAAAG TGAAAAACTG CATGTAAAAA GCCTCGGAAT GGCGCTGCGC AGCCGGATTC AGGAAGGTTA CACCCTGTCG GACAGCCTGC GCGAACATCC CCGCGTCTTT GATTCTCTGT TTTGTTCGAT GGTTGCTGCC GGAGAAAAAT CCGGGCATCT CGACGTGGTG CTCAATCGCC TGGCAGATTA CACCGAACAG CGACAGCGCC TGAAATCACG TCTATTGCAG GCCATGCTCT ATCCGCTGGT TCTGCTGGTG GTGGCAACGG GCGTAGTCAC TATTTTGCTG ACGGCAGTGG TGCCGAAAAT CATCGAACAG TTCGATCACC TCGGACACGC GCTGCCCGCC ACCACCCGCG CGCTTATCGC CATGAGCGAC GCGTTACAGG CCAGCGGCGT TTACTGGCTG GCGGGATTGC TGGCGCTTCT GGTGCTGGGG CAACGGCTAC TTAAAAATCC TGCTATGCGC CTGCGCCGGG ATAAAACCTT GCTGCGTCTG CCCGTGACGG GCCGTGTTGC GCGCGGGCTG AATACGGCGC GTTTTTCCCG CACATTAAGC ATCCTCACCG CCAGCAGTGT TCCGCTACTG GAAGGCATTC AGACCGCTGC CGCCGTGTCG GCAAATCGCT ATGTCGAACA ACAACTACTG CTGGCGGCAG ATCGCGTCCG CGAAGGAAGC AGTCTGCGTG CCGCGCTGGC GGAGTTGCGC CTGTTCCCCC CGATGATGCT GTACATGATC GCCTCCGGCG AACAGAGCGG CGAACTGGAA ACCATGCTTG AGCAGGCCGC TGTTAACCAG GAACGGGAAT TTGATACCCA GGTGGGGCTG GCGTTAGGGC TGTTTGAACC GGCGCTGGTG GTGATGATGG CGGGCGTGGT GCTGTTTATC GTCATCGCCA TCCTCGAGCC GATGCTGCAA CTGAACAATA TGGTTGGAAT GTAA
|
Protein sequence | MALFYYQALE RNGRKTKGMI EADSARHARQ LLRGKELIPV HIEARMNTSS GGMLQRRRHA HRRVAAADLA LFTRQLATLV QAAMPLETCL QAVSEQSEKL HVKSLGMALR SRIQEGYTLS DSLREHPRVF DSLFCSMVAA GEKSGHLDVV LNRLADYTEQ RQRLKSRLLQ AMLYPLVLLV VATGVVTILL TAVVPKIIEQ FDHLGHALPA TTRALIAMSD ALQASGVYWL AGLLALLVLG QRLLKNPAMR LRRDKTLLRL PVTGRVARGL NTARFSRTLS ILTASSVPLL EGIQTAAAVS ANRYVEQQLL LAADRVREGS SLRAALAELR LFPPMMLYMI ASGEQSGELE TMLEQAAVNQ EREFDTQVGL ALGLFEPALV VMMAGVVLFI VIAILEPMLQ LNNMVGM
|
| |
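The length fields in this record can be cross-checked against the coordinates and the sequences. The following is a minimal sketch, not part of the original record: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon (the gene sequence above ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def clean_sequence(grouped: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(3144205, 3145428)   # Start bp, End bp from the record
    print(length, protein_length(length))    # compare with Gene Length, Protein Length
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare with the GC content field. Strand does not affect any of these checks, since length and GC fraction are the same on either strand.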