Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3410 |
Symbol | gspF |
ID | 6272618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3170018 |
End bp | 3171241 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641727299 |
Product | general secretion pathway protein GspF |
Protein accession | YP_001881748 |
Protein GI | 187732930 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | [TIGR02120] general secretion pathway protein F |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACTGT TTTACTATCA GGCGCTGGAG CGTAATGGTC GCAAAACCAA AGGCATGATT GAGGCGGATT CCGCGCGTCA TGCCCGCCAG TTGTTGCGCG GTAAAGAGCT TATCCCCGTG CACATTGAAG CCAGGATGAA TGCATCGGCA GGGGGAATGT TGCAGCGTCG GCGGCACGCA CATCGTCGCG TGGCGGCGGC AGATCTGGCG CTGTTCACGC GCCAACTGGC AATGCTGGTG CAGGCAGCAA TGCCGTTGGA AACCTGCTTG CAGGCGGTCA GTGAGCAAAG TGAAAAACTG CATGTGAAGA GCCTCGGAAT GGCGCTGCGC AGCCGGATTC AGGAAGGTTA TACCCTGTCG GACAGCCTGC GCGAACATCC CCGCGTCTTT GATTCCCTGT TTTGTTCGAT GGTCGCTGCC GGAGAAAAAT CCGGGCATCT CGACGTGGTG CTCAATCGCC TGGCAGATTA CACCGAACAG CGACAGCGCC TGAAATCACG TCTATTGCAG GCCATGCTCT ATCCGCTGGT TATGCTGGTG GTGGCAACGG GCGTGGTCAC TATTTTGCTG ACGGCAGTGG TGCCGAAAAT CATCGAACAG TTTGATCATC TCGGACACGC GCTACCCTCC TCCACCCGTG CGCTTATCGC CATGAGTGAT GCGTTACAGG CCAGCGGCGT TTACTGGCTG GCGGGATTGC TGGCGCTTCT GGTGCTGGGG CAATGGCTAC TTAAAAATCC GACTATGCGC CTGCGCTGGG ATAAAACCTT GCTGCGTCTG CCCGTGACGG GCCGTGTTGC GCGCGGGCTG AATACGGCGC GTTTTTCCCG CACATTAAGC ATCCTCACCG CCAGCAGTGT TCCGCTACTG GAAGGCATTC AGACCGCTGC CGCCGTGTCG GCAAATCGCT ATGTCGAACA ACAACTGCTG CTGGCGGCAG ATCGCGTCCG CGAAGGAAGC AGTCTGCGCG CCGCGCTGGT GGAGTTGCGC CTGTTCCCGC CGATGATGCT GTACATGATC GCTTCCGGCG AACAGAGCGG CGAACTGGAA ACCATGCTTG AGCAGGCCGC TGTTAACCAG GAACGGGAAT TTGATACCCA GGTGGGGCTG GCGTTGGGGC TGTTTGAGCC TGCGCTGGTG GTGATGATGG CGGGCGTGGT GCTGTTTATT GTCATCGCCA TCCTCGAGCC GATGCTGCAA CTGAATAATA TGGTTGGAAT GTAA
|
Protein sequence | MALFYYQALE RNGRKTKGMI EADSARHARQ LLRGKELIPV HIEARMNASA GGMLQRRRHA HRRVAAADLA LFTRQLAMLV QAAMPLETCL QAVSEQSEKL HVKSLGMALR SRIQEGYTLS DSLREHPRVF DSLFCSMVAA GEKSGHLDVV LNRLADYTEQ RQRLKSRLLQ AMLYPLVMLV VATGVVTILL TAVVPKIIEQ FDHLGHALPS STRALIAMSD ALQASGVYWL AGLLALLVLG QWLLKNPTMR LRWDKTLLRL PVTGRVARGL NTARFSRTLS ILTASSVPLL EGIQTAAAVS ANRYVEQQLL LAADRVREGS SLRAALVELR LFPPMMLYMI ASGEQSGELE TMLEQAAVNQ EREFDTQVGL ALGLFEPALV VMMAGVVLFI VIAILEPMLQ LNNMVGM
|
| |