Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03129 |
Symbol | gspF |
ID | 8115701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3319261 |
End bp | 3320457 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644849312 |
Product | hypothetical protein |
Protein accession | YP_003000885 |
Protein GI | 251786581 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | [TIGR02120] general secretion pathway protein F |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATC GCTATCGCGC CATGACCCAG GATGGTCAAA AATTGCAAGG GATCATTGAT GCTAACGATG AACGTCAGGC ACGACTGCGG CTGCGTGAAG AAGGGCTTTT CCTGCTGGAT ATTCGCCCCC AAAAAAGTTC GGGAGTAAAA ACACGTCGCC CGAGGATCAG CCATAGTGAA CTGACGCTTT TCACCCGGCA GTTGGCAACC TTAAGCGCAG CGGCATTACC CCTGGAAGAG AGCCTTGCCG TAATCGGTCA ACAAAGCAGT AATAAACGAC TGGGTGACGT GTTAAATCAG GTACGCAGCG CCATCCTTGA AGGGCATCCC CTTTCCGATG CATTACAGCA TTTTCCCACG CTTTTCGATT CGCTCTATCG TACCCTGGTA AAAGCGGGCG AAAAGAGCGG GCTGCTGGCC CCGGTGTTGG AAAAGCTGGC TGATTACAAT GAAAACCGGC AGAAAATCCG CAGCAAGCTC ATTCAGTCAC TGATCTACCC CTGTATGCTC ACTACGGTGG CGATTGGGGT CGTGATTATT CTCCTCACTG CTGTCGTGCC CAAAATTACC GAACAGTTCG TGCATATGAA GCAGCAACTG CCGCTGAGTA CACGCATTCT TTTAGGTCTG AGCGACACGT TGCAACGTAC CGGCCCGACA TTATTAGCGA CAGTGTTTAT TGTCGCTGTA GGTTTCTGGC TCTGGTTAAA ACGCGGCAAT AACCGCCACC GTTTTCATGC CATGTTGCTG CGCGTTGCGC TCATCGGCCC GCTGATTTGC GCCATTAACA GCGCACGCTA TCTCCGCACT TTAAGTATTT TGCAATCCAG CGGCGTCCCT CTGCTGGATG GGATGAATTT GTCCACCGAA AGCCTCAACA ACCTCGAAAT TCGCCAGCGT CTGGCAAATG CGGCAGAGAA CGTTCGCCAG GGTAACAGCA TTCATCTTTC GCTGGAACAA ACCGCAATTT TCCCGCCGAT GATGCTCTAC ATGGTGGCCT CTGGCGAAAA AAGCGGGCAG CTCGGCACAT TAATGGTCAG AGCCGCAGAT AACCAGGAGA CACTCCAACA AAATCGGATC GCCTTAACGC TCTCCATCTT CGAGCCAGCA CTCATTATTA CGATGGCACT GATCGTCCTG TTTATTGTCG TGTCGGTACT CCAACCTCTT CTTCAACTTA ACTCAATGAT TAATTAA
|
Protein sequence | MNYRYRAMTQ DGQKLQGIID ANDERQARLR LREEGLFLLD IRPQKSSGVK TRRPRISHSE LTLFTRQLAT LSAAALPLEE SLAVIGQQSS NKRLGDVLNQ VRSAILEGHP LSDALQHFPT LFDSLYRTLV KAGEKSGLLA PVLEKLADYN ENRQKIRSKL IQSLIYPCML TTVAIGVVII LLTAVVPKIT EQFVHMKQQL PLSTRILLGL SDTLQRTGPT LLATVFIVAV GFWLWLKRGN NRHRFHAMLL RVALIGPLIC AINSARYLRT LSILQSSGVP LLDGMNLSTE SLNNLEIRQR LANAAENVRQ GNSIHLSLEQ TAIFPPMMLY MVASGEKSGQ LGTLMVRAAD NQETLQQNRI ALTLSIFEPA LIITMALIVL FIVVSVLQPL LQLNSMIN
|
| |