Gene Information |
Locus tag | ECH74115_B0002 |
Symbol | gspC |
ID | 6966479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 81334 |
End bp | 82209 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643384018 |
Product | general secretion pathway protein C |
Protein accession | YP_002268497 |
Protein GI | 209395545 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3031] Type II secretory pathway, component PulC |
TIGRFAM ID | [TIGR01713] general secretion pathway protein C |
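
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon (assumed to be included in the gene length) encodes no residue.
    return codons - 1 if has_stop else codons


length = gene_length(81334, 82209)
print(length)                  # 876
print(protein_length(length))  # 291
```

Both values agree with the Gene Length (876 bp) and Protein Length (291 aa) fields of this record.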
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.856958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.237385 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTTCT TTCTATCATT CCGGGGTGAC CGGGGCTTAT TTATAAAAGA TATCGTACTT AAAATGCTGA CGCCAAACCG GCTATTGTGC GTAATTTTAC TTATTGCCGG ATATCAGCTG GTGTCGGTTA TCCATCATTT CTGGCTGACT CAGGCAGCAT CGGTGCCCGG CTTATCCCGT GTTAGCGCGC CGGAAACAGC GGTAACTGGT GATCAAACTG AAGAACGTTT TGTTTTTACG TTATTCGGCA GGGCATCCCC ACTATCATCG GAGGGGAGAG CGCAGGAAAC AATGCCTTCC CTGTCAGATG ATCTGCTTTC AGGGGAGGAT CTTGACGTGA GAGGTATACT TTATAGTTCG GTTGCAGAGC ATTCCGTTGC CATATTTGCA CATAATAACA GACAGTTCAG TCTGAGCGTC GGCGAAAAAG TACCCAGCTA CGATGCTACG ATTAGTGCTA TTTTTAGCGA TCATATTGTT ATTAACTATC AGGGAAAAAC TGTATCACTG CCTCTGCGAT ATGATAATAC CGAAAAAAAG AATGCATATG ACAATAATAA TTTAACAGTT GGAGACGTGA TAACCCAAGA TAATTTTCGG GTAGAAAGTG TTTTTGATAT TATGAGCTTT TCAGCCGTTA CGGTTAATAA TACATTAAGC GGTTATCGCT TGATTCCGGG TAAGCACAGT TCGTTATTTT ATAATGCTGG GTTGCATGAT AACGATCTGG CCGTATCGGT TAATGGTTCA GAATTGCGTG ATACCAGACA GGCGCAGCAG ATAATGAAGC AATTGCCAGA ACTTAAAGAA ATAAAAATAA CCGTCGAGCG TGATGGTCAG TTATATGATG CATTTATTGC TGTAGGAGAA AACTGA
|
Protein sequence | MLFFLSFRGD RGLFIKDIVL KMLTPNRLLC VILLIAGYQL VSVIHHFWLT QAASVPGLSR VSAPETAVTG DQTEERFVFT LFGRASPLSS EGRAQETMPS LSDDLLSGED LDVRGILYSS VAEHSVAIFA HNNRQFSLSV GEKVPSYDAT ISAIFSDHIV INYQGKTVSL PLRYDNTEKK NAYDNNNLTV GDVITQDNFR VESVFDIMSF SAVTVNNTLS GYRLIPGKHS SLFYNAGLHD NDLAVSVNGS ELRDTRQAQQ IMKQLPELKE IKITVERDGQ LYDAFIAVGE N
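
As a rough illustration of how the GC content and protein sequence relate to the gene sequence, the sketch below computes GC percentage and translates a CDS codon by codon. It is a minimal example, not the pipeline used to produce this record. The helper names are hypothetical, and the codon table is built by hand: translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons are permitted, which does not matter here because this gene starts with ATG. Only the first 30 bases of the gene sequence are embedded for brevity.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
prefix = "ATGTTGTTCT TTCTATCATT CCGGGGTGAC"
print(translate(prefix))  # MLFFLSFRGD
```

Passing the full gene sequence to `gc_content` and `translate` should allow the GC content and Protein sequence fields to be compared against the record.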