Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3248 |
Symbol | gspC |
ID | 6145363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3320207 |
End bp | 3321166 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618078 |
Product | putative type II secretion protein GspC |
Protein accession | YP_001745228 |
Protein GI | 170681120 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3031] Type II secretory pathway, component PulC |
TIGRFAM ID | [TIGR01713] general secretion pathway protein C |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCGCGGG TTGTTTTTCG TGACGCACGA ATTTATCTCA TTCAATGGCT GACAAAAATT CGTCACACTC TTAACCAGAG ACAATCTCTT AATACAGACA AAGAGCATCT GCGCAAAATT GCACGCGGGA TGTTCTGGCT GATGCTGCTT ATTATTTCTG CAAAAGTGGC GCATTCACTC TGGCGCTATT TCTCTTTTTC TGCGGAATAT ATGGCTGTTT CCCCATCGGC GAATAAACCG CTCCGTGCGG ATGCAAAAGC GTTCGATAAA AATGACGTGC AATTAATCAG CCAGCAAAAC TGGTTTGGCA AATATCAGCC CGTCGCCACG CCGGTAAAAC AACCCGAACC TGCACCTGTG GCCGAAACGC GTCTTAATGT GGTGTTGCGT GGGATCGCCT TTGGTGCCAG ACCCGGCGCG GTTATTGAAG AAGGTGGTAA ACAGCAGGTC TATTTGCAGG GTGAAACGCT TGGCTCGCAC AACGCAGTGA TTGAGGAAAT CAACCGCGAC CATGTGATGC TGCGCTATCA GGGAAAAATG GAACGTCTGA GTCTGGCAGA AGAGAAGCGT CCCACCATAG CCGTGACCAG CAAAAAAGCC GTCAGCGACG AAGCAAAGCA AGCTGTTGCT GAGCCTGCTG CCAGTGCGCC AGTTGAGATC CCGGCTGCCG TGCGTCAGGC ACTGGCGAAA GATCCGCAGA AAATTTTTAA CTATATCCAG CTTACGCCTG TGCGTAAGGA AGGGATTGTC GGTTATGCAG TGAAACCGGG GGCAGATCGT TCTCTGTTCG ATGCCAGCGG TTTTAAGGAA GGCGATATCG CCATTGCGCT AAATCAGCAG GATTTCACTG ATCCACGAGC AATGATTGCT CTGATGCGGC AGTTACCTTC AATGGATTCC ATTCAACTTA CGGTTTTACG CAAGGGTGCG CGCTACGACA TTTCCATCGC GCTGCGCTAA
|
Protein sequence | MARVVFRDAR IYLIQWLTKI RHTLNQRQSL NTDKEHLRKI ARGMFWLMLL IISAKVAHSL WRYFSFSAEY MAVSPSANKP LRADAKAFDK NDVQLISQQN WFGKYQPVAT PVKQPEPAPV AETRLNVVLR GIAFGARPGA VIEEGGKQQV YLQGETLGSH NAVIEEINRD HVMLRYQGKM ERLSLAEEKR PTIAVTSKKA VSDEAKQAVA EPAASAPVEI PAAVRQALAK DPQKIFNYIQ LTPVRKEGIV GYAVKPGADR SLFDASGFKE GDIAIALNQQ DFTDPRAMIA LMRQLPSMDS IQLTVLRKGA RYDISIALR
|
| |