Gene EcSMS35_3248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3248 
SymbolgspC 
ID6145363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3320207 
End bp3321166 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content51% 
IMG OID641618078 
Productputative type II secretion protein GspC 
Protein accessionYP_001745228 
Protein GI170681120 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3031] Type II secretory pathway, component PulC 
TIGRFAM ID[TIGR01713] general secretion pathway protein C 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGCGGG TTGTTTTTCG TGACGCACGA ATTTATCTCA TTCAATGGCT GACAAAAATT 
CGTCACACTC TTAACCAGAG ACAATCTCTT AATACAGACA AAGAGCATCT GCGCAAAATT
GCACGCGGGA TGTTCTGGCT GATGCTGCTT ATTATTTCTG CAAAAGTGGC GCATTCACTC
TGGCGCTATT TCTCTTTTTC TGCGGAATAT ATGGCTGTTT CCCCATCGGC GAATAAACCG
CTCCGTGCGG ATGCAAAAGC GTTCGATAAA AATGACGTGC AATTAATCAG CCAGCAAAAC
TGGTTTGGCA AATATCAGCC CGTCGCCACG CCGGTAAAAC AACCCGAACC TGCACCTGTG
GCCGAAACGC GTCTTAATGT GGTGTTGCGT GGGATCGCCT TTGGTGCCAG ACCCGGCGCG
GTTATTGAAG AAGGTGGTAA ACAGCAGGTC TATTTGCAGG GTGAAACGCT TGGCTCGCAC
AACGCAGTGA TTGAGGAAAT CAACCGCGAC CATGTGATGC TGCGCTATCA GGGAAAAATG
GAACGTCTGA GTCTGGCAGA AGAGAAGCGT CCCACCATAG CCGTGACCAG CAAAAAAGCC
GTCAGCGACG AAGCAAAGCA AGCTGTTGCT GAGCCTGCTG CCAGTGCGCC AGTTGAGATC
CCGGCTGCCG TGCGTCAGGC ACTGGCGAAA GATCCGCAGA AAATTTTTAA CTATATCCAG
CTTACGCCTG TGCGTAAGGA AGGGATTGTC GGTTATGCAG TGAAACCGGG GGCAGATCGT
TCTCTGTTCG ATGCCAGCGG TTTTAAGGAA GGCGATATCG CCATTGCGCT AAATCAGCAG
GATTTCACTG ATCCACGAGC AATGATTGCT CTGATGCGGC AGTTACCTTC AATGGATTCC
ATTCAACTTA CGGTTTTACG CAAGGGTGCG CGCTACGACA TTTCCATCGC GCTGCGCTAA
 
Protein sequence
MARVVFRDAR IYLIQWLTKI RHTLNQRQSL NTDKEHLRKI ARGMFWLMLL IISAKVAHSL 
WRYFSFSAEY MAVSPSANKP LRADAKAFDK NDVQLISQQN WFGKYQPVAT PVKQPEPAPV
AETRLNVVLR GIAFGARPGA VIEEGGKQQV YLQGETLGSH NAVIEEINRD HVMLRYQGKM
ERLSLAEEKR PTIAVTSKKA VSDEAKQAVA EPAASAPVEI PAAVRQALAK DPQKIFNYIQ
LTPVRKEGIV GYAVKPGADR SLFDASGFKE GDIAIALNQQ DFTDPRAMIA LMRQLPSMDS
IQLTVLRKGA RYDISIALR