Gene EcHS_A3139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3139 
SymbolgspC1 
ID5593801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3149011 
End bp3149970 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content51% 
IMG OID640922258 
Productputative type II secretion protein GspC 
Protein accessionYP_001459757 
Protein GI157162439 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3031] Type II secretory pathway, component PulC 
TIGRFAM ID[TIGR01713] general secretion pathway protein C 


Plasmid Coverage information

Num covering plasmid clones75 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCGCGGG TTGTTTTTCG TGACGCACGA ATTTATCTCA TTCAATGGCT GACAAAAATT 
CGTCACACTC TTAACCAGAG ACAATCTCTT AATACAGACA AAGAGCATCT GCGCAAAATT
GCACGCGGGA TGTTCTGGCT GATGCTGCTT ATTATTTCTG CAAAAATGGC GTATTCACTC
TGGCGCTATT TCTCCTTTTC TGCGGAATAT ACGGCTGTTT CCCCATCGGC GAATAAACCG
CCCCGTGCGG ATGCAAAAAC GTTCGATAAA AATGACGTGC AATTAATCAG CCAGCAAAAC
TGGTTTGGCA AATATCAGCC CGTCGCCACG CCGGTAAAAC AACCCGAACC TGTGCCTGTG
GCAGAAACGC GTCTTAATGT GGTGTTGCGT GGGATCGCCT TTGGTGCCAG ACCCGGCGCG
GTTATTGAAG AAGGTGGCAA ACAGCAGGTC TATTTGCAGG GGGAACGGCT TGGTTCTCAC
AACGCGGTGA TTGAGGAAAT CAACCGCGAC CATGTGATGC TGCGCTATCA GGGAAAAATA
GAGCGCCTGA GCCTGGCAGA AGAGGGGCAT TCCACCGTAG CCGTGACCAA CAAAAAAGCC
GTCAGTGACG AAGCAAAGCA AGCTGTTGCT GAGCCTGCTG CCAGTGCGCC AGTTGAGATC
CCGACTGCCG TGCGTCAGGC ACTGACGAAA GATCCGCAGA AAATTTTTAA CTATATCCAG
CTTACGCCTG TGCGTAAGGA GGGGATTGTC GGTTATGCAG TGAAGCCGGG GGCAGATCGT
TCTCTGTTCG ATGCCAGCGG TTTCAAGGAA GGCGATATCG CCATTGCGCT AAATCAGCAG
GATTTCACTG ATCCACGAGC AATGATTGCT CTGATGCGGC AGTTACCTTC AATGGATTCC
ATTCAACTTA CGGTTTTACG CAAGGGTGCG CGCCACGACA TTTCCATCGC GCTGCGCTAA
 
Protein sequence
MARVVFRDAR IYLIQWLTKI RHTLNQRQSL NTDKEHLRKI ARGMFWLMLL IISAKMAYSL 
WRYFSFSAEY TAVSPSANKP PRADAKTFDK NDVQLISQQN WFGKYQPVAT PVKQPEPVPV
AETRLNVVLR GIAFGARPGA VIEEGGKQQV YLQGERLGSH NAVIEEINRD HVMLRYQGKI
ERLSLAEEGH STVAVTNKKA VSDEAKQAVA EPAASAPVEI PTAVRQALTK DPQKIFNYIQ
LTPVRKEGIV GYAVKPGADR SLFDASGFKE GDIAIALNQQ DFTDPRAMIA LMRQLPSMDS
IQLTVLRKGA RHDISIALR