Gene EcSMS35_3239 details

Gene Information

Locus tag: EcSMS35_3239
Symbol: gspL
ID: 6147096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3311240
End bp: 3312418
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 57%
IMG OID: 641618069
Product: GspL-like protein
Protein accession: YP_001745219
Protein GI: 170683391
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3297] Type II secretory pathway, component PulL
TIGRFAM ID: [TIGR01709] general secretion pathway protein L
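
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3311240
end_bp = 3312418
gene_length_bp = 1179
protein_length_aa = 392

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop
# codon and does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

If the span, the reported gene length, and the protein length disagree, the record or the coordinate convention assumed here should be re-examined.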


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.139825
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.775494
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTTCCA TGCTTGAGAT TTATTTCCCG CTTTGCGCCG CTGATCCCAT CCGTTGGCAG 
CGCCGTACAC CCGACGTGGA GCACGGCATC TGGCCTGACG TCGCCGACGA ATATCTCCAG
CAATGGCTGC AAACAGACAC AATTCGACTC TATATTCCCG GCGAATGGAT CAGCGTCTGG
CAGGTTGAAC TGCCTGATGT GCCTCGCAAG CAGATACCGA CGATTCTGCC CGCCTTGCTG
GAAGAAGAGC TGAACCAGGA TATCGATGAA CTGCATTTCG CCCCGTTGAA AATCGACCAG
CAACTGGCAA CCGTAGCTGT GATTCACCAG CAGCATATGC GCAACATTGC GCAGTGGTTG
CAGGCAAACG GCATCACCCG CGCTACCGTC GCGCCAGACT GGATGTCCAT TCCTTGTGGC
TATATGGCTG GCGATGCGCA ACGGGTTATC TGCCGCATCG ATGAATGCCG GGGATGGAGC
GCCGGGCGGG CGCTGGCTCC GGTCATGTTC CGCGCACAGC TCAATGAGCA GGATATACCG
CTTTCACTAA CCGTGGTCGG CATTGCACCG GAAGAACTGT CTGCATGGGC TGGTGCGGAC
GCCGAACGCC TGACCGTTAC GACTCTGCCA GCCATTACCA CTTATGGCGA ATCGGAAGGG
AACCTGCTAA CAGGGCCGTG GCAGCCTCGT GTCAGCTACC GAAAACAGTG GGCGCGCTGG
CGGGTGATGA TTCTGCCGAT ATTGCTGATT CTGGTTGCGC TGGTAGTGGA ACGGGGCGTG
ACGTTATGGA GCGTCAGCGA ACAGGTGGCG CAAAGCCGCA CCCAGGCGGA GAAACAGTTC
TTAACGCTAT TCCCAGAGCA GAAGCGGATT GTGAATTTAC GCTCTCAGGT GACGATGGCG
CTGAAAAAAT ATCGCCCACA GGCCGACGAT ACCCGGCTGC TCGCAGAATT GTCAGCGATC
GCCAGTACCC TGAAATCAGC GTCACTTACC GACATCGAAA TGCGTGGTTT CACCTTTGAT
CAAAAACGCC AGACGCTTCA CCTCCAACTG CGGGCTGCGA ACTTTGCCAG CTTCGACAAA
CTGCGTAGCG CACTGGCAAC CGATTATGTT GTGCAACAGG ACGCGTTACA GAAAGAGGGT
GATGCGGTTT CCGGCGGCGT AACGTTGCGG AGGAAATAA
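
The GC content field listed under Gene Information can be recomputed from a gene sequence formatted like the one above. This is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips the whitespace used to group the sequence into blocks of ten and returns the fraction of G and C bases.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (spaces and newlines used for display grouping) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


# Example on the first ten bases of the gene sequence only.
print(gc_content("GTGAGTTCCA"))
```

Passing the full gene sequence and multiplying by 100 gives a percentage that can be compared with the reported value.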
 
Protein sequence
MSSMLEIYFP LCAADPIRWQ RRTPDVEHGI WPDVADEYLQ QWLQTDTIRL YIPGEWISVW 
QVELPDVPRK QIPTILPALL EEELNQDIDE LHFAPLKIDQ QLATVAVIHQ QHMRNIAQWL
QANGITRATV APDWMSIPCG YMAGDAQRVI CRIDECRGWS AGRALAPVMF RAQLNEQDIP
LSLTVVGIAP EELSAWAGAD AERLTVTTLP AITTYGESEG NLLTGPWQPR VSYRKQWARW
RVMILPILLI LVALVVERGV TLWSVSEQVA QSRTQAEKQF LTLFPEQKRI VNLRSQVTMA
LKKYRPQADD TRLLAELSAI ASTLKSASLT DIEMRGFTFD QKRQTLHLQL RAANFASFDK
LRSALATDYV VQQDALQKEG DAVSGGVTLR RK
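
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a hedged sketch, assuming the sense-codon assignments of the standard genetic code (which translation table 11 shares) and the commonly listed table 11 start codons; `translate_cds` and the constant names are hypothetical. An alternative start codon such as GTG in the first position is read as methionine.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons commonly listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A recognized start codon in the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then the final 9 bases.
print(translate_cds("GTGAGTTCCA TGCTTGAGAT TTATTTCCCG"))  # MSSMLEIYFP
print(translate_cds("AGGAAATAA"))  # RK
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above; the full gene sequence can be passed in the same way.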