Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3239 |
Symbol | gspL |
ID | 6147096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3311240 |
End bp | 3312418 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618069 |
Product | GspL-like protein |
Protein accession | YP_001745219 |
Protein GI | 170683391 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3297] Type II secretory pathway, component PulL |
TIGRFAM ID | [TIGR01709] general secretion pathway protein L |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.139825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.775494 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTCCA TGCTTGAGAT TTATTTCCCG CTTTGCGCCG CTGATCCCAT CCGTTGGCAG CGCCGTACAC CCGACGTGGA GCACGGCATC TGGCCTGACG TCGCCGACGA ATATCTCCAG CAATGGCTGC AAACAGACAC AATTCGACTC TATATTCCCG GCGAATGGAT CAGCGTCTGG CAGGTTGAAC TGCCTGATGT GCCTCGCAAG CAGATACCGA CGATTCTGCC CGCCTTGCTG GAAGAAGAGC TGAACCAGGA TATCGATGAA CTGCATTTCG CCCCGTTGAA AATCGACCAG CAACTGGCAA CCGTAGCTGT GATTCACCAG CAGCATATGC GCAACATTGC GCAGTGGTTG CAGGCAAACG GCATCACCCG CGCTACCGTC GCGCCAGACT GGATGTCCAT TCCTTGTGGC TATATGGCTG GCGATGCGCA ACGGGTTATC TGCCGCATCG ATGAATGCCG GGGATGGAGC GCCGGGCGGG CGCTGGCTCC GGTCATGTTC CGCGCACAGC TCAATGAGCA GGATATACCG CTTTCACTAA CCGTGGTCGG CATTGCACCG GAAGAACTGT CTGCATGGGC TGGTGCGGAC GCCGAACGCC TGACCGTTAC GACTCTGCCA GCCATTACCA CTTATGGCGA ATCGGAAGGG AACCTGCTAA CAGGGCCGTG GCAGCCTCGT GTCAGCTACC GAAAACAGTG GGCGCGCTGG CGGGTGATGA TTCTGCCGAT ATTGCTGATT CTGGTTGCGC TGGTAGTGGA ACGGGGCGTG ACGTTATGGA GCGTCAGCGA ACAGGTGGCG CAAAGCCGCA CCCAGGCGGA GAAACAGTTC TTAACGCTAT TCCCAGAGCA GAAGCGGATT GTGAATTTAC GCTCTCAGGT GACGATGGCG CTGAAAAAAT ATCGCCCACA GGCCGACGAT ACCCGGCTGC TCGCAGAATT GTCAGCGATC GCCAGTACCC TGAAATCAGC GTCACTTACC GACATCGAAA TGCGTGGTTT CACCTTTGAT CAAAAACGCC AGACGCTTCA CCTCCAACTG CGGGCTGCGA ACTTTGCCAG CTTCGACAAA CTGCGTAGCG CACTGGCAAC CGATTATGTT GTGCAACAGG ACGCGTTACA GAAAGAGGGT GATGCGGTTT CCGGCGGCGT AACGTTGCGG AGGAAATAA
|
Protein sequence | MSSMLEIYFP LCAADPIRWQ RRTPDVEHGI WPDVADEYLQ QWLQTDTIRL YIPGEWISVW QVELPDVPRK QIPTILPALL EEELNQDIDE LHFAPLKIDQ QLATVAVIHQ QHMRNIAQWL QANGITRATV APDWMSIPCG YMAGDAQRVI CRIDECRGWS AGRALAPVMF RAQLNEQDIP LSLTVVGIAP EELSAWAGAD AERLTVTTLP AITTYGESEG NLLTGPWQPR VSYRKQWARW RVMILPILLI LVALVVERGV TLWSVSEQVA QSRTQAEKQF LTLFPEQKRI VNLRSQVTMA LKKYRPQADD TRLLAELSAI ASTLKSASLT DIEMRGFTFD QKRQTLHLQL RAANFASFDK LRSALATDYV VQQDALQKEG DAVSGGVTLR RK
|
| |