Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3404 |
Symbol | gspL |
ID | 6269867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3165860 |
End bp | 3167038 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641727294 |
Product | GspL-like protein |
Protein accession | YP_001881743 |
Protein GI | 187733759 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3297] Type II secretory pathway, component PulL |
TIGRFAM ID | [TIGR01709] general secretion pathway protein L |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTTCCA TCCTTGAGAT TTTTTTCCCG CTTTGCGCCG CTGATCCCAT CCATTGGCAG CGCCGTACAC CCGATGTGGA GCACGGTATC TGGTCTGACG TTGCTAACGA ACAGCTCCAG CAATGGCTGC AAACCGATGC GATTCGACTC TACATTCCCG GCGAATGGAT CAGCGTCTGG CAGGTTGAAC TGCCTGATGT CGCCCGTAAG CAGATACCGA CCATTCTGCC CGCCTTACTG GAAGAAGAGC TGAACCAGGA TATCGATGAA CTGCATTTCG CCCCGTTGAA TATCGACCAG CAACTGGCAA CCGTAGCAGT GATTCACCAA CAGCATATGC GCAACATTGC GCAGTGGTTG CAGGAAAACG GCATCACCCG CGCTACCGTC GCGCCAGACT GGATGTCCAT TCCTTGTGGG GTTATGGCTG GCGATGCGCA ACGGGTTATC TGCCGCATTG ATGAATGCCG GGGATGGAGC GCCGGGCGGG CGCTGGCTCC GGTCATGTTC CGCGCCCAGC TCAATGAGCA GGATTTACCG CTTTCGCTAA CCGTAGTCGG CATTGCACCG GAAAAGCTAT CAGCATGGGC TGGCGCAGAC GCTGAACGCC TGACCGTTAC AGCTCTGCCC GCCGTTACCA CTTATGGCGA ACCGGAAGGG AACCTGCTGA CAGGGCCGTG GCAGCCTCGC GTCAGCTACC GAAAACAGTG GGCGCGCTGG CGGGTGATGA TTCTGCCGAT ATTGCTGATT CTAGTTGCGC TGGCAGTGGA GCGGGGCGTG ACGTTATGGA GCGTCAGCGA ACAGGTGGCG CAAAGCCGCA CCCAGGCGGA GGAACAGTTC TTAACGTTGT TCCCGGAGCA GAAGCGGATT GTGAATTTAC GCTCTCAGGT GACGATGGCG CTGAAAAAAT ATCGCCCACA GGCCGACGAT ACCCGGCTGC TCGCCGAGTT GTCAGCGATA GCCAGCACCC TGAAATCAGC GTCACTTTCC GACATCGAAA TGCGTGGTTT TACCTTTGAT CAAAAACGCC AGATACTTCA CCTCCAGCTA CGGGCCGCGA ACTTTGCCAG TTTCGACAAA CTGCGTAGTG CACTGGCAAC CGATTATGTT GTGCAACAGG ACGCGTTACA GAAAGAGGGT GATGCGGTTT CCGGCGGCGT AACGTTGCGG AGGAAATAA
|
Protein sequence | MSSILEIFFP LCAADPIHWQ RRTPDVEHGI WSDVANEQLQ QWLQTDAIRL YIPGEWISVW QVELPDVARK QIPTILPALL EEELNQDIDE LHFAPLNIDQ QLATVAVIHQ QHMRNIAQWL QENGITRATV APDWMSIPCG VMAGDAQRVI CRIDECRGWS AGRALAPVMF RAQLNEQDLP LSLTVVGIAP EKLSAWAGAD AERLTVTALP AVTTYGEPEG NLLTGPWQPR VSYRKQWARW RVMILPILLI LVALAVERGV TLWSVSEQVA QSRTQAEEQF LTLFPEQKRI VNLRSQVTMA LKKYRPQADD TRLLAELSAI ASTLKSASLS DIEMRGFTFD QKRQILHLQL RAANFASFDK LRSALATDYV VQQDALQKEG DAVSGGVTLR RK
|
| |