Gene Information |
Locus tag | SbBS512_E1289 |
Symbol | |
ID | 6270518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1178897 |
End bp | 1180402 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641725410 |
Product | head-tail preconnector protein GP5 |
Protein accession | YP_001879921 |
Protein GI | 187733565 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
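The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumption).
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1178897, 1180402)
    print(length, expected_protein_length(length))  # prints: 1506 501
```

Under these assumptions the result agrees with the Gene Length (1506 bp) and Protein Length (501 aa) fields of the record.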
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGACGTA ATCTTTCACA CATTATTGCC GCAGCATTCA ATGAACCGCT GCTTCTGGAG CCCGCCTATG CGCGGGTTTT CTTTTGCGCG CTCGGGCGCG AGATGGGCGC AGCAAGTCTT TCGGTACCAC AACAGCAGGT ACAGCTTGAT GCTCCCGGAA TGCTGGCTGA AACGGACGAG TACATGGCCG GAGGTAAACG ACCGGCCCGT GTTTACCGGG TGGTGAACGG TATTGCGGTA CTGCCGGTGA CCGGCACGCT GGTGCACCGG CTGGGGGGGA TGCGGCCATT TTCCGGAATG ACTGGCTATG ACGGCATTGT CGCCTGTCTT CAGCAGGCAA TGGCAGATAG CCAGGTGCGG GGCATACTGC TGGACATTGA CAGTCCGGGC GGGCAGGCCG CCGGCGCGTT TGACTGCGCT GACATGATTT ACCGCCTCCG GCAGCAGAAG CCGGTCTGGG CACTGTGCAA TGACACGGCC TGTTCTGCAG CCATGCTGCT GGCGTCGGCC TGCTCCCGAC GGCTGGTTAC CCAGACATCC CGTATCGGCT CCATTGGCGT GATGATGAGC CATGTCAGCT ATGCCGGTCA TCTGGCGCAG GCCGGTGTGG ATATCACGCT GATTTATGCC GGGGCGCACA AGGTGGATGG CAATCAGTTT GAAGCGTTGC CGGCAGAGGT TCGCCAGGAT ATGCAGCAGC GGATTGATGC GGCGCACCGG ATGTTTGCCG AAAAAGTGGC GATGTATACC GGGTTGTCTG TGGATGCGGT CACGGGAACA GAGGCCGCCG TTTTTGAAGG TCAGTCCGGC ATTGAGGCCG GGCTGGCGGA TGAATTAATC AATGCGTCGG ATGCCATCAG TGTGATGGCC ACGGCGCTGA ACAGTAATGT CAGAGGAGGC ACTATGCCGC AATTAACTGC AACGGAAGCC GCCGTGCAGG AGAACCAGCG AGTGATGGGG ATCCTGACAT GCCAGGAAGC GAAAGGACGT GAACAGCTTG CCACGATGCT GGCAGGGCAA CAGGGCATGA GCGTTGAACA GGCCCGGGCG ATTCTGGCCG CGGCGGCACC GCAGCAGCCG GTGGCATCCG CGCAGAGTGA AGCCGATCGC ATTATGGCGT GTGAAGAAGC GAACGGTCGT GAACAACTGG CAGCAACGCT GGCGGCGATG CCGGAGATGA CGGTGGAAAA AGCCCGCCCG ATCCTGGCGG CTGCACCACT GGCGAATGCC GGACCATCAC TCCGTGATCA GATCATGGCA CTGGATGAGG CAAAAGGGGC TGAGGCGCAG GCTGAACAGC TGGCTGCCTG CCCGGGAATG ACTGTGGAGA GCGCCCGGGC TGTGCTGGCT GCGGGATCAG GTAAGGCAGA ACCGGTCTCT GCATCCACAA CCGCCCTGTT TGAACATTTC ATGGCGAACC ATTCACCGGC TGCGGTCCAG GGGGGCGTGT CACAGGCATC AGAAGACGGT GATGCGGACG TGAAAATGCT CATGGCCATG CCATGA
|
Protein sequence | MRRNLSHIIA AAFNEPLLLE PAYARVFFCA LGREMGAASL SVPQQQVQLD APGMLAETDE YMAGGKRPAR VYRVVNGIAV LPVTGTLVHR LGGMRPFSGM TGYDGIVACL QQAMADSQVR GILLDIDSPG GQAAGAFDCA DMIYRLRQQK PVWALCNDTA CSAAMLLASA CSRRLVTQTS RIGSIGVMMS HVSYAGHLAQ AGVDITLIYA GAHKVDGNQF EALPAEVRQD MQQRIDAAHR MFAEKVAMYT GLSVDAVTGT EAAVFEGQSG IEAGLADELI NASDAISVMA TALNSNVRGG TMPQLTATEA AVQENQRVMG ILTCQEAKGR EQLATMLAGQ QGMSVEQARA ILAAAAPQQP VASAQSEADR IMACEEANGR EQLAATLAAM PEMTVEKARP ILAAAPLANA GPSLRDQIMA LDEAKGAEAQ AEQLAACPGM TVESARAVLA AGSGKAEPVS ASTTALFEHF MANHSPAAVQ GGVSQASEDG DADVKMLMAM P
|
| |
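The relationship between the gene sequence, the protein sequence, and the Translation table field (11) can be sketched in a few lines. The code below is a minimal illustration, not the database's own pipeline. It assumes that translation table 11 uses the standard amino-acid assignments and differs only in the set of accepted start codons (including GTG, which opens this gene), and that the first codon is always read as methionine. All function and variable names are hypothetical. The whitespace-grouped sequence strings of the record can be passed in as they are.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# assumed here to be shared by translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in dna if base in "GC") / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a plus-strand CDS; stops at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not an accepted start codon")
    protein = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGAGACGTA ATCTTTCACA CATTATTGCC"))  # prints: MRRNLSHIIA
```

Applied to the full gene sequence, `translate_cds` can be compared against the listed protein sequence (after removing its spacing), and `gc_percent` against the GC content field.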