Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1457 |
Symbol | |
ID | 6271732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1330927 |
End bp | 1332432 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641725558 |
Product | head-tail preconnector protein GP5 |
Protein accession | YP_001880064 |
Protein GI | 187732704 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGACGTA ATCTTTCACA CATTATTGCA GCAGCATTCA ATGAACCGCT GCTTCTGGAG CCCGCCTATG CGCGGGTTTT CTTTTGCGCG CTGGGGCGCG AGATGGGGGC AGCAAGTCTT TCGGTACCGC AACAGCAGGT ACAGCTTGAT GCTCCCGGGA TGCTGGCTGA AACGGACGAG TACATGGCCG GAGGTAAACG ACCGGCCCGT GTTTACCGGG TGGTGAACGG TATTGCGGTA CTGCCGGTGA CCGGCACGCT GGTGCACCGG CTGGGGGGGA TGCGGCCATT TTCCGGAATG ACTGGCTATG ACGGCATTGT CGCCTGTCTT CAGCAGGCAA TGGCAGATAG CCAGGTGCGG GGCATACTGC TGGACATTGA CAGTCCGGGC GGGCAGGCCG CCGGCGCGTT TGACTGCGCT GACATGATTT ACCGCCTCCG GCAGCAGAAG CCGGTCTGGG CACTGTGTAA TGACACGGCC TGTTCTGCGG CCATGCTGCT GGCGTCGGCC TGCTCCCGAC GGCTGGTTAC CCAGACATCC CGTATCGGTT CCATTGGCGT GATGATGAGC CATGTCAGCT ATGCCGGTCA TCTGGCGCAG GCCGGTGTGG ATATCACGCT GATTTACTCA GGGGCGCACA AGGTGGATGG CAATCAGTTT GAAGCGTTGC CGGCAGAGGT TCGCCAGGAC ATGCAGCAGC GGATTGATGC GGCGCGCCGG ATGTTTGCCG AAAAAGTGGC GATGTTTACC GGTCTGTCTG TTGATGCAGT CACGGGAACA GAGGCCGCTG TTTTTGAAGG TCAGTCCGGC ATTGAGGCCG GGCTGGCGGA TGAATTAATC AATGCGTCGG ATGCCATCAG TGTGATGGCC ACGGCGCTGA ACAGTAATGT CAGAGGAGGC ACTATGCCGC AATTAACTGC AACGGAAGCC GCCGTGCAGG AGAACCAGCG AGTGATGGGG ATCCTGACAT GCCAGGAAGC GAAAGGACGT GAACAGCTTG CCACGATGCT GGCAGGGCAA CAGGGCATGA GCGTTGAACA GGCCCGGGCG ATTCTGGCCG CGGCGGCACC GCAGCAGCCG GTGGCATCCG CGCAGAGTGA AGCCGATCGC ATTATGGCGT GTGAAGAAGC GAACGGTCGT GAACAACTGG CGGCAACGCT GGCGGCGATG CCGGAGATGA CGGTGGAAAA AGCCCGCCCG ATCCTGGCGG CTGCACCACT GGCGGATGCC GGGCCCTCGC TTCGTGATCA GATCATGGCC CTGGATGAGG CAAAAGGGGC AGAAGCGCAG GCTGAAAAAC TGGCGGCCTG CCCGGGAATG ACCGTGGAGA ACGCCCGGGC TGTGCTGGCT GCGGGATCAG GTAAGGCCGA ACCGGTCTCT GCATCCACAA CCGCCCTGTT TGAACATTTC ATGGCGAATC ATTCACCGGC AGCGGTGCGG GGTGGCGTGT CACAGACGTC AGCAGACGGT GATGCGGACG TGAAAATGCT CATGGCCATG CCATGA
|
Protein sequence | MRRNLSHIIA AAFNEPLLLE PAYARVFFCA LGREMGAASL SVPQQQVQLD APGMLAETDE YMAGGKRPAR VYRVVNGIAV LPVTGTLVHR LGGMRPFSGM TGYDGIVACL QQAMADSQVR GILLDIDSPG GQAAGAFDCA DMIYRLRQQK PVWALCNDTA CSAAMLLASA CSRRLVTQTS RIGSIGVMMS HVSYAGHLAQ AGVDITLIYS GAHKVDGNQF EALPAEVRQD MQQRIDAARR MFAEKVAMFT GLSVDAVTGT EAAVFEGQSG IEAGLADELI NASDAISVMA TALNSNVRGG TMPQLTATEA AVQENQRVMG ILTCQEAKGR EQLATMLAGQ QGMSVEQARA ILAAAAPQQP VASAQSEADR IMACEEANGR EQLAATLAAM PEMTVEKARP ILAAAPLADA GPSLRDQIMA LDEAKGAEAQ AEKLAACPGM TVENARAVLA AGSGKAEPVS ASTTALFEHF MANHSPAAVR GGVSQTSADG DADVKMLMAM P
|
| |