Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1096 |
Symbol | |
ID | 6273015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 999404 |
End bp | 1000639 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725231 |
Product | side tail fiber protein |
Protein accession | YP_001879749 |
Protein GI | 187731452 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTAA AGATTTCTGG TGTACTGAAA GACGGCACAG GAAAACCGGT ACAGAACTGC ACAATCCAGC TGAAAGCAAA ACGTAACAGT ACCACGGTGG TGGTGAACAC GCTGGCCTCA GAAAATCCGG ATGAAGCCGG GCGTTACAGT ATGGACGTTG AGTATGGTCA GTACAGCGTT ATTCTGTTGG TGGAGGGATT CCCGCCGTCA CATGCCGGGA CTATCACCGT GTATGAAGAT TCTCAACCCG GTACGCTGAA TGATTTTCTC GGTGCCATGA CGGAGGATGA TGCCCGTCCG GAGGCACTGC GTCGCTTTGA ACTGATGGTG GAAGAGGTGG TGCGTAACGC AGAGGAGGCG AAGAAGAATG CCGGAGAGGC GGAGACGTCA GCGAGGAATG CCGGCATATC AGCCAGTCAG GCAGAAGAGA GCGCTGCAAA TGCTGACACT TCAGCAGGGG ATGCATCGGA GTCAGCCCGG CAGGCGGCAG AAAGTGCAGC CTCAGCAAAG CAGTCAGAGG AGGCGTCCTC GTCCTCGGCC TCTGCGGCCG CTCAAAAAGC CAGTGAGTCA TTACAAAGTG CAACAGATGC TGAGTTGTCA AAAAAGACGG CAGAAAGTGC AGCCGGTAAT GCAGTCAGGG ATGCAACGAC CGCAACAGAA AAAGCCCGGG AGTCAACAGA AAGCGCACAG TCAGCGGAAC AAAGCAGGAT AGCGGCGGAA GAGGCCGTAA ACCGAATCCC CACCGTGGTG GGACCTCCCG GGCCAAAGGG GGAACCGGGG CCCGCGGGTC CTCAGGGGCC GAAGGGAGAC ACAGGAGCCC CCGGGCAAGG AACAGAACTG CTTACTACTG CCAATACATG GACTCAGGCA CAAACTTTTA ATGGTGGTAT TAATGGCAAT TTGACGGTGA CCGGAAACGG CTCATTTAAC GATATTCAGA TCCGTTCGGA TAAACGCAAC AAGCGAAATC TGGTAAAACT GGATAATGCG TTAGATCGTC TGGAGGCACT TACTGGTTAT CTTTACGAGA TACAGTACTC TGCCGACGGT TGGCAAACGT CGGTTGGTTT AATTGCTCAG GATGCACAAA AAGCATTGCC TGAACTGGTA ACTGAAGACG CAGACGTTAT ATCTGGTGAA AAACGTCTGC GTCTTAACTA CAACGGCATA ATTGCATTGT TAGTCGAGGG CTTTAAAACA CTTCGTCATG AGATTAAAGA ACTCCGGGAG AAGTAA
|
Protein sequence | MTVKISGVLK DGTGKPVQNC TIQLKAKRNS TTVVVNTLAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMTEDDARP EALRRFELMV EEVVRNAEEA KKNAGEAETS ARNAGISASQ AEESAANADT SAGDASESAR QAAESAASAK QSEEASSSSA SAAAQKASES LQSATDAELS KKTAESAAGN AVRDATTATE KARESTESAQ SAEQSRIAAE EAVNRIPTVV GPPGPKGEPG PAGPQGPKGD TGAPGQGTEL LTTANTWTQA QTFNGGINGN LTVTGNGSFN DIQIRSDKRN KRNLVKLDNA LDRLEALTGY LYEIQYSADG WQTSVGLIAQ DAQKALPELV TEDADVISGE KRLRLNYNGI IALLVEGFKT LRHEIKELRE K
|
| |