Gene Information |
Locus tag | SbBS512_E2376 |
Symbol | |
ID | 6269790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2166416 |
End bp | 2167486 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641726379 |
Product | putative fimbrial protein |
Protein accession | YP_001880861 |
Protein GI | 187731105 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
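The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive (the reported 1071 bp fits that reading) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length = gene_length(2166416, 2167486)
print(length, expected_protein_length(length))  # 1071 356
```

The strand field does not affect this arithmetic: a gene on the minus strand spans the same number of bases as one on the plus strand.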
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0000169866 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATAA TCTTTGGAGA AAAATGCGTG TTATTACTAC GACTATTTTT TGCCGCCGTC TTAATGCTAT GGTGCGCTCA AACTGCTGCT TATAGCGGGC AGTGTCATAC CACTCAGGGG AATCCGTATA TTGGCGTCAA TTTTGGCGTT AAAACCCTGG AGGAAGAAGA AAATACGGCT GGAGTAGTTA AAGACAAATT TTATCAGTGG AACGAATCGA ATGATTATTA TGTTTCCTGT GATTGCGATA AAGACAATGT CAGAAGTGGC CGATGGGCAT TCGCCGCGGA TTCACCGTTA GTCTATTTAG GCGACAACTG GTACAAAATT AATGACTATC TTGCCGCCAA AGTTTTATTG CAGGTTAAAG GCAGTTCTCC TACTGCGGTT CCTTTCGAAA ACGTGGGCAC AGGGGCAGAT ACACGATGGC ATATTTGCGA TCCCGGCGGT CAACGTTTAG GTGGCCAGGG GGCTAGCGGT AATAGCGGTA GCTTTTCCCT GAAAATATTG CAGCCGTTCG TTGGTTCGGT CGTCATTCCT CCTATGGCGC TGGCGCGATT ATTTGAATGC TACAACATAC CCGCAGGTGA TTCCTGCACG ACTACAGGTA CACCGGTTTT AGTGTATTAC CTGTCTGGTA CTATCAATTC ACTTGGCTCA TGTTCCGTCA ATGCCGGAGA AACAATCGAG GTCGATCTGG GCGACGTATT TGCGGCTAAC TTTCGTGTTG TAGGGCATAA GCCTCTTGGG GCCAGAACGG CAGAACTTGC AATTCCAGTC AGGTGTAACA CGGGAAACGC GGGGTTAGTT AACGTCAACC TGAGTCTGAC GGCAACTACA GACCCCAGCT ATCCCCAGGC GATTAAGACG TCACGTCCTG GCGTGGGCGT GGTGGTGACC GATAGCCAGA ACAACATTAT TTCCCCTGCT GGTGGAACAT TACCGCTCTC TATTCCTGAT GATGCAGACA GTATCGCGCG AATGAATGTC TATCCAGTCA GCACGACAGG TGTACCACCA GAAACCGGGC GATTTGAAGC CACGGCAACG GTGAGAATAA ATTTTGATTA A
|
Protein sequence | MQIIFGEKCV LLLRLFFAAV LMLWCAQTAA YSGQCHTTQG NPYIGVNFGV KTLEEEENTA GVVKDKFYQW NESNDYYVSC DCDKDNVRSG RWAFAADSPL VYLGDNWYKI NDYLAAKVLL QVKGSSPTAV PFENVGTGAD TRWHICDPGG QRLGGQGASG NSGSFSLKIL QPFVGSVVIP PMALARLFEC YNIPAGDSCT TTGTPVLVYY LSGTINSLGS CSVNAGETIE VDLGDVFAAN FRVVGHKPLG ARTAELAIPV RCNTGNAGLV NVNLSLTATT DPSYPQAIKT SRPGVGVVVT DSQNNIISPA GGTLPLSIPD DADSIARMNV YPVSTTGVPP ETGRFEATAT VRINFD
|
| |