Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4026 |
Symbol | |
ID | 6270270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3757241 |
End bp | 3759502 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641727866 |
Product | haemagglutinin family |
Protein accession | YP_001882298 |
Protein GI | 187731863 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGACGA ATACCACCAA TATCGCCAAT AACACTTCCA ATATTGCCAC TAACACCACC AACATCTCTA ATCTGACTGA GACGGTGACT AATCTTGGTG AGGATGCGCT GAAATGGGAT AAGGACAATG GTGTATTCAC GGCAGCTCAT GGCACCGAGA CCACCAGCAA AATCACCAAC GTTAAAGATG GCGACCTGAC GACTGGCAGC ACCGATGCCG TTAACGGCTC TCAGCTGAAA ACCACCAACG ATGCCGTGGC GACGAATACC ACCAATATCG CCACTAACAC CACCAACATC TCTAATCTGA CTGAGACGGT GACTAATCTT GGTGAGGATG CGCTGAAATG GGATAAGGAC AATGGTGTCT TCACTGCAGC TCATGGCAAC AATACCGCCA GCAAAATCAC CAATATCCTG GACGGCACAG TCACTGCAAC CAGTTCCGAT GCCATTAACG GTAGCCAGCT TTATGACTTA AGCAGCAATA TCGCCACCTA CTTCGGCGGC AATGCTTCTG TGAATACTGA CGGTGTGTTT ACCGGTCCAA CCTACAAAAT CGGTGAAACA AATTATTATA ACGTCGGCGA TGCACTGGCT GCGATTAACT CCTCATTTAG CACGTCTCTC GGCGATGCTC TGCTTTGGGA TGCCACCGCA GGTAAATTCA GTGCCAAACA CGGTACTAAT GGTGACGCAA GCGTGATCAC TGATGTCGCA GATGGTGAAA TTTCAGACTC CAGTTCTGAC GCAGTAAACG GCTCACAACT CCACGGCGTG AGCAGTTATG TTGTTGATGC GCTGGGGGGT GGTGCCGAAG TCAATGCAGA CGGCACCATC ACTGCGCCGA CGTACACCAT TGCTAATGCT GATTACGATA ATGTCGGTGA TGCCCTGAAT GCTATCGATA CCACTCTTGA CGACGCTCTG CTCTGGGATG CGGACGCCGG TGAAAATGGT GCATTTAGCG CCGCTCATGG AAAAGATAAA ACTGCCAGTG TAATCACTAA CGTCGCTAAC GGTGCAATCT CTGCTGCCAG CAGCGACGCG ATTAACGGCT CACAACTCTA TACCACCAAT AAGTACATCG CTGATGCGCT GGATGGTGAC GCAGAAGTCA ACGCTGACGG CACCATCACC GCACCGACTT ACACCATTGC GAACGCCGAG TACAACAACG TCGGTGACGC CCTGGATGCG CTTGATGATA ACGCCCTGCT GTGGGATGAG ACTGCCAATG GCGGTGCTGG AGCCTACAAT GCCAGCCATG ACGGTAAAGC CAGCATCATC ACTAATGTCG CTAATGGCAG TATTAGTGAG GACAGTACCG ATGCAGTGAA CGGTTCTCAG TTGAATGCGA CGAATATGAT GATTGAGCAG AACACCCAAA TTATCAATCA GCTCGCTGGT AACACCGACG CAACCTATAT CCAAGAAAAC GGTGCGGGTA TTAACTATGT GCGTACTAAC GACGACGGCT TAGCGTTCAA CGACGCCAGC GCACAGGGTG TTGGCGCTAC AGCTATAGGT TATAACTCTG TCGCCAAAGG CGATAGCAGC GTAGCTATTG GTCAGGGCAG CTACAGCGAC GTTGATACGG GTATCGCCCT AGGTAGCAGC TCTGTTTCCA GCCGAGTGAT TGCCAAAGGC TCCCGTGACA CCAGCATAAC GGAAAATGGC GTTGTTATTG GTTACGACAC CACGGATGGC GAACTGCTCG GTGCATTGTC TATCGGTGAT GACGGTAAAT ATCGTCAAAT CATCAACGTA GCCGATGGTT CCGAAGCCCA TGACGCCGTT ACGGTTCGTC AATTGCAGAA TGCGATTGGT GCGGTCGCAA CCACGCCGAC TAAATACTTC CACGCTAATT CAACGGAAGA AGATTCACTG GCAGTGGGAA CTGACTCGCT GGCAATGGGT GCGAAAACCA TCGTGAATGG CGATAAAGGT ATTGGTATCG GTTATGGTGC CTACGTGGAC GCGAATGCAC TTAACGGCAT TGCCATTGGT AGCAATGCGC AAGTCATTCA TGTCAACAGT ATTGCGATAG GTAATGGTTC TACGACCACT CGTGGCGCTC AAACCAATTA TACCGCCTAC AACATGGACG CACCGCAGAA CTCTGTCGGT GAATTCTCAG TCGGTAGTGC GGATGGTCAA CGTCAGATCA CAAACGTCGC AGCTGGTTCA GCGGATACCG ATGCGGTTAA CGTGGGTCAG TTGAAAGTCA CTGATGAGCG CGTAGCGCAA AATACCCAGT AG
|
Protein sequence | MATNTTNIAN NTSNIATNTT NISNLTETVT NLGEDALKWD KDNGVFTAAH GTETTSKITN VKDGDLTTGS TDAVNGSQLK TTNDAVATNT TNIATNTTNI SNLTETVTNL GEDALKWDKD NGVFTAAHGN NTASKITNIL DGTVTATSSD AINGSQLYDL SSNIATYFGG NASVNTDGVF TGPTYKIGET NYYNVGDALA AINSSFSTSL GDALLWDATA GKFSAKHGTN GDASVITDVA DGEISDSSSD AVNGSQLHGV SSYVVDALGG GAEVNADGTI TAPTYTIANA DYDNVGDALN AIDTTLDDAL LWDADAGENG AFSAAHGKDK TASVITNVAN GAISAASSDA INGSQLYTTN KYIADALDGD AEVNADGTIT APTYTIANAE YNNVGDALDA LDDNALLWDE TANGGAGAYN ASHDGKASII TNVANGSISE DSTDAVNGSQ LNATNMMIEQ NTQIINQLAG NTDATYIQEN GAGINYVRTN DDGLAFNDAS AQGVGATAIG YNSVAKGDSS VAIGQGSYSD VDTGIALGSS SVSSRVIAKG SRDTSITENG VVIGYDTTDG ELLGALSIGD DGKYRQIINV ADGSEAHDAV TVRQLQNAIG AVATTPTKYF HANSTEEDSL AVGTDSLAMG AKTIVNGDKG IGIGYGAYVD ANALNGIAIG SNAQVIHVNS IAIGNGSTTT RGAQTNYTAY NMDAPQNSVG EFSVGSADGQ RQITNVAAGS ADTDAVNVGQ LKVTDERVAQ NTQ
|
| |