Gene Information |
Locus tag | SbBS512_E1053 |
Symbol | |
ID | 6270744 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 964377 |
End bp | 966083 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641725193 |
Product | flagellin |
Protein accession | YP_001879712 |
Protein GI | 187730503 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
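The length fields in the table above can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but the 1707 bp figure is consistent with it) and that the final codon is the stop codon, which is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 964377
end_bp = 966083

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is read as consecutive codons.
# Assumption: the last codon is the stop codon and encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1707 0 568
```

Both results agree with the Gene Length (1707 bp) and Protein Length (568 aa) rows.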
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC GCGAAGGATG ACGCCGCAGG TCAGGCGATT GCTAACCGTT TTACTTCTAA TATTAAAGGC CTGACTCAGG CGGCCCGTAA CGCCAACGAC GGTATCTCCG TTGCGCAGAC CACTGAAGGC GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATTCGTG AACTGACGGT TCAGGCTTCT ACCGGGACTA ACTCCGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTG GACGAAATTG ACCGCGTATC TGGCCAGACC CAGTTCAACG GCGTGAACGC ACTGGCGAAA GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCC AGACTATCAC GATTGATCTG AAGAAAATTG ACTCAGATAC GCTGGGGCTG AATGGTTTTA ACGTGAATGG CAAAGGCACT ATTGCGAACA AAGCTGCTAC AGTCAGCGAT CTGACCGCTG CTGGTGCAAC GGGAACAGGT CCTTATGCTG TGACCACAAA CAATACAGTA CTCAGCGCTA GCGATGCACT GTCTCGCCTG AAAACCGGAG ATACAGTTAC TACTACTGGC TCGAGTGCTG CGATCTATAC TTATGATGCG GCTAAAGGGA ACTTCACCAC TCAAGCAACA GTTGCAGATG GCGATGTTGT TAACTTTGCG AATACTCTGA AACCAGCGGC TGGCACTACT GCATCAGGTG TTTATACTCG TAGTACTGGT GATGTGAAGT TTGATGTAGA TGCTAATGGC GATGTGACCA TCGGTGGTAA AGCCGCGTAC CTAGACGCTA CTGGTAACCT ATCTACAAAC AACGCCGGCA TTGCATCTTC AGCGAAATTG TCCGATCTGT TTGCTAGCGG TAGTACCTTA GCGACAACTG GTTCTATCCA GTTGTCTGGC ACAACTTATA ACTTTGGTGC AGCGGCAACT TCTGGCGTAA CCTACACCAA AACTGTAAGC GCTGATACTG TACTGAGCAC AGTGCAGAGT GCTGCAACGG CTAACACAGC AGTTACTGGT GCGACAATTA AGTATAATAC AGGTATTCAG TCTGCAACGG CGTCCTTCGG TGGTGCGAAT ACTAATGGTG CTGGTAATTC GAATGACACC TATACTGATG CAGACAAAGA GCTCACCACA ACCGCATCTT ACACTATCAA CTACAACGTC GATAAGGATA CCGGTACAGT AACTGTAGCT TCAAATGGCG CAGGTGCAAC TGGTAAATTT GCAGCTACTG TTGGGGCACA GGCTTATGTT AACTCTACAG GCAAACTGAC CACTGAAACC ACCAGTGCAG GCACTGCAAC CAAAGATCCT CTGGCTGCCC TGGATGAAGC TATCAGCTCC ATCGACAAAT TCCGTTCATC CCTGGGTGCT ATTCAGAACC GTCTGGATTC TGCAGTCACC AACCTGAACA ACACCACTAC CAACCTGTCT GAAGCGCAGT CCCGTATTCA GGACGCCGAC TATGCGACCG AAGTGTCCAA CATGTCGAAA GCGCAGATCA TCCAGCAGGC CGGTAACTCC GTGCTGGCAA AAGCCAACCA GGTACCGCAG CAGGTTCTGT CTCTGCTGCA GGGTTAA
|
Protein sequence | MAQVINTNSL SLITQNNINK NQSALSSSIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG LTQAARNAND GISVAQTTEG ALSEINNNLQ RIRELTVQAS TGTNSDSDLD SIQDEIKSRL DEIDRVSGQT QFNGVNALAK DGSMKIQVGA NDGQTITIDL KKIDSDTLGL NGFNVNGKGT IANKAATVSD LTAAGATGTG PYAVTTNNTV LSASDALSRL KTGDTVTTTG SSAAIYTYDA AKGNFTTQAT VADGDVVNFA NTLKPAAGTT ASGVYTRSTG DVKFDVDANG DVTIGGKAAY LDATGNLSTN NAGIASSAKL SDLFASGSTL ATTGSIQLSG TTYNFGAAAT SGVTYTKTVS ADTVLSTVQS AATANTAVTG ATIKYNTGIQ SATASFGGAN TNGAGNSNDT YTDADKELTT TASYTINYNV DKDTGTVTVA SNGAGATGKF AATVGAQAYV NSTGKLTTET TSAGTATKDP LAALDEAISS IDKFRSSLGA IQNRLDSAVT NLNNTTTNLS EAQSRIQDAD YATEVSNMSK AQIIQQAGNS VLAKANQVPQ QVLSLLQG
|
| |
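The relationship between the two sequences can be checked with a short translation sketch. It relies on two points: translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and the gene sequence is listed here in coding orientation (it begins with ATG and ends with TAA) even though the gene lies on the minus strand. The helper name `translate` and the constants are hypothetical and written for this illustration; they are not part of the database.

```python
# Compact standard codon table: bases in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the record
    can be pasted in directly. Trailing bases that do not fill a codon
    are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("ATGGCACAAG TCATTAATAC C"))  # MAQVINT
# Last five codons, ending in the TAA stop.
print(translate("CTGCTGCAGGGTTAA"))  # LLQG
```

The two fragments reproduce the start (MAQVINT) and end (LLQG) of the listed protein sequence. Passing the full gene sequence to the same function is the natural way to compare against all 568 residues.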