Gene SbBS512_E1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1053 
Symbol 
ID6270744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp964377 
End bp966083 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content50% 
IMG OID641725193 
Productflagellin 
Protein accessionYP_001879712 
Protein GI187730503 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC
GCGAAGGATG ACGCCGCAGG TCAGGCGATT GCTAACCGTT TTACTTCTAA TATTAAAGGC
CTGACTCAGG CGGCCCGTAA CGCCAACGAC GGTATCTCCG TTGCGCAGAC CACTGAAGGC
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATTCGTG AACTGACGGT TCAGGCTTCT
ACCGGGACTA ACTCCGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTG
GACGAAATTG ACCGCGTATC TGGCCAGACC CAGTTCAACG GCGTGAACGC ACTGGCGAAA
GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCC AGACTATCAC GATTGATCTG
AAGAAAATTG ACTCAGATAC GCTGGGGCTG AATGGTTTTA ACGTGAATGG CAAAGGCACT
ATTGCGAACA AAGCTGCTAC AGTCAGCGAT CTGACCGCTG CTGGTGCAAC GGGAACAGGT
CCTTATGCTG TGACCACAAA CAATACAGTA CTCAGCGCTA GCGATGCACT GTCTCGCCTG
AAAACCGGAG ATACAGTTAC TACTACTGGC TCGAGTGCTG CGATCTATAC TTATGATGCG
GCTAAAGGGA ACTTCACCAC TCAAGCAACA GTTGCAGATG GCGATGTTGT TAACTTTGCG
AATACTCTGA AACCAGCGGC TGGCACTACT GCATCAGGTG TTTATACTCG TAGTACTGGT
GATGTGAAGT TTGATGTAGA TGCTAATGGC GATGTGACCA TCGGTGGTAA AGCCGCGTAC
CTAGACGCTA CTGGTAACCT ATCTACAAAC AACGCCGGCA TTGCATCTTC AGCGAAATTG
TCCGATCTGT TTGCTAGCGG TAGTACCTTA GCGACAACTG GTTCTATCCA GTTGTCTGGC
ACAACTTATA ACTTTGGTGC AGCGGCAACT TCTGGCGTAA CCTACACCAA AACTGTAAGC
GCTGATACTG TACTGAGCAC AGTGCAGAGT GCTGCAACGG CTAACACAGC AGTTACTGGT
GCGACAATTA AGTATAATAC AGGTATTCAG TCTGCAACGG CGTCCTTCGG TGGTGCGAAT
ACTAATGGTG CTGGTAATTC GAATGACACC TATACTGATG CAGACAAAGA GCTCACCACA
ACCGCATCTT ACACTATCAA CTACAACGTC GATAAGGATA CCGGTACAGT AACTGTAGCT
TCAAATGGCG CAGGTGCAAC TGGTAAATTT GCAGCTACTG TTGGGGCACA GGCTTATGTT
AACTCTACAG GCAAACTGAC CACTGAAACC ACCAGTGCAG GCACTGCAAC CAAAGATCCT
CTGGCTGCCC TGGATGAAGC TATCAGCTCC ATCGACAAAT TCCGTTCATC CCTGGGTGCT
ATTCAGAACC GTCTGGATTC TGCAGTCACC AACCTGAACA ACACCACTAC CAACCTGTCT
GAAGCGCAGT CCCGTATTCA GGACGCCGAC TATGCGACCG AAGTGTCCAA CATGTCGAAA
GCGCAGATCA TCCAGCAGGC CGGTAACTCC GTGCTGGCAA AAGCCAACCA GGTACCGCAG
CAGGTTCTGT CTCTGCTGCA GGGTTAA
 
Protein sequence
MAQVINTNSL SLITQNNINK NQSALSSSIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG 
LTQAARNAND GISVAQTTEG ALSEINNNLQ RIRELTVQAS TGTNSDSDLD SIQDEIKSRL
DEIDRVSGQT QFNGVNALAK DGSMKIQVGA NDGQTITIDL KKIDSDTLGL NGFNVNGKGT
IANKAATVSD LTAAGATGTG PYAVTTNNTV LSASDALSRL KTGDTVTTTG SSAAIYTYDA
AKGNFTTQAT VADGDVVNFA NTLKPAAGTT ASGVYTRSTG DVKFDVDANG DVTIGGKAAY
LDATGNLSTN NAGIASSAKL SDLFASGSTL ATTGSIQLSG TTYNFGAAAT SGVTYTKTVS
ADTVLSTVQS AATANTAVTG ATIKYNTGIQ SATASFGGAN TNGAGNSNDT YTDADKELTT
TASYTINYNV DKDTGTVTVA SNGAGATGKF AATVGAQAYV NSTGKLTTET TSAGTATKDP
LAALDEAISS IDKFRSSLGA IQNRLDSAVT NLNNTTTNLS EAQSRIQDAD YATEVSNMSK
AQIIQQAGNS VLAKANQVPQ QVLSLLQG