Gene SbBS512_E2243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2243 
SymbolflgK 
ID6270889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2043137 
End bp2044780 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content52% 
IMG OID641726265 
Productflagellar hook-associated protein FlgK 
Protein accessionYP_001880749 
Protein GI187732740 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.39076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGCT TGATTAATAA CGCCATGAGC GGACTGAACG CGGCCCAGGC GGCGTTAAAT 
ACGGCAAGTA ATAATATCTC CAGCTATAAC GTTGCCGGAT ATACCCGCCA AACCACTATT
ATGGCGCAGG CCAATAGCAC GTTGGGCGCT GGCGGCTGGG TTGGCAATGG TGTCTACGTT
TCTGGTGTGC AGCGTGAGTA TGATGCGTTT ATCACCAACC AGTTACGTGC GGCGCAGACG
CAAAGTAGCG GCCTGACTGC CCGCTATGAG CAGATGTCGA AAATCGACAA TATGCTCTCC
ACCAGTACCT CTTCGCTGGC AACACAGATG CAGGATTTCT TCACCAGCCT GCAAACGCTG
GTGAGTAACG CGGAAGACCC GGCAGCGCGC CAGGCGCTGA TTGGGAAATC AGAAGGGTTG
GTGAATCAGT TTAAAACCAC CGATCAGTAT CTTCGCGACC AGGACAAACA GGTCAATATC
GCGATAGGTG CCAGCGTTGA TCAGATCAAC AATTACGCTA AACAAATCGC CAGCCTGAAC
GACCAAATCT CCCGCCTGAC AGGCGTGGGG GCAGGGGCGT CACCTAACAA TCTGCTGGAT
CAACGCGATC AACTGGTGAG CGAATTAAAC CAGATTGTTG GTGTAGAAGT CAGCGTTCAG
GATGGCGGCA CTTATAACAT CACGATGGCC AATGGTTACT CACTGGTTCA GGGAAGTACG
GCGCGGCAAC TGGCGGCAGT TCCTTCCAGC GCCGACCCTT CTCGTATGAC TGTCGCTTAC
GTTGATGGGA CGGCAGGCAA TATTGAGATC CCGGAGAAAT TACTGAATAC CGGGTCGCTG
GGCGGCATTC TGACATTCCG TTCTCAGGAT CTGGACCAGA CGCGTAATAC GCTTGGACAA
CTGGCGCTGG CATTTGCCGA GGCTTTCAAC ACCCAACACA AAGCCGGATT TGATGCCAAC
GGCGATGCTG GTGAAGATTT CTTTGCTATC GGTAAGCCCG CGGTTTTGCA AAACACGAAA
AACAAAGGTG ACGTTGCGAT CGGTGCCACG GTAACTGATG CCTCCGCGGT ACTGGCGACA
GATTACAAAA TCTCGTTCGA TAATAATCAG TGGCAGGTCA CCCGCCTTGC CAGCAATACC
ACTTTTACGG TGACGCCGGA TGCCAACGGT AAAGTGGCAT TTGATGGTCT GGAGTTGACG
TTTACAGGAA CGCCTGCCGT TAACGACAGC TTCACGCTGA AACCAGTAAG TGACGCCATC
GTCAACATGG ATGTATTAAT CACCGACGAA GCGAAAATAG CGATGGCGAG CGAAAAAGAT
GCGGGTGATA GCGATAACCG CAACGGTCAG GCCCTGCTGG ATCTGCAAAG CAACAGTAAA
ACGGTGGGCG GTGCGAAATC CTTTAACGAC GCTTATGCCT CGTTAGTGAG TGATATCGGT
AATAAAACCG CGACGTTGAA AACCAGTAGC GCCACGCAAG GTAATGTGGT GACGCAGCTT
TCCAATCAGC AGCAGTCTAT CTCTGGCGTC AATCTCGACG AAGAGTACGG AAATCTGCAA
CGTTTTCAGC AGTATTACCT GGCGAATGCG CAGGTTCTGC AGACGGCAAA CGCGATTTTT
GATGCGCTGA TTAACATTCG CTAA
 
Protein sequence
MSSLINNAMS GLNAAQAALN TASNNISSYN VAGYTRQTTI MAQANSTLGA GGWVGNGVYV 
SGVQREYDAF ITNQLRAAQT QSSGLTARYE QMSKIDNMLS TSTSSLATQM QDFFTSLQTL
VSNAEDPAAR QALIGKSEGL VNQFKTTDQY LRDQDKQVNI AIGASVDQIN NYAKQIASLN
DQISRLTGVG AGASPNNLLD QRDQLVSELN QIVGVEVSVQ DGGTYNITMA NGYSLVQGST
ARQLAAVPSS ADPSRMTVAY VDGTAGNIEI PEKLLNTGSL GGILTFRSQD LDQTRNTLGQ
LALAFAEAFN TQHKAGFDAN GDAGEDFFAI GKPAVLQNTK NKGDVAIGAT VTDASAVLAT
DYKISFDNNQ WQVTRLASNT TFTVTPDANG KVAFDGLELT FTGTPAVNDS FTLKPVSDAI
VNMDVLITDE AKIAMASEKD AGDSDNRNGQ ALLDLQSNSK TVGGAKSFND AYASLVSDIG
NKTATLKTSS ATQGNVVTQL SNQQQSISGV NLDEEYGNLQ RFQQYYLANA QVLQTANAIF
DALINIR