Gene SeD_A2189 details


Gene Information

Locus tag: SeD_A2189
Symbol: flgK
ID: 6871661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2099869
End bp: 2101530
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 52%
IMG OID: 642785291
Product: flagellar hook-associated protein FlgK
Protein accession: YP_002215954
Protein GI: 198241785
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
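
The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive (the record does not say so, but this is the reading that reproduces the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def span_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Number of amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
gene_len = span_length(2099869, 2101530)
print(gene_len)                  # 1662
print(protein_length(gene_len))  # 553
```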


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0000358653
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCAGCT TGATTAATCA CGCCATGAGC GGACTTAACG CCGCGCAGGC CGCGTTAAAT 
ACGGTCAGTA ATAACATCAA CAATTATAAC GTTGCGGGTT ATACCCGGCA GACAACTATT
CTGGCGCAGG CAAACAGTAC GTTAGGGGCT GGCGGCTGGA TAGGTAATGG CGTTTACGTT
TCAGGCGTAC AGCGCGAATA TGATGCGTTT ATCACTAATC AGCTACGCGG CGCGCAAAAC
CAGAGCAGCG GCTTAACCAC GCGCTATGAA CAAATGTCGA AAATCGACAA CCTGCTGGCC
GATAAATCCA GCTCACTGTC TGGCTCGCTG CAGAGTTTTT TTACCAGCCT GCAAACGTTA
GTCAGTAACG CGGAAGATCC TGCGGCGCGT CAGGCGCTGA TTGGTAAAGC GGAAGGGCTG
GTAAACCAGT TCAAAACCAC CGATCAGTAT CTGCGCGATC AGGATAAACA GGTCAATATC
GCGATTGGCT CCAGCGTGGC GCAAATCAAC AATTACGCGA AGCAGATAGC TAACCTGAAC
GATCAAATCT CCCGTATGAC GGGCGTAGGC GCGGGCGCAT CGCCGAACGA CCTGCTCGAT
CAGCGTGATC AGTTGGTCAG CGAGCTTAAC AAGATCGTTG GCGTCGAGGT GAGTGTACAG
GACGGCGGCA CCTATAACCT GACGATGGCC AATGGCTATA CGCTGGTGCA GGGGTCGACG
GCGCGTCAGT TGGCGGCGGT TCCCTCCAGC GCCGACCCGA CGCGAACGAC TGTCGCTTAT
GTCGATGAGG CCGCCGGTAA CATCGAAATT CCGGAAAAGT TGCTGAACAC CGGTTCGCTC
GGCGGGCTAC TGACGTTCCG TTCTCAGGAT CTGGATCAGA CTCGTAATAC GCTGGGCCAG
TTGGCGTTGG CGTTTGCCGA TGCGTTTAAC GCGCAGCATA CCAAAGGTTA TGACGCCGAC
GGCAATAAAG GGAAAGACTT CTTTAGCATT GGCTCGCCGG TGGTATATAG CAACAGTAAT
AATGCCGATA AAACGGTATC GCTAACCGCT AAGGTGGTCG ACAGCACGAA GGTTCAGGCG
ACGGATTATA AGATTGTTTT TGACGGTACA GACTGGCAGG TTACTCGCAC TGCGGATAAC
ACCACCTTCA CGGCGACAAA AGATGCTGAC GGAAAACTGG AGATTGACGG TCTGAAAGTG
ACGGTAGGGA CCGGCGCACA GAAAAACGAC AGTTTTCTTC TCAAGCCGGT CAGCAATGCT
ATCGTCGACA TGAACGTTAA AGTGACGAAT GAAGCCGAGA TTGCGATGGC GTCTGAGTCA
AAACTCGATC CTGACGTGGA TACCGGCGAC AGCGATAACC GCAATGGTCA GGCATTGCTG
GACTTACAAA ACAGCAATGT AGTGGGCGGC AACAAAACTT TTAACGATGC TTACGCCACG
TTGGTCAGCG ATGTGGGTAA CAAAACGTCA ACGCTGAAAA CCAGCAGCAC CACGCAGGCG
AATGTGGTTA AACAGCTTTA TAAACAGCAA CAGTCGGTTT CCGGCGTTAA CCTCGACGAA
GAGTACGGCA ATTTGCAGCG TTATCAGCAG TATTATCTGG CGAATGCGCA AGTATTGCAG
ACCGCGAATG CGCTGTTTGA TGCGTTATTG AATATTCGCT AA
 
Protein sequence
MSSLINHAMS GLNAAQAALN TVSNNINNYN VAGYTRQTTI LAQANSTLGA GGWIGNGVYV 
SGVQREYDAF ITNQLRGAQN QSSGLTTRYE QMSKIDNLLA DKSSSLSGSL QSFFTSLQTL
VSNAEDPAAR QALIGKAEGL VNQFKTTDQY LRDQDKQVNI AIGSSVAQIN NYAKQIANLN
DQISRMTGVG AGASPNDLLD QRDQLVSELN KIVGVEVSVQ DGGTYNLTMA NGYTLVQGST
ARQLAAVPSS ADPTRTTVAY VDEAAGNIEI PEKLLNTGSL GGLLTFRSQD LDQTRNTLGQ
LALAFADAFN AQHTKGYDAD GNKGKDFFSI GSPVVYSNSN NADKTVSLTA KVVDSTKVQA
TDYKIVFDGT DWQVTRTADN TTFTATKDAD GKLEIDGLKV TVGTGAQKND SFLLKPVSNA
IVDMNVKVTN EAEIAMASES KLDPDVDTGD SDNRNGQALL DLQNSNVVGG NKTFNDAYAT
LVSDVGNKTS TLKTSSTTQA NVVKQLYKQQ QSVSGVNLDE EYGNLQRYQQ YYLANAQVLQ
TANALFDALL NIR
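
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal Python sketch. It builds the codon table from the standard NCBI ordering (bases T, C, A, G); the amino-acid assignments of translation table 11 are the same as those of the standard code, and table 11 differs only in its permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter). The helper names `clean`, `translate`, and `gc_fraction` are illustrative, and whitespace is stripped because the sequences above are printed in blocks of ten.

```python
# Codon table in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCAGCT TGATTAATCA CGCCATGAGC"))  # MSSLINHAMS
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above once its spaces are removed, and `gc_fraction` gives the value behind the GC content field.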