Gene EcolC_2518 details


Gene Information

Locus tag: EcolC_2518
Symbol: flgK
ID: 6067378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2769887
End bp: 2771530
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 52%
IMG OID: 641601924
Product: flagellar hook-associated protein FlgK
Protein accession: YP_001725476
Protein GI: 170020522
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
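
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2769887
end_bp = 2771530

gene_length = end_bp - start_bp + 1   # inclusive interval, in bp
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1644 547
```

Both values agree with the "Gene Length" and "Protein Length" fields.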


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.38905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00149416
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCAGCT TGATCAATAA CGCCATGAGC GGACTGAACG CGGCCCAGGC GGCGTTAAAT 
ACGGCAAGTA ATAATATCTC CAGCTATAAC GTTGCCGGAT ATACCCGCCA AACCACTATT
ATGGCGCAGG CCAATAGCAC GTTGGGCGCT GGCGGCTGGG TTGGCAATGG TGTCTACGTT
TCTGGTGTGC AGCGTGAGTA TGATGCTTTT ATTACCAACC AGTTACGTGC GGCGCAGACG
CAAAGTAGCG GCCTGACTGC CCGCTATGAG CAGATGTCGA AAATCGACAA TATGCTCTCC
ACCAGTACCT CTTCGCTGGC AACACAGATG CAGGATTTCT TCACCAGCCT GCAAACGCTG
GTGAGTAACG CGGAAGACCC GGCAGCGCGC CAGGCGCTGA TTGGGAAATC AGAAGGGTTG
GTGAATCAGT TTAAAACCAC CGATCAATAT CTGCGCGACC AGGACAAACA GGTCAATATC
GCGATAGGTG CCAGCGTTGA TCAGATCAAC AATTACGCTA AACAAATTGC CAGCCTGAAC
GATCAAATCT CGCGCCTGAC AGGCGTGGGG GCAGGGGCCT CACCTAACAA TCTGCTGGAT
CAACGCGATC AACTGGTGAG CGAATTAAAC CAGATTGTTG GCGTAGAAGT CAGCGTTCAG
GATGGCGGTA CTTATAACAT CACGATGGCC AATGGTTACT CACTGGTTCA GGGAAGTACG
GCGCGGCAAC TGGCGGCAGT TCCTTCCAGC GCCGACCCTT CTCGTACGAC TGTCGCTTAC
GTTGATGGGA CGGCAGGCAA TATTGAGATC CCGGAGAAAT TACTGAATAC CGGGTCGCTG
GGCGGCATTC TGACATTCCG TTCTCAGGAT CTGGACCAGA CGCGTAATAC GCTTGGACAA
CTGGCGCTGG CATTTGCCGA GGCTTTCAAC AGCCAACACA AAGCCGGATT TGATGCCAAC
GGCGATGAGG GTGAAGATTT CTTTGCTATC GGTAAGCCCG CGGTTCTGCA AAACACTAAA
AACAACGGTA ACGTTGCGAT CGGTGCCACG GTAACTGATG CCTCCGTGGT ACTGGCGACA
GATTACAAAA TCTCGTTCGA TAATAATCAG TGGCAGGTCA CCCGCCTTGC CAGCAATACC
ACTTTTACGG TGACACCAGA TGCCAACGGT AAAGTGGCAT TTGATGGTCT GGAGTTGACG
TTTACAGGAA CGCCTGCCGT TAACGACAGC TTCACGCTGA AACCAGTAAG TGACGCCATC
GTCAACATGG ATGTATTAAT CACCGACGAA GCGAAAATCG CGATGGCGAG CGAAGAAGAT
GCGGGTGATA GCGACAACCG CAACGGTCAG GCCCTGCTGG ATCTGCAAAG CAACAGTAAA
ACGGTGGGCG GAGCGAAATC CTTTAACGAC GCTTATGCCT CGTTAGTGAG TGATATCGGT
AATAAAACCG CGACGTTGAA AACCAGTAGC ACCACGCAAG GTAATGTGGT GACGCAGCTT
TCCAATCAGC AGCAGTCGAT TTCCGGTGTC AATCTCGATG AGGAGTACGG AAATCTGCAA
CGTTTTCAGC AGTATTACCT GGCGAATGCG CAGGTTCTGC AGACGGCAAA CGCGATTTTT
GATGCGCTGA TTAACATTCG CTAA
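
A GC-content figure like the one in the record can be recomputed from a sequence printed in 10-base blocks. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; `gc_content` is a hypothetical helper, and how the record's value was rounded is not stated.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()   # drop the spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# Example on the first line of the gene sequence only (not the whole gene).
first_line = "ATGTCCAGCT TGATCAATAA CGCCATGAGC GGACTGAACG CGGCCCAGGC GGCGTTAAAT"
print(round(gc_content(first_line), 1))
```

Passing the complete gene sequence instead of `first_line` gives the whole-gene figure.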
 
Protein sequence
MSSLINNAMS GLNAAQAALN TASNNISSYN VAGYTRQTTI MAQANSTLGA GGWVGNGVYV 
SGVQREYDAF ITNQLRAAQT QSSGLTARYE QMSKIDNMLS TSTSSLATQM QDFFTSLQTL
VSNAEDPAAR QALIGKSEGL VNQFKTTDQY LRDQDKQVNI AIGASVDQIN NYAKQIASLN
DQISRLTGVG AGASPNNLLD QRDQLVSELN QIVGVEVSVQ DGGTYNITMA NGYSLVQGST
ARQLAAVPSS ADPSRTTVAY VDGTAGNIEI PEKLLNTGSL GGILTFRSQD LDQTRNTLGQ
LALAFAEAFN SQHKAGFDAN GDEGEDFFAI GKPAVLQNTK NNGNVAIGAT VTDASVVLAT
DYKISFDNNQ WQVTRLASNT TFTVTPDANG KVAFDGLELT FTGTPAVNDS FTLKPVSDAI
VNMDVLITDE AKIAMASEED AGDSDNRNGQ ALLDLQSNSK TVGGAKSFND AYASLVSDIG
NKTATLKTSS TTQGNVVTQL SNQQQSISGV NLDEEYGNLQ RFQQYYLANA QVLQTANAIF
DALINIR
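
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below builds the codon table from the usual compact TCAG-ordered string and translates up to the first stop codon. Table 11 assigns the same amino acids as the standard code and differs in the start codons it permits. This gene starts with ATG, so the sketch does not handle alternative start codons. `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGTCCAGCT TGATCAATAA CGCCATGAGC"))  # MSSLINNAMS
print(translate("GATGCGCTGA TTAACATTCG CTAA"))        # DALINIR
```

The two outputs match the first ten and last seven residues of the protein sequence.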