Gene Msil_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0043 
SymbolflgK 
ID7092371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp39857 
End bp41347 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content60% 
IMG OID643463376 
Productflagellar hook-associated protein FlgK 
Protein accessionYP_002360388 
Protein GI217976241 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGG CGAATGCGGC GGCGATCGCG CAGTCGGGGT TAGCTTCGGT TACGACCGAG 
ATCGCAACAT TGTCGCGTAA CATTTCCGGC GCTAACGACA CCTCGGTTTA TTCGCGCAAG
ATCGCCAATG TCGTATCGAC CGCCTCTGGC TCGCAGGTCA CCTCGATCAG CCGCGCCTCG
AGTCAGGCGG TGTTCGAGAA TGTGCTTAAT GCGACCTCTG CCATTGCCGC TGAAGACGCT
GTATCGACAG GTCTTGAAGC GCTGGCGACG ACAGTTGGAG ATGTCGCCAG CGCCGCAGGC
GCGGACGCAA CCTCGACGGC GACGTCGCCA GCAGCCTTGA TCAGCCGATT GTCCGATGCG
CTGCAATATT ATTCCGGGTC GCCGAGCGAC ATGACGGCTG CGGCGAATGT CGTCTCCGCG
GCCAATGCTT TGGCAAGGGG GCTCAATCAG GGCTCGGCCG CCATTCAGCA GGCGCGCGCG
ACCGCCGACT CCGACATTGC GGCCGCCGTG TCGGATATAA ATTCACAGCT CGCCCAGTTC
CAAGAAGTCA ACGAAAAAAT CATCGCGGCT ACGGCCGTGG GCAAAGACAG CACGGATTTA
CAGGATCAGC GCGATACGAT CCTGAAGCAG ATTTCCGCGA ATATCGGCAT ATCGACCGTC
ACCGCGGGAA ATGGCGATAT GTCCCTCTAT ACCGACAGCG GCGTTACGCT TTTTCAGGGG
GGCCGCGCGC GGACAGTCAG CTTCACGCCG ACAACCACTT ATGTGACGGC GACGGTGGGA
CAGGCTGTTT ATGTCGACGG CGTGGCGATT ACGGGCGCTA CGGCTACCAT GGCGATTGCC
TCGGGCAAGA TCGCTGGCCT GGCGACAATC CGGGACGCCG TTGCAGTCAC CTATCAGGCG
CAGCTCGATG GCGTCGCAAG CGCTTTGATC ACCGCTTTTC GGGAAAGCGA TCAGGCAGCG
GTCGGCCCCG ATTTGCCTGG GCTCTTTACA ACGGCGAGCG CAACAGCAAT TCCATCTTCG
GCGGCCGGAT TGGCGAGCGC GATCATTGTT AACGCAGCGG TAGACCCCTC GCAAGGCGGG
GATTTGACCC TCCTGCGCGA CGGCGGCATC GCCGATCCAT CAGGCACGGA TTACACTTAC
AACACAAGCG GCGCCGCGAG CTTTGCGGGC CGCATCTCAG AATTGATCGA CAATCTATCC
GCAACACAAA GCTTTTCATC GTCTGGGAGC CTGACGACGA GCGCGAGCGT TGGAGGCTAT
GCCGCCGCGT CAGTCAGTTG GCTGGAGGCG CAACGGTCGG CCGCATCCTC GCGCAGCAGC
TACCAAGGCG CGCTTTTGAG CACCGCCTCG ACAGCATTGT CGAACGCGAC CGGCGTCAAT
ATCAACGATG AGATGTCAAA AATGCTTGAT CTCGAGCAGT CCTACGCTGC CTCAGCGAAA
TTGCTTAGCT CGATCAACGA TATGTTCAAC GCTCTCCTGT CGGGCATATA G
 
Protein sequence
MSLANAAAIA QSGLASVTTE IATLSRNISG ANDTSVYSRK IANVVSTASG SQVTSISRAS 
SQAVFENVLN ATSAIAAEDA VSTGLEALAT TVGDVASAAG ADATSTATSP AALISRLSDA
LQYYSGSPSD MTAAANVVSA ANALARGLNQ GSAAIQQARA TADSDIAAAV SDINSQLAQF
QEVNEKIIAA TAVGKDSTDL QDQRDTILKQ ISANIGISTV TAGNGDMSLY TDSGVTLFQG
GRARTVSFTP TTTYVTATVG QAVYVDGVAI TGATATMAIA SGKIAGLATI RDAVAVTYQA
QLDGVASALI TAFRESDQAA VGPDLPGLFT TASATAIPSS AAGLASAIIV NAAVDPSQGG
DLTLLRDGGI ADPSGTDYTY NTSGAASFAG RISELIDNLS ATQSFSSSGS LTTSASVGGY
AAASVSWLEA QRSAASSRSS YQGALLSTAS TALSNATGVN INDEMSKMLD LEQSYAASAK
LLSSINDMFN ALLSGI