Gene Msil_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0042 
SymbolflgE 
ID7092370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp38578 
End bp39822 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content57% 
IMG OID643463375 
Productflagellar hook protein FlgE 
Protein accessionYP_002360387 
Protein GI217976240 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGT TTAGCGCATT GACCGCAAGC GTATCCGGCA TGGCCGCGCA AGCGAACAAA 
TTGTCGACGG TGTCCGACAA TATCGCGAAT TCCGACACGA CCGGCTACAA ACAGGCCATG
ACGGAGTTTG AGAATCTGAT TAGCCAGGCC GGAACCTCGT CTTACAATGC AAGCGGCGTC
GCCACTGTGG TGCGCTACAA TATCAGCGAA CACGGCAATC TCAAATCGAC GACGTCGTCG
ACGGATCTTG CCATACAGGG AAATGGATTC TTTCTAGTCG GTGATGCAAA TGGCTCGGTG
TTTCTGACGC GCGCGGGCAA TTTTAAGCCG GATGCGACTG GAAATCTCGT CAACGCAGCC
GGTTTCACGC TGCTCGGATA CAGCGCTTCG TCGGGAAATT CCTCCAGCGC AGGCCTGAGC
GGGCTGGAGC CGGTCAATAT TTTCGGGGAC GGTCTCAAAG CCGTCCCCTC CACAACAGGA
AAATTGACGG CCAATCTCAA CTCCAATGCT GACATCGTCA CGGGGAGCTT GCCGAGCGCC
AATGTCGCCG GGTCAGTTTA TACGTCCAAG ACGTCGTTGG TGACATACGA CAATCTCGGA
AACGCCGTTA AGCTCGATGT ATACTACAGT AAAACGGGCA CAAACACTTG GGAGGCGTCG
ATTTACAACG CGGCGGACGC CGCGCCCGGC GGCGGCTTTC CCTATTCCGG CGCCGCGCTC
GCCACGCAAA CATTGAACTT CAGCGCAAGC GACGGCAGTC TGACCGGACA GAGCTCCATC
TCGCTGTCTG TTCCGGGCGG AGCGAACGTC GCTATCGACC TTTCCGGCAC GACGCAGCTT
GCGTCCGCTT TCGAAGTCTC GGCGGCGACC ACCAATGGAA ATGCGCCAAG CGCTGTGCAG
AGCGTCAGCA TAAGCCCCGA TGGAACTTTG TCCGAGGTTT TTGTCAATGG GACCCAAAAG
GCGATCTTCA CAATTCCGCT CGCAACGGTC GCCAGCGTCG ACAATATGAC CTCCCTGGCG
GGTGACGTGT TCTCGGACAA TAGTCTCTCG GGACCGATCC TTGTCGGAGA TCCCGGCCTT
GGCGGCTTTG GTTCGATCCA GTCGGAAAAG CTCGAATCCT CGACTGTCGA TCTTGCCTCG
CAATTGACCG ACATGATCGT CGCGCAGCGT TCCTATGAGT CGAACTCCAA GGTGTTCCAG
ACCGGATCGG AACTTTTGTC GACGTTGAAC AACATGCTCA AGTGA
 
Protein sequence
MSLFSALTAS VSGMAAQANK LSTVSDNIAN SDTTGYKQAM TEFENLISQA GTSSYNASGV 
ATVVRYNISE HGNLKSTTSS TDLAIQGNGF FLVGDANGSV FLTRAGNFKP DATGNLVNAA
GFTLLGYSAS SGNSSSAGLS GLEPVNIFGD GLKAVPSTTG KLTANLNSNA DIVTGSLPSA
NVAGSVYTSK TSLVTYDNLG NAVKLDVYYS KTGTNTWEAS IYNAADAAPG GGFPYSGAAL
ATQTLNFSAS DGSLTGQSSI SLSVPGGANV AIDLSGTTQL ASAFEVSAAT TNGNAPSAVQ
SVSISPDGTL SEVFVNGTQK AIFTIPLATV ASVDNMTSLA GDVFSDNSLS GPILVGDPGL
GGFGSIQSEK LESSTVDLAS QLTDMIVAQR SYESNSKVFQ TGSELLSTLN NMLK