Gene Msil_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0037 
Symbol 
ID7092365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp32603 
End bp33811 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content60% 
IMG OID643463370 
ProductOmpA/MotB domain protein 
Protein accessionYP_002360382 
Protein GI217976235 
COG category[N] Cell motility 
COG ID[COG1360] Flagellar motor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGAGA AAGAGCATCA GGAAATCGTC ATCGTAAAGC GCTACAGCCG TGACGACGAG 
GCAAGGCATG GCGGCGCATG GAAGATCGCC TTCGCCGATT TTATGACCGC CATGATGGCG
CTTTTTCTCG TGCTTTGGCT GATCAGCTCG ACGAGCGACA AGACCAAACA TTCGGTCGCC
CAATATTTCA ATCCGGTGAA GCTGGTCGAC ATGACAACCC TGAAAAAGGG ATTCCGCGAC
CCGAAAAAAA CGGAGATGGG CGCCGGACCG AAGACGACGG AATCGGAAAT TGACGCCGAT
AGCAACAAGG ATCTCGCCGA AACGCAGGAG GTCGCCGAAC ATCCTGGCGC CAAGGTTCGG
CTTATTTCGG AATCGAAGCT GTTTCGCGAT CCTTACGCCG CGCTTGCGGA AATCGCGGCG
AACGCAATCG AGGCGGCGCC GCATCCGAGT TCCGGCGAAC CGCAATCCGG ACCGACCGAA
TTTTCTGTAG AATCATCCGA TGTTTTCGTT GATCCGTTTA CGACGGCGCC TCGTCTGGCG
GATACAACTG TCGACGGACC GGCTTCCCAT GCAAAGCCAG CAATTCCCGA ACGCGACAAG
CAGGCGCCTT CAAGTCCGAA GCAACGCGCC GAACCTTTGC CGCACGCGGA GGAGCAACTG
AGTCCGCCGG CGGGAGCCGG CAAGGGGACA AAGACCGGGG ACGCCGCCAC CGAAGCGGCG
ATGGCGTCCG CGCAGCCCAT GGATACGGAA ACGGCAAGCT TGAAAGCGGG CTTGACGGCG
CTGGCGCCGC AGAAGGGGCG CTTCGGCGAC GGCCCGCGGA TCGAAGTCGA GAACACGGAC
GAGGGCCTTC TGATCAGCCT TACGGACGAT CGCAAATTTT CGATGTTCGC GATCGGATCG
GCGGCGCCTC TGCCACAGAC CATTGAGGCG ATGGCGAAGA TCGGCGATCT CTTGAAGACG
CGCTCCGGCA TGGTTGTTGT TCGCGGCCAT ACCGACGCGC GCCCCTTCAA ATCCGCGACC
TATGACAATT GGCGGCTCTC CACGGCGCGG GCGCATATGG CGCAATACAT GCTGACGCGC
GGCGGTCTCG ACGAGAAGCG TATCGAAAAG ATCGAAGGCT TCGCCGATCA TCGCCTGAAA
GTAGCGGCCG AGCCGACGGC GGCTGCAAAT CGCCGGATCG AAATCTTGTT GCGAAAGGTG
AAGTCGTGA
 
Protein sequence
MAEKEHQEIV IVKRYSRDDE ARHGGAWKIA FADFMTAMMA LFLVLWLISS TSDKTKHSVA 
QYFNPVKLVD MTTLKKGFRD PKKTEMGAGP KTTESEIDAD SNKDLAETQE VAEHPGAKVR
LISESKLFRD PYAALAEIAA NAIEAAPHPS SGEPQSGPTE FSVESSDVFV DPFTTAPRLA
DTTVDGPASH AKPAIPERDK QAPSSPKQRA EPLPHAEEQL SPPAGAGKGT KTGDAATEAA
MASAQPMDTE TASLKAGLTA LAPQKGRFGD GPRIEVENTD EGLLISLTDD RKFSMFAIGS
AAPLPQTIEA MAKIGDLLKT RSGMVVVRGH TDARPFKSAT YDNWRLSTAR AHMAQYMLTR
GGLDEKRIEK IEGFADHRLK VAAEPTAAAN RRIEILLRKV KS