Gene Msil_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0038 
Symbol 
ID7092366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp33808 
End bp35019 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content64% 
IMG OID643463371 
Productchemotaxis protein 
Protein accessionYP_002360383 
Protein GI217976236 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCGAA GAGTAGAATT CCTTGCGGGC GCGTTGATCT GGTTGTTTCA GGGCTTCGCG 
GTCGCGCAGG ATGGCCATAG CGCGGCGCCG CAGGGGGGCG ACGTTCGGCC GTTCGAACTT
GTCCGCACGG TGGCGGCGCT GCAAGACCGG ATCGTCATGG GCGACGCCGC CGCCAAAGCC
AAATTGCCGC TCCTGATCAG CCAGATTTCC GACCGTCTTT TTGCCTGCGG CGCGGCGGTA
TGGGCGGAGG CGCGGAACGA TCACGCCATC GTCGCCTACA CGCTGAGCGG CGGCCAACCG
CGTGTAATCC GGAAGGTCTT GCAAACAGGG GCCGTCCCGC ACGCGGAGGC CGATCTCATG
GCGGGAGCGC TCGCCTATGC CGAAGGACAG GAGGCAAAAG CGAAACAGAT CCTTCTGCCG
ATCGAGGCGA CGAAGCTGCC GCCTTCCGTC GGCGGCCTTG TCGCGCTCGC CCAAGCCGCG
CTGCTCTCGA AAATTGATCC GCGCCGCGCG GCGCGTCTGC TGGACGAGGC GCGTATTCTC
GCGCCAGGCA CGCTCGTCGA GGAAGCGGCG TTGCGGAGAT CGGCGTTGCT CGCTGATGAA
GTCGCGGATT TCGATCGATT TATCAATGCG TCGAGCCAAT ATTTCCGTCG CTATTCGAAG
TCGCTTTACG CCAATGATTT CCGGCGACGC TTTGCCGAGT CGATCGTCCG CTTCGGTCTG
AAGGACGAGC CGGGACCGTC GGCGCGATTG ACGGGTCTGC TGAGCGAGCT CGACCGTCCT
TATCAGGCCG AGCTTTATCT GATCATCGCG CAAGCCGGGG TCCGAAACGG CAAGATCGGT
CCGGCGAAGG CGGCGGCAGA GAAGGCGCTG TCTCTTTCAG AGCAGGGCGG CGCCGCGAGG
TCGCGCGCAC AGCTTTACGC AGCGATAGCA AAAGTGCTCA TCGTCTCTCC CGCCGAAGGC
TTGACCGAAC TCGCGCAAAT CGATGACGCC GTTTTGCCGA GGGGCGATCG CGACCTCAAA
TCGGCGGTCG CCCAGCTCGC AACTCAGATC CAAAGATCGG CCGATGGAGG GCAGGCGCAG
GACGCCTCCT CGATTGATCG CGCGCCGGGG TCGGGGGGCG AGTCGCATGA CGCGAACGGC
TCCATGCTGA TCCAATCGGC GCAGGCTGCG CTGCAGCAAA CGGACGCGCT CTTGAGGAGG
TCGGCGCAAT GA
 
Protein sequence
MKRRVEFLAG ALIWLFQGFA VAQDGHSAAP QGGDVRPFEL VRTVAALQDR IVMGDAAAKA 
KLPLLISQIS DRLFACGAAV WAEARNDHAI VAYTLSGGQP RVIRKVLQTG AVPHAEADLM
AGALAYAEGQ EAKAKQILLP IEATKLPPSV GGLVALAQAA LLSKIDPRRA ARLLDEARIL
APGTLVEEAA LRRSALLADE VADFDRFINA SSQYFRRYSK SLYANDFRRR FAESIVRFGL
KDEPGPSARL TGLLSELDRP YQAELYLIIA QAGVRNGKIG PAKAAAEKAL SLSEQGGAAR
SRAQLYAAIA KVLIVSPAEG LTELAQIDDA VLPRGDRDLK SAVAQLATQI QRSADGGQAQ
DASSIDRAPG SGGESHDANG SMLIQSAQAA LQQTDALLRR SAQ