Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_0038 |
Symbol | |
ID | 7092366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 33808 |
End bp | 35019 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643463371 |
Product | chemotaxis protein |
Protein accession | YP_002360383 |
Protein GI | 217976236 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCGAA GAGTAGAATT CCTTGCGGGC GCGTTGATCT GGTTGTTTCA GGGCTTCGCG GTCGCGCAGG ATGGCCATAG CGCGGCGCCG CAGGGGGGCG ACGTTCGGCC GTTCGAACTT GTCCGCACGG TGGCGGCGCT GCAAGACCGG ATCGTCATGG GCGACGCCGC CGCCAAAGCC AAATTGCCGC TCCTGATCAG CCAGATTTCC GACCGTCTTT TTGCCTGCGG CGCGGCGGTA TGGGCGGAGG CGCGGAACGA TCACGCCATC GTCGCCTACA CGCTGAGCGG CGGCCAACCG CGTGTAATCC GGAAGGTCTT GCAAACAGGG GCCGTCCCGC ACGCGGAGGC CGATCTCATG GCGGGAGCGC TCGCCTATGC CGAAGGACAG GAGGCAAAAG CGAAACAGAT CCTTCTGCCG ATCGAGGCGA CGAAGCTGCC GCCTTCCGTC GGCGGCCTTG TCGCGCTCGC CCAAGCCGCG CTGCTCTCGA AAATTGATCC GCGCCGCGCG GCGCGTCTGC TGGACGAGGC GCGTATTCTC GCGCCAGGCA CGCTCGTCGA GGAAGCGGCG TTGCGGAGAT CGGCGTTGCT CGCTGATGAA GTCGCGGATT TCGATCGATT TATCAATGCG TCGAGCCAAT ATTTCCGTCG CTATTCGAAG TCGCTTTACG CCAATGATTT CCGGCGACGC TTTGCCGAGT CGATCGTCCG CTTCGGTCTG AAGGACGAGC CGGGACCGTC GGCGCGATTG ACGGGTCTGC TGAGCGAGCT CGACCGTCCT TATCAGGCCG AGCTTTATCT GATCATCGCG CAAGCCGGGG TCCGAAACGG CAAGATCGGT CCGGCGAAGG CGGCGGCAGA GAAGGCGCTG TCTCTTTCAG AGCAGGGCGG CGCCGCGAGG TCGCGCGCAC AGCTTTACGC AGCGATAGCA AAAGTGCTCA TCGTCTCTCC CGCCGAAGGC TTGACCGAAC TCGCGCAAAT CGATGACGCC GTTTTGCCGA GGGGCGATCG CGACCTCAAA TCGGCGGTCG CCCAGCTCGC AACTCAGATC CAAAGATCGG CCGATGGAGG GCAGGCGCAG GACGCCTCCT CGATTGATCG CGCGCCGGGG TCGGGGGGCG AGTCGCATGA CGCGAACGGC TCCATGCTGA TCCAATCGGC GCAGGCTGCG CTGCAGCAAA CGGACGCGCT CTTGAGGAGG TCGGCGCAAT GA
|
Protein sequence | MKRRVEFLAG ALIWLFQGFA VAQDGHSAAP QGGDVRPFEL VRTVAALQDR IVMGDAAAKA KLPLLISQIS DRLFACGAAV WAEARNDHAI VAYTLSGGQP RVIRKVLQTG AVPHAEADLM AGALAYAEGQ EAKAKQILLP IEATKLPPSV GGLVALAQAA LLSKIDPRRA ARLLDEARIL APGTLVEEAA LRRSALLADE VADFDRFINA SSQYFRRYSK SLYANDFRRR FAESIVRFGL KDEPGPSARL TGLLSELDRP YQAELYLIIA QAGVRNGKIG PAKAAAEKAL SLSEQGGAAR SRAQLYAAIA KVLIVSPAEG LTELAQIDDA VLPRGDRDLK SAVAQLATQI QRSADGGQAQ DASSIDRAPG SGGESHDANG SMLIQSAQAA LQQTDALLRR SAQ
|
| |