Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1988 |
Symbol | |
ID | 7094186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2158078 |
End bp | 2159268 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643465314 |
Product | hypothetical protein |
Protein accession | YP_002362292 |
Protein GI | 217978145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGGGG CCGGCGGAGG TAGCATAAAC GACACCCGGC GCAGATTTTT CGCCAAAGCT GGCGCCGCTC TCGCCTCGCT GCCTGTTTTG AGCATGGAAA GCAAGCAAGC GCAGGCGCAA GATTTCGATA ATTCCTCGGC CAATCATCCC GCAGGCAACG GTCCCCTCGT CATCGCTCGG CAGGGCAGCT TCATGGCAGG CGGTTCTGTG CTCCAGACGC CAGGCGTGTT CGATCCGACG GTGACGGGCG GCCCCGGCCA GACTCTTCAC GGCGATCATG CTTACGCGCA GTTTCAAATC CCGCAAAATC CGCGTTCTCT TCCACTGGTG TTTTGGCCGG GCGGCGGTCA GAGCGGCAAG AGCTTTGAGA CGACGCCCGA TGGGCGCGAG GGATATCAAT CGATTTTTCT ACGCCGCAAT TTTGGCGTCT ACATCATCGA TCAGCCGCGA CGCGCTCGCG CAGGCAACAC CACGGTAGGA ACCACTCTGA CGCCAACGCC CGGAGAACAA GATCTTTTTG TTGCGTGGCG ACTCGGCGTC TGGCCGAAGT TTTTTCCGAA CAGTCAATTC CCGCAGGCAA AACAGGGAAT CAACTCGCCC GCGCTCAACC AATTCTTCCG TTGGGCGACG CCGGACACAG GGCCGCAAGA CAGAAACGTC ATCACCGATG GAGTGGCGGC GCTTCTCGAA GAGATAGGAC CGTGCATCCT GATCATCCAC TCCGCCAGCG GCGTCCTTGG CTGGCTTACG GCGATGAAGA GCCCGAACGT CAAGGCGATC TATGCCTACG AGCCCGGTGG CGGAAACCCG GACTACGCTT TTCCATCCAA TGAGCTTCCG CCCGCGCTCG GCAGCGGGCC GACCTTGCTG AGCCCGAACC CCGTGCCGTT GTCGGATTTT CTCAAGCTCA CGAAAATCCC GATCCGGATG CAGTTCGGCG ACGGGCTTTC CGCGCAGTCA AGCCCCTATC CGCGCGTGCA GCTCTGGCTG AACCGCTTCA AGATGGCGCA GCAAATGGTG GCGGCGATCA ACAAACACGG CGGCAACGCC TCGATACTCC ACCTGCCGGA TATCGGGATT CGCGGCAACA CGCATTTCTC GTTTTCCGAT GCGAACAATC TGCAGATCGC CGACATCTTT TCACAATGGC TCGCCAAGAA TCGCCTCGAT GGTTATGGCG GCCATCGATA G
|
Protein sequence | MGGAGGGSIN DTRRRFFAKA GAALASLPVL SMESKQAQAQ DFDNSSANHP AGNGPLVIAR QGSFMAGGSV LQTPGVFDPT VTGGPGQTLH GDHAYAQFQI PQNPRSLPLV FWPGGGQSGK SFETTPDGRE GYQSIFLRRN FGVYIIDQPR RARAGNTTVG TTLTPTPGEQ DLFVAWRLGV WPKFFPNSQF PQAKQGINSP ALNQFFRWAT PDTGPQDRNV ITDGVAALLE EIGPCILIIH SASGVLGWLT AMKSPNVKAI YAYEPGGGNP DYAFPSNELP PALGSGPTLL SPNPVPLSDF LKLTKIPIRM QFGDGLSAQS SPYPRVQLWL NRFKMAQQMV AAINKHGGNA SILHLPDIGI RGNTHFSFSD ANNLQIADIF SQWLAKNRLD GYGGHR
|
| |