Gene Information |
Locus tag | Msil_0039 |
Symbol | |
ID | 7092367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 35016 |
End bp | 36743 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643463372 |
Product | hypothetical protein |
Protein accession | YP_002360384 |
Protein GI | 217976237 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
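The coordinates and lengths listed above agree with one another, and a short sketch can confirm this. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. Both assumptions match the values in the table. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Msil_0039.
length = gene_length(35016, 36743)
print(length, protein_length(length))
```

With the values from this record, the sketch gives a gene length of 1728 bp and a protein length of 575 aa, matching the table.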
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTAG TCAATGGACA TTTTGGCGCG CGCACCATCG CTGCCGAATC GCCGTCGAGC GCTGCATTGG AGCGCGGCGG CGCCGCAGCT TCCCATGCGG AGCAGGGCGC GGCGTTCGGA AACATTATGG CGGACATCTC AAAAGCCGAC GGACCTGTTG CTCAAGGCTC CGCAACGCCG AAGGGATTGA GCCGCCCGAT CACGGCGCGC TTTGAAGAAG GAGAGAAAGC TTGCCCTTCT GCCGACCCAA TCCAAGGCGG GTCTGATGCG TCGGCCGACG CCGGGTCTGC GGCAGGGCCG CAACTACCGG GAGAGGCTGC TTTTGGCGCC GCGCGGCGCG CTTTCACCGC AGAGGGCTCC GGCGCTGGGA TCGTTAATGG AGTCGGTCAG AAGGCCGCGT CGCCAACGCG AGCCGACGCG GGCGTCGCCG CGCAATTCAC CGGCGATCCG GCCATTGACG CTGATCTGGC GCCGTGTTTT GGAGCTGGCG GCCGACACAA ATTTCACGAC GTGAAGGAGT CAAAAGAGAC CGCGCCAAAT CCTTCGACGC CAGATCATGT TACGACAAAG CCAAAAAGCG GCCTCGCCGG AGACGCCGCG ATTTCGGCGT CTGTCGTTCC GTCGGCCGGG GTTGCGAGCG GGCTGTCGCT CCGTTCCGAT CCAGCCGCCG CGCCTGTAGA TCCCTTGTCG GGCGCTGGGA GCGACTGGCT GACCAGACAT CCGGCCGCGC CGTCGCTTGC ACTGCCGAGC GAAATGCGTC TGGCCCGCGC GGCATCCGAA CCTGCCAGCG ATCTCGACTT GCTGGCGGCC GGACTGTCGG AGGCCCGAGG GCCAGGCGCT GAAAAAGCGC AACGCCGTGA AAGCCTGCTC CAGCAATCGT CTGGCTCAAG GGCGACACAG ATCGTCGCGG GCGCTTCGGA GGACTCATCG TCGAGCGCAA TGGCGATTCT TCGCGCGAGG CGCCCGACAG TGGCGAACCT CGCGCCCCAT TCAACGGCAC ATGAAGGCGC AACGCAACTG GATCCGCTGG CGGCTTCACG CAGCGAGGCC GATATTGGTC GAGAGATCGC GCGGCTTGGA TCCCAGGGTA GCGAGATCAA GGTCTCCGTC ATTGAGCGGC GGACTCATCT ACCGCCTGTG GTCGAGGCGG CAAAGCCAAT TGAGCAAATC GGCAATCAGA TCCTGACCGA AGCATCGCTT TTGTTGACGC CATCCGCCGG CGGGAGTGAA TCCCGAACCA CGACACAGCG TCCCGCCGGC GCATCCACGA CTGGCGTGGA GGCCCCGCAA AAACCGATGC CCGTGATGAA AACTCTCGAT CTGAGGCTGG AGCCCGAAAG CCTTGGCGCC GTCACCATTC GCTTGCGCCT TTCCGGCGTC CAGCTCGAGG TGCAGGTCGA GGCGTCCCAT GTTCAAACGA TGAAGCTGAT CCGAGACGAC AAGGATCTGC TTTCCGCCAA GCTGCGGTCG TCCGGCTATG CAATGGATCA CCTTGTGGTG AAGCTTGCAG AGCAACAGAT CGGGCCGGCG CAAACTCGGG CGGAAGCGGG ACAAGGTCAT ACATTCGATG GTCAATCGGG GAATTTCACG CCCTCGTCAG AATTATCGCA GCAAGGGGGG TCCGGCGCCA ATGATCGGCA GGCCGCCAGG CGAGATCAGA CGACGGGATT TGGCAAAGGC GATGATGCGG AGGATCATGC TCGTCGCGGC GGCGATCTCT ATCTTTGA
|
Protein sequence | MNVVNGHFGA RTIAAESPSS AALERGGAAA SHAEQGAAFG NIMADISKAD GPVAQGSATP KGLSRPITAR FEEGEKACPS ADPIQGGSDA SADAGSAAGP QLPGEAAFGA ARRAFTAEGS GAGIVNGVGQ KAASPTRADA GVAAQFTGDP AIDADLAPCF GAGGRHKFHD VKESKETAPN PSTPDHVTTK PKSGLAGDAA ISASVVPSAG VASGLSLRSD PAAAPVDPLS GAGSDWLTRH PAAPSLALPS EMRLARAASE PASDLDLLAA GLSEARGPGA EKAQRRESLL QQSSGSRATQ IVAGASEDSS SSAMAILRAR RPTVANLAPH STAHEGATQL DPLAASRSEA DIGREIARLG SQGSEIKVSV IERRTHLPPV VEAAKPIEQI GNQILTEASL LLTPSAGGSE SRTTTQRPAG ASTTGVEAPQ KPMPVMKTLD LRLEPESLGA VTIRLRLSGV QLEVQVEASH VQTMKLIRDD KDLLSAKLRS SGYAMDHLVV KLAEQQIGPA QTRAEAGQGH TFDGQSGNFT PSSELSQQGG SGANDRQAAR RDQTTGFGKG DDAEDHARRG GDLYL
|
| |
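The gene and protein sequences above can be related with a minimal sketch that strips the display spacing, computes the GC fraction, and translates the CDS. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitting alternative start codons such as GTG and TTG. Because this gene begins with ATG, the sketch does not special-case the start codon, which is a simplification. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
# Codon table in the conventional TCAG ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence as displayed in the record.
print(translate("ATGAATGTAG TCAATGGACA TTTTGGCGCG"))  # MNVVNGHFGA
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared against the listed protein sequence and GC content.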