Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2936 |
Symbol | |
ID | 7092856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3235263 |
End bp | 3238331 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643466249 |
Product | hypothetical protein |
Protein accession | YP_002363214 |
Protein GI | 217979067 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.897534 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAT TGAGGCTTGT CGCAGGCGTC CTTGCCGCGC TCTTTTTTGC CGGCGCGACG GCCGCGCAGG CGGCGCCAGT GGCGGACGCC TATTTCTCCG CCGACGGACG CGATGCGCGC GGCGATGCGA TCGGCGCCGA TTGCACAAGA TCCGCGCCTT GCTCGAGCGT TCGCAAGGCT CTCGATTTTC TGAAAGCCAA TGTCACGCGC AACCCGGATC GTGATTATAC GCTCGCCTTC AGCGATCGCG GCGGCCCGTT CTACCTCGAC GACACGCTGC TGTTCACGAC GGCGCACACG CCTGCGGCCG GCCATTTCGC CAATCTGATC GCCTTTCCGG GCAAGGCGCC GGTCTTTAGC GGCGGAACGG ATATTTCGGG ATCATTCAAG GCGGCGACGC TGCCGACGAA CGGCGCCCCG GCCTGTGTCT CGCGCTGGCT CGCGCCTTAT GCGCCGCTGA CCAACGCCAA TAAAACGTCC TGGCGCTATG TCGGCGTCAT GTGGGTCGAT GGGCGCCGCG TGCAGCAGGT CATCAGCCCG CGCAAGGGCA ATCCATGGAA AACCATTCCC GGCGCGCCGG GAAGCCCCGT GGGGGCGAGC GTCCCCTCCA CCCTGACGGC GACGTTCAGC GCCGGCAAGC CGGACGTTTC CGTCAGCGAT CTCTCGAAAA TCGGCGAGGT CGGCGAGACG ATCGGCTTTA ATTCCAGCGT CGGCTCGTTC AAATGGATCA CCCGTTACTG GATCTTGAGC AAGAGCGCCG CCGCGGGAAG CGGGACGGTT ACGCTTGCGA GCGCTCCGAA CGGGCTGCCC CGAATCGCCT CCGACAGCGG GACGTCGATT GTTCAGGATC CGGTTCAATC GACGAACGCC GGCGACAGGA CGCGGCTTGG CGGCGTCAAC CGGTTCCCTG TTAACGGCGA TGACGCGCTC AGCGCCTATA ATCAAACCGA TGTGAAAATC CAGCTCCCCG GTTCAGCCTC GCAATCTTCG CTCGTCCCGG TCGAGAGCGC AGAGCCCGGC ATGGTGACGG TGAACAGCCA TCTGCATGCC TGGGGAAAGA TCTATGGCGG AACAGGCTAC CGCGTCTGGA ACCGCTTTGA AGATCTTGGC GCGGGCGGCT ACGCCGGCGA GCTTTATACC GACCGGACGA CCGGCGCGCT TTATTACACG CCGCGGCGCG GCGAGGATTG CGCTTCGATC AAGGTTGTGA TCCCGCGCCT CAAGCAATTG CTGCGGATTT CCAACGCGAA GGCCGATATC GCCCGCGTGG GCGGCAAGCT GGGCGCGCCG GTTGGCAATC TCAATATCAC AGGCGTTTCA TTCCAGCATA CCAACAGCAC CGTATTTACG GGCGAACATG ATCTTGCCGG AGCGACGGGC GGCTTCATGG GCACGCCATC CGCCACCGTG CTTGCGCTTA CGCCGTCGGT GGTCACAATC GGGGCGAAAC AGGTCGTCTT CGATGGGATC GAGGTGACCC ACTCCGGCGG CGCCGGCCTC TACATGGCGT TCGGATCCTC GCATATCACA TTGAAGAATA GCCGAATCCA CGAAGCCGGC GGAACGCTTG TCGGCTCCGG CACGGCGGTC GAGTATTACC TTAAATATCA GCATAATAAC GATGCTGGCT ATTACGGCGT CAGCGACTTC GGCCCCTCGA CCAAATCGAG CCGCGCCGAT CTCTCGGGCT GCTGCCAGAC GCTGCATAAC AACTTCATCT TTGGCGCGGG CGAGGTCTCG ACCGGAGCAA GCTGCGTCTC CTTCGTCGGG CTGCTCAAAT ATGAGGTGAC CTACAACAGC ATTCACGACT GCACGAGCTT CGGCCTCGCC AACGCCCAGA GCCACGGGGA GAGGGGCGCG CCCTATTTCA GCGGCTTGTA TGACCCCTAT TCCTATGAGT CCAATATTTC CCACAATGAA ATCTATTATT GCGGCTATGA GACCAATCTC GATGGGGTTT CGGTCCCGGG TTCGTCTATG GCGAATGACT TCGGCTGTTT CTACAGCAAT GGACCGGTCG ACGGCGCAGG CGATGGATCG CAACAAGGAC TCACCTTCAG CTACAATAGC GTCCATGACG TGAGTTCAGG CGCCTATCAG ACCGTGATGT GGAACGGCAG GGTCAGGCCG CATGGCTACG ATGGCGTGCT CGACTACCAT GACGGCAATA TTTCCTTTGG CCTCACCTTG CATCATAATC TGTTCTACAA TCGGGCCAAG GTGAACGGAT ATACGCCTTA CGCCAACCGC CTGGCCCAAC ACACCGGTAA TTTGCGCGAC AAGATCTACA ATAACATCTT TGCCAGCGTC TTCCCGGAAG GCTACGCCTA TGGCGCCGAC TATGCGATCC AACTGACGGT CCCGCCCTGG GACTTTAGGT GGATCGCCGG ACATCCCTGG GGCGGGGCCG GCGAACAGAG TTGCGGGCTT GCCGAAAAAC AGAGCTGCTA TGTCATCAAC AATGGCGTGC TCTATGTGCT CAAGGCCGTC TGTGACAAGG CCGCACCATG CATTGCAGGC GATAAGCCGC CAGGCTGCGC GCCAGGCGAG ACATGCTCGG ACGGCGCGCT GAACTGGACT GCTCTAAAGG CGCTTTATGA TCCGGTCCTG ACGGCGACGC GCAATATCTT TGCCTGGCAA GTCGTCAATT CGCAAAAGGC GGGCGCGCCG CAAAGCGGAA ACGAGGTCCC GCCGTTCAGC CAATTCTTCA AATTTGACTT CAATCTCTTT TACCAGGCGG GCGAGACGCT GGTCGATTAT GTGCGCCTGG GCAAGCCTCC GACGAGCTTT GTCGATTGGC GCGCTCTCTA CGGACAGGAC GTCCATTCGA TCGTCAACGA GGACCCGAAC TTCATGGATC TCGCCAATGG CGATTTCAAC TTCCGGGCGG AGGGGCGCGG CGCGCCCTGC CATGGCGGGC GTGGGGGCGT TTCGCCGGCA TGTGCGCTCG GCTTCGAGCC GTGGGACTAT AGCGATGTCG GGGCGCGGGA AGGCGACAGG CGGACAATTG CGGAAGATTC GCGGGCCGGC GCGCCGGCGA AGGCCGAGCC GCGAACGGAT CGCCCTTGA
|
Protein sequence | MLKLRLVAGV LAALFFAGAT AAQAAPVADA YFSADGRDAR GDAIGADCTR SAPCSSVRKA LDFLKANVTR NPDRDYTLAF SDRGGPFYLD DTLLFTTAHT PAAGHFANLI AFPGKAPVFS GGTDISGSFK AATLPTNGAP ACVSRWLAPY APLTNANKTS WRYVGVMWVD GRRVQQVISP RKGNPWKTIP GAPGSPVGAS VPSTLTATFS AGKPDVSVSD LSKIGEVGET IGFNSSVGSF KWITRYWILS KSAAAGSGTV TLASAPNGLP RIASDSGTSI VQDPVQSTNA GDRTRLGGVN RFPVNGDDAL SAYNQTDVKI QLPGSASQSS LVPVESAEPG MVTVNSHLHA WGKIYGGTGY RVWNRFEDLG AGGYAGELYT DRTTGALYYT PRRGEDCASI KVVIPRLKQL LRISNAKADI ARVGGKLGAP VGNLNITGVS FQHTNSTVFT GEHDLAGATG GFMGTPSATV LALTPSVVTI GAKQVVFDGI EVTHSGGAGL YMAFGSSHIT LKNSRIHEAG GTLVGSGTAV EYYLKYQHNN DAGYYGVSDF GPSTKSSRAD LSGCCQTLHN NFIFGAGEVS TGASCVSFVG LLKYEVTYNS IHDCTSFGLA NAQSHGERGA PYFSGLYDPY SYESNISHNE IYYCGYETNL DGVSVPGSSM ANDFGCFYSN GPVDGAGDGS QQGLTFSYNS VHDVSSGAYQ TVMWNGRVRP HGYDGVLDYH DGNISFGLTL HHNLFYNRAK VNGYTPYANR LAQHTGNLRD KIYNNIFASV FPEGYAYGAD YAIQLTVPPW DFRWIAGHPW GGAGEQSCGL AEKQSCYVIN NGVLYVLKAV CDKAAPCIAG DKPPGCAPGE TCSDGALNWT ALKALYDPVL TATRNIFAWQ VVNSQKAGAP QSGNEVPPFS QFFKFDFNLF YQAGETLVDY VRLGKPPTSF VDWRALYGQD VHSIVNEDPN FMDLANGDFN FRAEGRGAPC HGGRGGVSPA CALGFEPWDY SDVGAREGDR RTIAEDSRAG APAKAEPRTD RP
|
| |