Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2556 |
Symbol | |
ID | 7093210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 2790826 |
End bp | 2793156 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643465872 |
Product | pentapeptide repeat protein |
Protein accession | YP_002362842 |
Protein GI | 217978695 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.684592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGACA GCGACAGGGA GGCCGAGAAG AAGGAGCTCG AGGCGCTGAC CGCCGCGCTC AATCGCTCGG CCGAGCGCGT TCAGACGCTA TGGCTGAGCT TTATCGTCTT CATGCTCTAT CTCGCCATCG CCGCCGGTAC GACGACGCAT CGCATGCTGT TTCGGGGCGA TCCGCTAAAA CTGCCAGTGC TCAATATCGA TCTGCCTTTG CTCGGCTTTT ATACGCTGGC GCCAGTTCTG CTTGTTATAT TTCAATTCTA CGTGCTGCTT AATCTGATGC TGCTTGCCCG CACCGCTTCG TCGTTCGAGG CGGTGTTGCC GAAGGTCTTT CCACCGGCGA TGCCGGGAAA TGACAAATTT CGCATGCGCA TTGAGAATGC GCTGTTCGCG CAACTTGTCG CGGGCGCGAA ACTTGAGCGG CTAGGGCGCA ACGCTTGGCT GCTCGGCTTT ATGAGCTTCC TCACCGTAGC CGTCGCGCCA GCGGCGGCGC TGTTGCTGAT CGAAATCCGG TTCCTGCCGT ACCACCATGA CGCCATCACA TGGCTGCATC GCGGCTTGCT GTTGCTGTCG CTTGGCCTGT CCATCACGTT CTGGCCCGCC TATCGAAACG GTTTCGGCGA GCGCTGGGGT CCGGCGAGCG CGGTGGGATG GGCGGTTGGA GTGGCGGCGG CGAGCGTCGT GATGGCCTAT GCGTCGCTCG TCGCAAATTT TCCGGGCGAG AAGCTCTACG CATGGACCGA GCCGACGCGA CTGTGGCTCG GCGTCTATGA GGCGAGCTTC AGGACCATGA AGTGGCCCGA GTTCGCAGCG GCAGAGGAGA AAAGGAAAAA AGATAGATTT GGCATCGTCG CGCCATTCTT GGCTGCGCTC TGGCCAGCGA GCGCGTTGAG CCTGCGCAAC GAGGATCTGA TCGATGACTC GCGGCGCAAG CAGATCGCAG ACCGTCGCAC GACTAATGGC GAAGCGCATG AACCGAGCGC TGACGGGAGC GGCGCGGGAA AAAACTCGGC AGGCCAGACA AAATGGATCG AAAGCATAAA TCTGCGCGAC CGCAATCTCA TTGCGGCCGA TCTGACCAAT GCCGATGTGG AGCAGGCCGA GTTTTCCGGG GCCGATCTTT CTCGCTCATT GCTGTTGAGG ACATGGACGC CAAGGGCACA TTTCGCAAAC AGCGATACTA AGCCGGCGCA GCTAAAGGGC GCATCGCTCG CTGGGGCGCA GCTGCAGGGA GCGTCGCTCG ATCGAGCGCT GCTGCAGGGA GCCGCGCTCA AAGAGGCGCA GCTGCAAGGC GCGTCGCTCG TTTTGGCGAA TTTGCAAGGT GCGTCGCTCA TTAACGCGCA GCTGCAGGGA GCATCGCTCG GGGTGGCGCA GCTGCAAGGC GCATCGCTCG TTAGGGCGCA GCTGCAGGGC GCGTCGCTCG ACTGGGCACA GTTGCAGGGC GCGTCGCTCG CTGGGGCGCA GCTGCAGGGA GCGTCGCTCG ATAGGGCGGA GCTGCAGGGC GCGTCGCTCG ACTGGGCACA GTTGCAGGGC GCGTCGCTCT ACCAGGCGCA GCTGCAGGGC GCGTCGCTCG ATTGGGCGCA TTTGCAGGGC GCCTCGCTCG AATGGGCACA GTTGCAGGGC GCGTCGCTCG CTGGGGCGCG GCTGCAGGGC GCGTTGCTCG ATAGAGCAGG GCTGCAGGGC TCGTTGCTCG ATGGGACGCA GCTGCAGGGC GCGTCGCTCG ATAGGGCGCA GCTGCGGGGC GCGTCGCTCT CTGGTGCTCT GACTTGGCGT GCGGATGTGC TTAACGCAAA CGCCGGAGGA GCCCTAATCG CTTTGCCATT CTCGGGCCAA TTTTTCGATT GCGAGAATAT TCGGTTACAG ATCCGGCGTC TCTATCGAAT ATGCACAAAA GGTGAGAAAG ACTTCAAATC CTTTATGCGA GTTTTCGACG CCGTCCCCGA GGGTGGCCTC AAAAAAAAGG CGACCCTGCG CCTCACGAAA AGCCTGAAGG TCGAAGCTTT TCCGGACCAG GCACCGATCG TTAGAAAAAT CGCCGCGCGC TGGACCGCGC TTGCAAAAAC GCCGCTTCCG CCAGCGGATC TTCTGCAAGA ATGGCCTGAC ATCGCCTGCG AAGAAATCGG CGCGCCCTAT GTCGCGCGGG CGCTGCTCGG ACGTCTGAGG GATTATCGTT TCAAGACCGA AGCCGCCGCT AAAGCCGCTA GGGCCGATTT TTCCAGCGGT CTGCTCCGCA AGGCATACTG CCTCGGCGCT AAGGGCCTGA CCGACGCTGA AAAAGCCGAG CTCCAATCCT TCATCGACGC CGCGAAGGTC CCAGCGTCGT CCGCGCCTTG A
|
Protein sequence | MADSDREAEK KELEALTAAL NRSAERVQTL WLSFIVFMLY LAIAAGTTTH RMLFRGDPLK LPVLNIDLPL LGFYTLAPVL LVIFQFYVLL NLMLLARTAS SFEAVLPKVF PPAMPGNDKF RMRIENALFA QLVAGAKLER LGRNAWLLGF MSFLTVAVAP AAALLLIEIR FLPYHHDAIT WLHRGLLLLS LGLSITFWPA YRNGFGERWG PASAVGWAVG VAAASVVMAY ASLVANFPGE KLYAWTEPTR LWLGVYEASF RTMKWPEFAA AEEKRKKDRF GIVAPFLAAL WPASALSLRN EDLIDDSRRK QIADRRTTNG EAHEPSADGS GAGKNSAGQT KWIESINLRD RNLIAADLTN ADVEQAEFSG ADLSRSLLLR TWTPRAHFAN SDTKPAQLKG ASLAGAQLQG ASLDRALLQG AALKEAQLQG ASLVLANLQG ASLINAQLQG ASLGVAQLQG ASLVRAQLQG ASLDWAQLQG ASLAGAQLQG ASLDRAELQG ASLDWAQLQG ASLYQAQLQG ASLDWAHLQG ASLEWAQLQG ASLAGARLQG ALLDRAGLQG SLLDGTQLQG ASLDRAQLRG ASLSGALTWR ADVLNANAGG ALIALPFSGQ FFDCENIRLQ IRRLYRICTK GEKDFKSFMR VFDAVPEGGL KKKATLRLTK SLKVEAFPDQ APIVRKIAAR WTALAKTPLP PADLLQEWPD IACEEIGAPY VARALLGRLR DYRFKTEAAA KAARADFSSG LLRKAYCLGA KGLTDAEKAE LQSFIDAAKV PASSAP
|
| |