Gene Msil_2556 details

Gene Information

Locus tag: Msil_2556
Symbol:
ID: 7093210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2790826
End bp: 2793156
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 62%
IMG OID: 643465872
Product: pentapeptide repeat protein
Protein accession: YP_002362842
Protein GI: 217978695
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
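
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the convention that reproduces the listed gene length) and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2790826, 2793156)
print(length)                  # 2331
print(protein_length(length))  # 776
```

Both values agree with the Gene Length and Protein Length fields of the record.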


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.684592
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACA GCGACAGGGA GGCCGAGAAG AAGGAGCTCG AGGCGCTGAC CGCCGCGCTC 
AATCGCTCGG CCGAGCGCGT TCAGACGCTA TGGCTGAGCT TTATCGTCTT CATGCTCTAT
CTCGCCATCG CCGCCGGTAC GACGACGCAT CGCATGCTGT TTCGGGGCGA TCCGCTAAAA
CTGCCAGTGC TCAATATCGA TCTGCCTTTG CTCGGCTTTT ATACGCTGGC GCCAGTTCTG
CTTGTTATAT TTCAATTCTA CGTGCTGCTT AATCTGATGC TGCTTGCCCG CACCGCTTCG
TCGTTCGAGG CGGTGTTGCC GAAGGTCTTT CCACCGGCGA TGCCGGGAAA TGACAAATTT
CGCATGCGCA TTGAGAATGC GCTGTTCGCG CAACTTGTCG CGGGCGCGAA ACTTGAGCGG
CTAGGGCGCA ACGCTTGGCT GCTCGGCTTT ATGAGCTTCC TCACCGTAGC CGTCGCGCCA
GCGGCGGCGC TGTTGCTGAT CGAAATCCGG TTCCTGCCGT ACCACCATGA CGCCATCACA
TGGCTGCATC GCGGCTTGCT GTTGCTGTCG CTTGGCCTGT CCATCACGTT CTGGCCCGCC
TATCGAAACG GTTTCGGCGA GCGCTGGGGT CCGGCGAGCG CGGTGGGATG GGCGGTTGGA
GTGGCGGCGG CGAGCGTCGT GATGGCCTAT GCGTCGCTCG TCGCAAATTT TCCGGGCGAG
AAGCTCTACG CATGGACCGA GCCGACGCGA CTGTGGCTCG GCGTCTATGA GGCGAGCTTC
AGGACCATGA AGTGGCCCGA GTTCGCAGCG GCAGAGGAGA AAAGGAAAAA AGATAGATTT
GGCATCGTCG CGCCATTCTT GGCTGCGCTC TGGCCAGCGA GCGCGTTGAG CCTGCGCAAC
GAGGATCTGA TCGATGACTC GCGGCGCAAG CAGATCGCAG ACCGTCGCAC GACTAATGGC
GAAGCGCATG AACCGAGCGC TGACGGGAGC GGCGCGGGAA AAAACTCGGC AGGCCAGACA
AAATGGATCG AAAGCATAAA TCTGCGCGAC CGCAATCTCA TTGCGGCCGA TCTGACCAAT
GCCGATGTGG AGCAGGCCGA GTTTTCCGGG GCCGATCTTT CTCGCTCATT GCTGTTGAGG
ACATGGACGC CAAGGGCACA TTTCGCAAAC AGCGATACTA AGCCGGCGCA GCTAAAGGGC
GCATCGCTCG CTGGGGCGCA GCTGCAGGGA GCGTCGCTCG ATCGAGCGCT GCTGCAGGGA
GCCGCGCTCA AAGAGGCGCA GCTGCAAGGC GCGTCGCTCG TTTTGGCGAA TTTGCAAGGT
GCGTCGCTCA TTAACGCGCA GCTGCAGGGA GCATCGCTCG GGGTGGCGCA GCTGCAAGGC
GCATCGCTCG TTAGGGCGCA GCTGCAGGGC GCGTCGCTCG ACTGGGCACA GTTGCAGGGC
GCGTCGCTCG CTGGGGCGCA GCTGCAGGGA GCGTCGCTCG ATAGGGCGGA GCTGCAGGGC
GCGTCGCTCG ACTGGGCACA GTTGCAGGGC GCGTCGCTCT ACCAGGCGCA GCTGCAGGGC
GCGTCGCTCG ATTGGGCGCA TTTGCAGGGC GCCTCGCTCG AATGGGCACA GTTGCAGGGC
GCGTCGCTCG CTGGGGCGCG GCTGCAGGGC GCGTTGCTCG ATAGAGCAGG GCTGCAGGGC
TCGTTGCTCG ATGGGACGCA GCTGCAGGGC GCGTCGCTCG ATAGGGCGCA GCTGCGGGGC
GCGTCGCTCT CTGGTGCTCT GACTTGGCGT GCGGATGTGC TTAACGCAAA CGCCGGAGGA
GCCCTAATCG CTTTGCCATT CTCGGGCCAA TTTTTCGATT GCGAGAATAT TCGGTTACAG
ATCCGGCGTC TCTATCGAAT ATGCACAAAA GGTGAGAAAG ACTTCAAATC CTTTATGCGA
GTTTTCGACG CCGTCCCCGA GGGTGGCCTC AAAAAAAAGG CGACCCTGCG CCTCACGAAA
AGCCTGAAGG TCGAAGCTTT TCCGGACCAG GCACCGATCG TTAGAAAAAT CGCCGCGCGC
TGGACCGCGC TTGCAAAAAC GCCGCTTCCG CCAGCGGATC TTCTGCAAGA ATGGCCTGAC
ATCGCCTGCG AAGAAATCGG CGCGCCCTAT GTCGCGCGGG CGCTGCTCGG ACGTCTGAGG
GATTATCGTT TCAAGACCGA AGCCGCCGCT AAAGCCGCTA GGGCCGATTT TTCCAGCGGT
CTGCTCCGCA AGGCATACTG CCTCGGCGCT AAGGGCCTGA CCGACGCTGA AAAAGCCGAG
CTCCAATCCT TCATCGACGC CGCGAAGGTC CCAGCGTCGT CCGCGCCTTG A
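
The gene sequence is displayed in space-separated blocks of 10 bases, so whitespace has to be stripped before any analysis. As a rough sketch of how a figure like the GC content field could be recomputed, the snippet below cleans a pasted sequence and reports the G+C fraction. It uses only the first displayed line as input for brevity, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence only (60 of the 2331 bases).
first_line = "ATGGCGGACA GCGACAGGGA GGCCGAGAAG AAGGAGCTCG AGGCGCTGAC CGCCGCGCTC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # 70%
```

The 70% here describes only this 60-base fragment. The 62% in the record refers to the whole gene, so the full sequence would need to be passed in to compare against that field.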
 
Protein sequence
MADSDREAEK KELEALTAAL NRSAERVQTL WLSFIVFMLY LAIAAGTTTH RMLFRGDPLK 
LPVLNIDLPL LGFYTLAPVL LVIFQFYVLL NLMLLARTAS SFEAVLPKVF PPAMPGNDKF
RMRIENALFA QLVAGAKLER LGRNAWLLGF MSFLTVAVAP AAALLLIEIR FLPYHHDAIT
WLHRGLLLLS LGLSITFWPA YRNGFGERWG PASAVGWAVG VAAASVVMAY ASLVANFPGE
KLYAWTEPTR LWLGVYEASF RTMKWPEFAA AEEKRKKDRF GIVAPFLAAL WPASALSLRN
EDLIDDSRRK QIADRRTTNG EAHEPSADGS GAGKNSAGQT KWIESINLRD RNLIAADLTN
ADVEQAEFSG ADLSRSLLLR TWTPRAHFAN SDTKPAQLKG ASLAGAQLQG ASLDRALLQG
AALKEAQLQG ASLVLANLQG ASLINAQLQG ASLGVAQLQG ASLVRAQLQG ASLDWAQLQG
ASLAGAQLQG ASLDRAELQG ASLDWAQLQG ASLYQAQLQG ASLDWAHLQG ASLEWAQLQG
ASLAGARLQG ALLDRAGLQG SLLDGTQLQG ASLDRAQLRG ASLSGALTWR ADVLNANAGG
ALIALPFSGQ FFDCENIRLQ IRRLYRICTK GEKDFKSFMR VFDAVPEGGL KKKATLRLTK
SLKVEAFPDQ APIVRKIAAR WTALAKTPLP PADLLQEWPD IACEEIGAPY VARALLGRLR
DYRFKTEAAA KAARADFSSG LLRKAYCLGA KGLTDAEKAE LQSFIDAAKV PASSAP
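
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order.
# These are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGGCGGACA GCGACAGGGA GGCCGAGAAG AAGGAGCTCG AGGCGCTGAC CGCCGCGCTC"))
# MADSDREAEKKELEALTAAL
print(translate("CCAGCGTCGT CCGCGCCTTG A"))
# PASSAP
```

The two outputs match the first 20 and last 6 residues of the protein sequence, and the final TGA codon is consumed as the stop.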