Gene Msil_3106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3106 
Symbol 
ID7092455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3409766 
End bp3412702 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content56% 
IMG OID643466416 
Productpentapeptide repeat protein 
Protein accessionYP_002363377 
Protein GI217979230 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTA CTCGTAAACC CGTTGCCCGC GCCGAGACTG GCATTTTGTT GGTCCGGGAC 
TCTACATTGA AGGCCGAATG GCCGAAATTC GTGGCCAACA TCGTCAAGGT GGTTGGCTCG
GGCTTTGCGA TTTTCCACGG CCACCTTGAT CATATCCCTG AAGGTATCTC TAGCTTGGTC
GAGGCTGCTG GTGCCATCTC TATAGATGCG CCGCCTGGAG AGCGCGGTTG GTTATTGTTC
GCGTTAGGAT TTGCCTGGGC TTTTGACGAG CTTAGAGCTC GCGGTCAACT TGACGACATC
GGTGTCGAAA CGGCCGTGCG CGATGCGCTC GCCGAAGCGA AGGAAAGAGT GGATCAAGGC
GAGCAGATCA TCCCGACGTC ATTCCTTGAC CAACCCACTA CGCTACCCCT CTACCGACTG
GTGCGAGACT CATTTGTCGA ACGGAAGGGC GCGTTCCGAG CGGCAGGGGG CGAGAAGGAC
GACATATTGC GGGCGCGTTT CGACGCGGCG TTCGATCGTG CAATATTCCT GCTTTGGTCA
CAGTCGCCGG AACAATACCA ACCGCTGGCA GCCGCCCTGA ATGCTCCGGG AGCAGCCGCC
CATGCGTTCC GTCTGGAATG GGACGCATAT CGAAAAGGCC TGATCCACGA ATTCGAGGTT
AAGCCGGTTT TTGGGCAAGA AACCCTCAAG ATTGCACTAT CGCAAATTTA TGTTCCGCTT
CGGGGAGTCT GGCGTGAAGA ACACGACGAT CCACGACCTC CGTCCCGAAG CGTCAAGACG
CGGGCGCTTA CACTCTCCCC TTCACGTTGG AAGGCGTCTC ATTTTGTCTG GTTGGACGAC
GAACTTGACG CTTGGGCGAC TGGTGGCCAC GAGGAATACG AGTTGAAACT GTTGGGCGGT
GGCCCCGGCA GCGGAAAGTC GACGATCGCG AAAGCTCTCG CGCGTAGGCT CGCGGCACGC
GACGACTGCC GACCTCTTTT CATCTCCCTC GCTAACATCG ACGCAGCTAT TGATCTGCGT
GAGGCGATCA ACGCCCACTT CACTAAGCGC ACGGCAAATC CATTCAGACA GCCGCCCCTT
GCCCGCAATG TGGTAGAGGA TGGGCCCCCG CTTGTTCTCA TCTTCGACGG CCTCGATGAA
CTGGCCCGAC CGGGGGAGGC TGCTAACGAC ATCGCTCGCG ACTTCATGGG CAAGCTTGGG
CAGCTTACGT CCGAATTGCG CGGAGATAAA AAGAGCCGTC TTCGGGTCAT CGTCACGGGG
CGAATGCCGT CCTTCCAGGC CGCTCAGCGC TTTGCGGGCG CGGTCAATAG GAAAAGCTAC
GAGGTCGTGG GCTTTCTTCC CATTAAGGGG GCCGAACTTT TTCAGACGAG CGAGGAATTC
GGCGGTAAGG GCCTAGACGT CGCTTCAGGG GTGCTAGAGT CTGTTCTGAC TGTTGCACCC
CCTCAAAAAG CAGCGCCTGA GCAGCAGAGC GACATTGCCC GCATCGATCA GAGGGAAACC
TGGTGGAAAG GTTATGCAGC AGCGGTCGGA CTTTCCTTAA AACCACCCCA TGCCTTATCC
GCCCCCGCCC TGACAGAACT CACTCGCGAG CCCTTGCTAT GCTACCTCCT CGTGCTCTCG
GGATATCTTA CCGAAGGAGC GCAATTAGCT GCCGGGAACC TGAACTTGAT CTATCGTGAA
CTGATTGACG AGGTGTGGCG ACGTGGTTGG GGAGACGACC CTGATAACGG GCGGCGCCCA
GGTGCCGGGA GATTTCTAAG CCAGGAGGGC TTCAATAGAC TGATGGAAAC CATTGCTCTG
GCCGGATGGC AGGGAGGCGA CACGCGAGTG TGCAGCGAAG CTCGATTTCA TAATGCGGCA
AGGTCTCTGC GAACTGGCAA TGCCTGGGAT GACTTCGTTC GAGACAACGG CTCCGATATT
ACTAACCTCG CAATGAACTT CTACCTCAAG AACGCTGAGA TTGAGACGCG TGGTTTTGAA
TTTACACATA AGAGCTTTGG CGACTATCTC GCCGGTCGAG CGCTTCTGGA AGTAGCGTTA
GATCTGATCA CTCTTATTGA TCGCCGGCCC GAGACGGCGA TGAAGGAATG GTTCGATGTC
ACGGCCACCG GCAACCTCAC CAACGAATTA TTGGACCACA TGCGGCGCGA ATTGCAATTA
CGTGCCAGCA GTTCTGAGGG ATTAGAAGAA GTTCGCGCAC TAAAATCTGG ATTCGAACGT
TTGGCTTCCG CGGTCATCCG TGATGGTTTG CCGGCGAATG GCAATGCGGA TCAGCCCTGG
AGAGTGGCGG AAGCAAGGCA GCGCAACGCC GAGGTCATGG TATGGGCGGT GCTCAATGCA
TGCTCATTAG CCATCGCCGC CACTGATCCT GAGCGCGCAA AAATTGAGGT TGCGTGGCCC
GATAAGGCCG CTTTCGGCAA CCTTCTGCGA CGAGTTGTAG GACAGACCGA TCTCACCACA
AGCTTCGGCC CGAAAATGTT CTATAAAATT TTGTTTTTTC AATTTCAATT TGCCATTGCC
ACGGCAGCTG CCCCGGTCAT GAAATGCTTC GCTTATCTCG TGGCTCCTGA AGCAGACTTG
ACAGGGCAAG CACTCTTTGC TGCTGATCTG CAACACGCCG ATCTGCGCCG CGCGCGCCTC
GAGTTAGCCT CGTTGCGCAC ATGCAAGTTG AAAGGGACCA ACCTCTCCAA CGCAAATTTC
AGCGGGTCGG TGATCGAAAC CACCGACTTC GAAGACGCTA ATCTGGAGGG CGCTCAATTC
AATGATGCTA AGCTGCTAGT TTCTCCTCTG GATAAGGCCA GTGTAAAGGG AGCTACGCTT
CTAAGGACGC TCATTCTCCC CGGCAAGGGC GACCGCGGCG CCGATCTCAA AAGGAGAGGA
GCGAACGTTA CAGGTGTTCA AATGGCAGAT ATGCTGCGGA AACCTGATTC AGTTTGA
 
Protein sequence
MARTRKPVAR AETGILLVRD STLKAEWPKF VANIVKVVGS GFAIFHGHLD HIPEGISSLV 
EAAGAISIDA PPGERGWLLF ALGFAWAFDE LRARGQLDDI GVETAVRDAL AEAKERVDQG
EQIIPTSFLD QPTTLPLYRL VRDSFVERKG AFRAAGGEKD DILRARFDAA FDRAIFLLWS
QSPEQYQPLA AALNAPGAAA HAFRLEWDAY RKGLIHEFEV KPVFGQETLK IALSQIYVPL
RGVWREEHDD PRPPSRSVKT RALTLSPSRW KASHFVWLDD ELDAWATGGH EEYELKLLGG
GPGSGKSTIA KALARRLAAR DDCRPLFISL ANIDAAIDLR EAINAHFTKR TANPFRQPPL
ARNVVEDGPP LVLIFDGLDE LARPGEAAND IARDFMGKLG QLTSELRGDK KSRLRVIVTG
RMPSFQAAQR FAGAVNRKSY EVVGFLPIKG AELFQTSEEF GGKGLDVASG VLESVLTVAP
PQKAAPEQQS DIARIDQRET WWKGYAAAVG LSLKPPHALS APALTELTRE PLLCYLLVLS
GYLTEGAQLA AGNLNLIYRE LIDEVWRRGW GDDPDNGRRP GAGRFLSQEG FNRLMETIAL
AGWQGGDTRV CSEARFHNAA RSLRTGNAWD DFVRDNGSDI TNLAMNFYLK NAEIETRGFE
FTHKSFGDYL AGRALLEVAL DLITLIDRRP ETAMKEWFDV TATGNLTNEL LDHMRRELQL
RASSSEGLEE VRALKSGFER LASAVIRDGL PANGNADQPW RVAEARQRNA EVMVWAVLNA
CSLAIAATDP ERAKIEVAWP DKAAFGNLLR RVVGQTDLTT SFGPKMFYKI LFFQFQFAIA
TAAAPVMKCF AYLVAPEADL TGQALFAADL QHADLRRARL ELASLRTCKL KGTNLSNANF
SGSVIETTDF EDANLEGAQF NDAKLLVSPL DKASVKGATL LRTLILPGKG DRGADLKRRG
ANVTGVQMAD MLRKPDSV