Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2809 |
Symbol | |
ID | 7092972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3085251 |
End bp | 3087761 |
Gene Length | 2511 bp |
Protein Length | 836 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643466120 |
Product | hypothetical protein |
Protein accession | YP_002363089 |
Protein GI | 217978942 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGAA TTGGATCGAT CTATACATGC TTCTTGCCAA GGGCGCACGC GCTTGAGCCG TGGCCGCCAA GACGAGGACG CCGCCGCCTC GCCATCGCCG CCATGCTCAT CGGCCTGTCC ATCAATCCGC CCGGCGCCGC GCTGGCCGAC TCCGCCGCAA AGCCAATCAT CATCAACAAG CCGTTCTCGG CAAAGCCCGG CGATGTCTTC AGCCTTGCCG GATCAGGCTT CGGTTCGGCC CCACGGGTTT ATCTGAAGCC CAGCCGGTTA ACGGCGACGA CGGAGCTGCC CGTCAAGACA GCAGACGACG CAACCGTCGT CGTGGAGGTT CCCAAGGCTG CGGCCTTCGA CGTCTATGAG GTGTGGGTTG CAAATGGCGC GGCGACCAGC CCGCATGTTC TCATCAACAC GCCTCAGCCT CATCATTTCG ACAATCCCGA CATCGCGACC GGCGCGCATT TTCGTATCTT CGGCCGCAAC CTCTCGGTAG GGACGCTCGC GCCGACGGCG ACGCTTGTCG ACATCGGCAC GGGAGCGCAG ATCAAGGCGG TCGTCGGCGT TTCGGCGAGC AACGCCTATA TGCTCGACGT AACGCCGACC GGCGTTGTCG CCGGGCATGC CTATAAGGTT CTGGTTTCAA ATGGCTATGC GAGCGCCTTG AGCGAGGCGA GCGTGCTTGG CCACGCCACG GGCATGGACC GCTTCGCGAT CGGGCAGCCA TGGGCCTATG ATTTCGTCAC TGCCGATGGT CCCGGCTACA AGGCCGGCGT CAAGGGAACG AACCTCGCCG ACCATCATGT CTTCAATGTG CGAACGGACG TCGTTCTGGC GACATTGGCC AAAGGCGACG GAAAGACAAA CGACGGACCG GCCATTCAGG GGGCGATCAA CGCGGCGGCG CGTTACGGCG GCGTCGTCTA TTTGCCGGCG GGAACCTATA ATATCGGCTC CACCAGCATC AGTTTGACGT CCAACGTGCT GCTGCAGGGA CAGAGCGCGT CGGCGACGAA GATCATCTAC ACGGCGACGA AGCCGGGGTT CATCATCGGC GCAGGCGCGT CGATGACCGG CTTCGTCGAT CTCACCTTGC AAAATCAGGA CAAGACGAGC ACGACGACCA ATCTTGGAAC CTGGCAGCAG CCGGTCTCGA AGGTCATCAT CCAGCGAGTG ATATGGGATC TCGGCACCGG ATTTTCAATC AACCTCAGGG GCGATCGCAT CGCCATCCTC AATTCAACAT TCAAGCAGGC GATCAACTCC TGGAACGGCA GCGGAGCGGG GGCGCTGCTC ATCACCAACG CCACCAATCT GACGCTCAAG AACAACACGA TCAGATGGGC GAGCAATCAG AACGCCTTCG GGGATCTCGT CAACGCGATT TTTGAGAACA ACCACTTCAC GCGCAGCGCG TCGGATACGG TGATCGCGGG GCCGGACCAG CTCTCCTGGC CGTGGATCGT GAGGCCGATC AGGCTGGGCG ACGTGCTATC GAGGACGGCC GCCGGCGGAA GGCAGGTCTC GCTGAACTTC GGCACAAATA TCGTGTTTCA GAACAATGTC TTCGATACCT CGGACGGCGT CATTCAATAT AATGCGGGCG ATGGCGAGAC CATTCTGAAC GAGGGCGGCG GCGGCTCGCC GCGTGAGGAC TTCGGGACCG TCACCACGGC GAGCGCCGCG ACCATAGCCG ACGATTCGAA ATGCTCTGGA GCCTGCGCCT GGAAAACCTA CCCGAATTCG AAAGTCGTCA TCGTCAGCGG CGCCGGCGCC GGTCAGTGGC GCAAGATCTT CGCGCAGAAC GGCAACAGCT TTACAGTCGA CCCGCCCTTC GACGTCGTTC CGGCCGCGGG GGATCATTTC ACGATCTCCG CTCCGTCCTA TGAGAACGCC ATCATCCGCA ACAACACGAT GGTCGGAAAT CCGATCGGGG TCGCCATGTA TCATGGCGTC TTTCTCAACG TCTCCGTCAT CGGCAACCAG CTCACCAACA ATGGCGGCAT CTATCTCCTG CCGTCGCAGG CCAATCAGCG CGCGGGACGG AACTTTTTTA ATGTCGCGCG CAATATTGAG ATCAACGGCA ATGTCCTGAA GAATCTCAAC GGCAACTATC CGTCATATGT CTCGATTGGC TTCGTCATGA TGACGCAGAA TGTATTCTGG GGCAAAAGCG TGCTCGCCGC CGAGGCCCGC AACAATCAGA TCACGGCGCG CAGCGGCACC TTTCCCTACA CATTCGGGGA AGGCTACAAT CACCTGACCT TTTATCAAAA TCCGGGCGGC GGCGCCTATG TCGAGGAAGG GAATGGGGCC ATGTGGGGAA CTGTATGGCA GGGCAACAGC TGCGCCAATT GCGCGGTCAA CTACAACCTC AGCACCGGAG CCATGGATAC GACGATCTGG AACGCAAAGG CGGCGAATTC CCCTGGCTTC GTCTCCACGA TCCTGAAAGA CACGACGATC GCAAACAGCA CCAAACAGGC CTCGACGCGA ACGCTTGTCG GCAAGGATTG A
|
Protein sequence | MLRIGSIYTC FLPRAHALEP WPPRRGRRRL AIAAMLIGLS INPPGAALAD SAAKPIIINK PFSAKPGDVF SLAGSGFGSA PRVYLKPSRL TATTELPVKT ADDATVVVEV PKAAAFDVYE VWVANGAATS PHVLINTPQP HHFDNPDIAT GAHFRIFGRN LSVGTLAPTA TLVDIGTGAQ IKAVVGVSAS NAYMLDVTPT GVVAGHAYKV LVSNGYASAL SEASVLGHAT GMDRFAIGQP WAYDFVTADG PGYKAGVKGT NLADHHVFNV RTDVVLATLA KGDGKTNDGP AIQGAINAAA RYGGVVYLPA GTYNIGSTSI SLTSNVLLQG QSASATKIIY TATKPGFIIG AGASMTGFVD LTLQNQDKTS TTTNLGTWQQ PVSKVIIQRV IWDLGTGFSI NLRGDRIAIL NSTFKQAINS WNGSGAGALL ITNATNLTLK NNTIRWASNQ NAFGDLVNAI FENNHFTRSA SDTVIAGPDQ LSWPWIVRPI RLGDVLSRTA AGGRQVSLNF GTNIVFQNNV FDTSDGVIQY NAGDGETILN EGGGGSPRED FGTVTTASAA TIADDSKCSG ACAWKTYPNS KVVIVSGAGA GQWRKIFAQN GNSFTVDPPF DVVPAAGDHF TISAPSYENA IIRNNTMVGN PIGVAMYHGV FLNVSVIGNQ LTNNGGIYLL PSQANQRAGR NFFNVARNIE INGNVLKNLN GNYPSYVSIG FVMMTQNVFW GKSVLAAEAR NNQITARSGT FPYTFGEGYN HLTFYQNPGG GAYVEEGNGA MWGTVWQGNS CANCAVNYNL STGAMDTTIW NAKAANSPGF VSTILKDTTI ANSTKQASTR TLVGKD
|
| |