Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_0241 |
Symbol | |
ID | 7090558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 273289 |
End bp | 279096 |
Gene Length | 5808 bp |
Protein Length | 1935 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643463574 |
Product | alpha-2-macroglobulin domain protein |
Protein accession | YP_002360583 |
Protein GI | 217976436 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.516561 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATC TGGCGCTCGC CTGCGCTTTC CTCCTCGCCG CCGCTTGCTT GTCGTTCGCT CCGCATCCGG CGCGGGCCGA GAGCGCGCCG CAATTCGACA TTCTCGAACG GGCGGACGGC GCGCGGATCG TGCCCGAACG ATTCTTGCGC AGCTGGGACC CGGTGACGCT GTTTTTCAAA AGCGACGCCG GGCCGCAAGG CGGCGGCCCA GAGGACGCGC CGGCGCAATT CGTCAAGATG ACGCCCGACA TTCCCGGCGC CTGGCAATGG CTTGGGCCGC GCGCGCTGCA ATTCCGGCCT TCAGCGGCGT GGCGTCCGCT CGAACCGGTC ATTTTCGAGG CTGACGGCGC CAGCACGCGC CTCATCCCGC TGCTGCCCGA GCCGGCCTCC ACGAATCCCG CGGCCGAGTC GGAGCCGATC GCCGAACTCG ATCATATCGT TCTGACCTTC GCCGGTCCCG TCGATCTCGC CGCCTTGACG CGGCTCACTT CGATCGAACT TCGCCCCGCG CCCGGCCTTG CCGGGGGCGA CGGGCAATTT CTCGCCGCGC AGGATTTCAC GATCCTGCCG CTTGAGGGGG CCAAGAAGGA TGCCGGCCGG TCCTATCTGG TGCGGTTGAA GAATGCGGTT CCGGACGGGC GCGTCGCCAT CCTGCGGCTA AAACTGTCCA ACGAGCCGGG TCTTGACGAG CCCTCCTTCG AGCTGCGCCT GCAAAGCGCC GCGCCCTTCA CCGTCACCGG CGCTTCTTGC GGCGAGGGGC TTGAGCGCGC CAATCTCGAC GGGCTGCTTC ATTGCGCGCC CTCGAACGGC GACGACGAGC CGCGGGCGCG GAGTCTTTCG ATCGGTTTCA GCGCGGCGCC GGAACCCCTC GACATCGCCA AAGCGCGCGA CGCGTTGCGC ATCTCGCCGC CGGTCGAGAG CCTCAAAGTT GAGCCGGACG GCGCGCATCT CAAAATTTCA GGACAGTTTC TGGCTGACAC GATTTATGAG CTGACCGTTG CGCCGGGCAC TCTCGCCGAC GACCGCAAGC GCCAGCTCGC GGGCGCCCCT TTCCTGCAGC GCTTCGCCTT TGCGCCGGCG CGCGCAAGCC TGGCCTTCGA CGTCCGTCAG GGCATCGTCG AACGCTTCGG CCCGCAGCTG ATCCCGATGC GCGGGGCGGG CTTCGACAAG GTCGATCTCC GCATCTATGC GATCAATCCA CTTTCGCGCG ATTTCTGGCC ATTTCCGACG GAGGGCGTCG ATACCGACGA TTCCAAAGAG CCGCCGCTGC CCGGCAATGC GCCGGAGCGC TGGAGCAAGG ACGAAGACGC CGACGCCGAC GCGATCGCGC AAAGGATCAG AGCGCTCGGC TCGCCCGCCG TCTCGGAGCT CGCGGCGCTG CCGATCCGGC GGCAGGGGGC GGGCGCCAAA TTCGGGCTCG ATCTCAAACC CTTCTTCGAG CGCATCAAAG GCGCTGGCGA AGCCGGCGCC TATCTCGTCG GCATGCGTCC GATCGGCGCC GACAAGCGCA GTTTTATGCG CGTGCAGGTC ACGGACCTGA CGCTGAGCAC GGTGGAGGAG ACGGGTCGCG TCCGCTTCGT CGTCACCTCG CTGTCGACGG CCAAGCCGGT CGAGGGCGCC GAGATCAAAC TCGAAGGGTT GCACGAGGAT AAATATGTCG CCCTCGTCTC CGGCCGAACC GACGCCGAGG GCGCCTTTAC ATGGGACGTC GGCGAGCCCG CCGAGGCGAC CATCAAGCGC ATCATCGTCG CCAAAGGCCT CGACGTTCTG GCGCTCGATC CCGCCCATGG CCCGGCCGAA TATGCGAGCG AGAACTGGAC CAAGCCGGAG GAAGCGTGGC TCGCCTGGAC GGTCGATCCG CAGATCGACC GTGTGGAGCC TCCCCGCACG CTCTGCCATC TTTTCACGGA GCGGCCAATT TACCGGCCGG AAGAGCCGGT GGAGATCAAG GGCTATGTCA GGCGCTATCT CGGCGGCGCG CTGAGCTACG CCAAGGGCGG GGGCACGCTG CTCGTCAACG GCCCCGGCGG ACAGGAATGG CGCCTGCCGG CGGCGATCGA CGAGACCGGG AATGTCTACC GCAAATTCGA CGCCGCGACG CCGGCGACCG GGGATTATTC GGTCGCTTTC GAGCCTGACC CCGAGGTCAA GGACGACGCC GCGGACGAGG GCGCAGAGCC GCAGGACGAG GGAGCGGAGC CGGAGGATCA CAACGCGGCG CAAGCGGAGA ACGAGGGGCC GGTCTCCTGC GGGCAAACGC CGTTCAAGAA AGAGGCCTAT CGCCTGCCGA CCTTCGAGGT GCTGCTCAAC GCGCCGCAGA CCGCAACGCT CGACGGCGTG TTTTCGGTGG AGCTTCTGGC GCGCTATTTC GCCGGCGGCC TCGTCGCCGA CCGGCCGATC AAATGGCGCG CCAGCCAGTT CCCCTATGCC TGGACGCCGC CCGGCCGCGA GGGCTGGTTC TTCTCGACCG ACGCGCGTTT TTCCGGCGAC GGCAAGTTCA AATCGACTCC CGTCCTCGAA CGCGAGGGGA CGACGGACGG CGCCGGGGCG GCGAAAATTG CCTTCGATCC ATCGATCGAG CCGACTGCGC AGCCGCGCCG CTATCAGATC GAAGCGACGG TCGCCGGCGA CGATCAATCC GAGGTGCGCA ACGTCGTCTC GGTCGCGGCT TTGCCGCCCT TCGTGCTCGG CGTCAAAACG CCGCGCTACC AGAAGCAGCC GGGGCCGATC GACGCCGAAA TCCTCGCCGT CGACGCCAAG GGCGCGCCGA TCGAGGGGGT CGCGATGACG GCGCGCCTCG TGCGGCGCAA TTGGAGTTCG ACCCTGCAGG CGAGCGACTT CAGCCAGGGC GCGGCGAAAT ATGTAACCGA AATCGTCGAC GAGACGGTGA GCGAGAAACT GATCGCCAGC GGCAAGGACG CACAGAAGCT TGGCTTCGAG GCGCGCGCGG CCGGCGTCTA TCTGGTCGAG CTCGAGGCTT CCGACCGCAG CGGGCGAAGA CAGAAGATCG CGGTCGATTT CTTCGTCGGC GGCGACAGCC CGGCGACTTT TGCGCGCGCC CCGTCGCAGA CCGCCGAGGT CGCCGCCGAC AAGGAGGCCT ATGCCCCCGG CGAGACGGCG AGCCTCATCA TCCAAAGCCC ATTCCAGAAC GCCCGCGCGC TGGCGATCGT CGAGGATCCT TCCGGCCGCT TCCGCTACGA ATGGGTCGCC ATTGCCAATG GCTTTGGCCG CTTCGAAGTC GCCGTCGCGA CGCCCGATCT GCCGAAGCTT GCCGTGCATT TTCTGATCAT GCGCGGACGC CTGCCGGACG CCGGAGCCGA CGCCTCGGCG CCGTTCGATC AAGGCAAGCC GGTGACGATC GCCGCGACGA AATGGATCAA TGTCACGCCG GTCAAGAATA TCGTGACGGT CGCGCTCGAC TATCCGCAAA AGGCGCGGCC GGGCCAGGAG ATCGAGGTCG CGCTGAAACT TTCCGACGAT CTCGGCAAAC CCGTCGCCGG GGAGGCGACC TTCTGGATGG TGGATCAGGC GGTGCTCTCG CTCGCCAAGG AGCGTCCGCT CGATCCGCTG CCGAATTTCA TCGTCGACCG GCCGACGACG ATGGCCGCGC GCGACACGCG CAATATGGCT TTCGGACTGA TCCCGCTCGA CGAGGCGCCG GGCGGCGACG CCGGCCTCGA GGAATGGGGC TCCGACAACA ATGTATCGGT GCGCAAGAAT TTCACGCCCG TGCCGATCTA TCTGCCGAAA GTTGTCGTCG GTCCGAGCGG CGTGGTCAAA ATCAAAGTCA AGCTGCCGGA TTCTTTGACC ATCTTCAAGC TGCGCGCAAA GGCGATCAGC GGGTCGGAGC GGTTCGGCTT TGCGACGGGC GAAATGCTGA TCCGGCAGGA TCTCGTCGCG CAGCCGGCGC TGCCGCGTTT CCTGCGCAAT GGCGACGTCT TCTCGGCTGC ATTTCTGGGA CGCGTCGTCG AGGGGCCGGC AGGCTCCGGC CGCGCCAGCC TGGCGGTCGA GGGGCTGACG CTGCAAGGGT CCGGCGAACG CAATTTCGCC TTTGAGCCGA ACCATCCGGC GCGGCTCGAT TTTCCTGTCG TCGTCCCGCC GTCGGGCGAC AGCGCGCGCT TGCGCTTTGC GCTGAAGCGC GACGCCGATT CGGCTCGCGA CGCGGTGGAA ATCGAGCTGC CCATCAAGCC TGACAGGCCG GTGACGCGCG AGCGCAAAAT GCTCGACGTC GCCGCCGGCG GCGCGCTAAC CTTGCCGGCC ATCGCGGCGA AGCTGCGTCC CGGCTCGCTC CGGCGCAGCC TCGACGTCGC CTCCGATCCC GCGATCGTCC GGCTCGTCGC CGGCCTTAAC TATCTCGTCG AATATCCCTA TGGCTGCACC GAGCAGCGCA TCTCGCTCGC CTCGGCGGCG CTCGCGCTCA AACCCTTCGA ACCGGTCCTT GCCGCTTCCG ACCTCGGCGA CCGGCTGACC AATGACGTGC ATAATACCAT CGTTGCGATC TCGCAGAGCG TCGACGCCGA CGGCCTCGTC GCCTTCTGGC CGCGGGCGCG CGGAAATGTG TCGCTCACGG CCTGGGCCTA TGGCTTTCTC GTCGCGGCGC AACGCGCGGG CGAGCCGGTC GACAAGGCGC TGAGCGAGCG TCTTGCCGCC GTCTTGAAGC AGTCGCTGCG CTCCGATTAT GCGCGGCTTC GCACCGGCGA GGAATTGCGC GAGCGGGTCG AGGCGCTGAC CGCGCTCGCC GATGGCGGCA GGCTTGATCA GGCCTATGCG GCCGAGCTCA GCCGGCGCGC CGCGTCGATG CCGAATGTCT CGGTCGCGCT GATGACGCAG GTCGCGCTGC AACTGCCCGG CGATGGCAAG CAGGTCGCGG GCGCTCTCAT TGAAGACATG TGGACCCGCG TAAAATTCCT AAACCGCAAC GGCAAGGAGG TCTATGCGGG GCAGGCGGCT GATGACGGCG ATCCTGAAAT TTTGCCGTCC GAAGCGCGCA GCCTCGCCGA GATGACCCGC GCGGCGGCGC TCGTCGCGCC GCAGGATCCG CGCTCAAATG TGCTGCGCGA CGGCTTGCTG CGCCTTGGCG CCGGCGACGG CTGGGGAGAC ACCAACGCCA ATGCCGCCGC GGTTCGCGCG CTTGCCTCGA TCTGGCGGAA GACTTCCGCG CAAACGGATG TCAGCGTGAC GCAGGACGGC AAGACGGAGA CGGCTAGCCT CAGCGCCGGC GTCCCGGTCG CCCGCCTTGG CGTCAATGGC GCCGCCGAGG CGCGCATCGC CAATGCGGGC AGCGCCCCGA TCGTCGCGCT CTCAAATGTC TCCTATCTGC TGGCGGAGCC CGGCGATCAG GCGCAGGCCA GCGCGCAAGG GTTTGTCGTC TCGCGCAAGA CCTACAAAAT TGCGGAGGGC GCGCCGCCGC TCCTGGTGGA GCCCGGCGCC GATGGCGCGA TTCATATCCG AAGCGGCGAT GTGATCGAAG AGGCGATCGA GCTTGTCAGC CCGCAGGATC GCACGCATGT CGCGATCACT ATCCCGCTGG CGGCCGGCTT CGACCCGCTC AATCCCAATC TCGCGACGGC TCCGCGCGAT GCGACGCCCA GTTTCGATCC GACGCTGGCG CCGACCTGGA CCGCCTTTCA TGACGATCAG GTTTTTTACG CCTATGATTT TCTGCCGAAA GGAAACTATC GCTTCGCCTT CCGCGCCAAG GCGATGATCG AAGGTAGCTT TACGCAGCCG CCCGGCGAGA CTGAGACGAT GTATCAGGCA GGGGTCCATG GCTCGAGCGC CGGACGGCGA ATCATCATCG CCAAATAG
|
Protein sequence | MKHLALACAF LLAAACLSFA PHPARAESAP QFDILERADG ARIVPERFLR SWDPVTLFFK SDAGPQGGGP EDAPAQFVKM TPDIPGAWQW LGPRALQFRP SAAWRPLEPV IFEADGASTR LIPLLPEPAS TNPAAESEPI AELDHIVLTF AGPVDLAALT RLTSIELRPA PGLAGGDGQF LAAQDFTILP LEGAKKDAGR SYLVRLKNAV PDGRVAILRL KLSNEPGLDE PSFELRLQSA APFTVTGASC GEGLERANLD GLLHCAPSNG DDEPRARSLS IGFSAAPEPL DIAKARDALR ISPPVESLKV EPDGAHLKIS GQFLADTIYE LTVAPGTLAD DRKRQLAGAP FLQRFAFAPA RASLAFDVRQ GIVERFGPQL IPMRGAGFDK VDLRIYAINP LSRDFWPFPT EGVDTDDSKE PPLPGNAPER WSKDEDADAD AIAQRIRALG SPAVSELAAL PIRRQGAGAK FGLDLKPFFE RIKGAGEAGA YLVGMRPIGA DKRSFMRVQV TDLTLSTVEE TGRVRFVVTS LSTAKPVEGA EIKLEGLHED KYVALVSGRT DAEGAFTWDV GEPAEATIKR IIVAKGLDVL ALDPAHGPAE YASENWTKPE EAWLAWTVDP QIDRVEPPRT LCHLFTERPI YRPEEPVEIK GYVRRYLGGA LSYAKGGGTL LVNGPGGQEW RLPAAIDETG NVYRKFDAAT PATGDYSVAF EPDPEVKDDA ADEGAEPQDE GAEPEDHNAA QAENEGPVSC GQTPFKKEAY RLPTFEVLLN APQTATLDGV FSVELLARYF AGGLVADRPI KWRASQFPYA WTPPGREGWF FSTDARFSGD GKFKSTPVLE REGTTDGAGA AKIAFDPSIE PTAQPRRYQI EATVAGDDQS EVRNVVSVAA LPPFVLGVKT PRYQKQPGPI DAEILAVDAK GAPIEGVAMT ARLVRRNWSS TLQASDFSQG AAKYVTEIVD ETVSEKLIAS GKDAQKLGFE ARAAGVYLVE LEASDRSGRR QKIAVDFFVG GDSPATFARA PSQTAEVAAD KEAYAPGETA SLIIQSPFQN ARALAIVEDP SGRFRYEWVA IANGFGRFEV AVATPDLPKL AVHFLIMRGR LPDAGADASA PFDQGKPVTI AATKWINVTP VKNIVTVALD YPQKARPGQE IEVALKLSDD LGKPVAGEAT FWMVDQAVLS LAKERPLDPL PNFIVDRPTT MAARDTRNMA FGLIPLDEAP GGDAGLEEWG SDNNVSVRKN FTPVPIYLPK VVVGPSGVVK IKVKLPDSLT IFKLRAKAIS GSERFGFATG EMLIRQDLVA QPALPRFLRN GDVFSAAFLG RVVEGPAGSG RASLAVEGLT LQGSGERNFA FEPNHPARLD FPVVVPPSGD SARLRFALKR DADSARDAVE IELPIKPDRP VTRERKMLDV AAGGALTLPA IAAKLRPGSL RRSLDVASDP AIVRLVAGLN YLVEYPYGCT EQRISLASAA LALKPFEPVL AASDLGDRLT NDVHNTIVAI SQSVDADGLV AFWPRARGNV SLTAWAYGFL VAAQRAGEPV DKALSERLAA VLKQSLRSDY ARLRTGEELR ERVEALTALA DGGRLDQAYA AELSRRAASM PNVSVALMTQ VALQLPGDGK QVAGALIEDM WTRVKFLNRN GKEVYAGQAA DDGDPEILPS EARSLAEMTR AAALVAPQDP RSNVLRDGLL RLGAGDGWGD TNANAAAVRA LASIWRKTSA QTDVSVTQDG KTETASLSAG VPVARLGVNG AAEARIANAG SAPIVALSNV SYLLAEPGDQ AQASAQGFVV SRKTYKIAEG APPLLVEPGA DGAIHIRSGD VIEEAIELVS PQDRTHVAIT IPLAAGFDPL NPNLATAPRD ATPSFDPTLA PTWTAFHDDQ VFYAYDFLPK GNYRFAFRAK AMIEGSFTQP PGETETMYQA GVHGSSAGRR IIIAK
|
| |