Gene Information |
Locus tag | Mchl_3570 |
Symbol | |
ID | 7113703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 3758265 |
End bp | 3761597 |
Gene Length | 3333 bp |
Protein Length | 1110 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643526306 |
Product | Sel1 domain protein repeat-containing protein |
Protein accession | YP_002422318 |
Protein GI | 218531502 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
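The length fields in the record follow from the coordinates. As a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length, the arithmetic can be checked like this (the function name `cds_lengths` is illustrative, not part of any database API):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # whole codons, stop included
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the record for Mchl_3570.
print(cds_lengths(3758265, 3761597))  # (3333, 1110)
```

The result agrees with the Gene Length (3333 bp) and Protein Length (1110 aa) fields listed in the record.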
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.571273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGA CTGCCCCGAT CAGCCTCGAC AGCTTCGACC CGGAGGTACT CGCGGCTGCC CGCGAGGTCG CCCGGCGAGC GGGGGTGCCG CTGGAGAGCT GGATCGCCTC GGTGGCGACG CCCGATCCGT CGAAGCCGGG CCCGCGGCGG CGGCGCGCGG ATGCGACGTC CCCAGCCAGG GAGGCCGGCG CGGAGCCGGC CCGGCAGGTC GGAGGCCGGG CGCCAGAGAA GGCGCCCACG CCCGCTTCAC GGAAGGATGG GCCGCAGCAC AAGCGCCGGG AAGCCACTCA GGGCGCTTCG GCCGCCGAGG CGTCCGAAAC GGCATCGCTG GAAGCCTCGC TCGGCGCGAT GATGCGGCGG CTCGACGCCC TCGACCGCTC GATCAGCGAG GAGCGCGAGG CCTCCAAGGC CGATGCCGCC CGGATGATCG ACGAGATCGA GGCGCGGTTG ACCACCGCCC GCCAGCCGGT CGCACCGGAA ATGATCGCCG GACGCATCGC CGACATCGAA CGCAAGATGG GCGAGATCGC CGGCCAGCTC GACACGCCCC GCCCGCTGGG TCGGCGCGGG CGCCCGCTCG CCACCGAGGT CCGCGATGCC GTCGCCGAGG TGCGCCGCCG CCAGCGCGAG CTGGAAGACG GCATCGCCGA GTGGAACGCC GCCAGCGCGG GAGAATTGAA GGACGGGCCA CAGGATGCCC CGGCCGCCCT TGCGGTGACG CCTGCCGAGG ATGAGTCCGG CCAGGAGCCG TCCCCGGCCA TCGCCGAACT GCAACGCGAG ACGAACCGGC TGCGCGACGC GCTCGGCAGC CTCGCCACCG GCCGCGACGT CAGCGAGCTG GAGCGGACCA TGCAGGCCGT CGCCAGCGAC CTTCAGCGGG CGCGCGCGCC GCAGGAACTC GCCGCCATCG CCGCCCCGGT CGAGCTGATG CGCCTCCAAG TGGAGCGGAT CGCCAAAGAC GTGGCGGACA ACGTCCATGC CCGCATCGCG GGTGAGGTGG AGCGCTTGGC CGGAAAGGTC GATGCCGTTC TCTCCGGCGT CTTGTCCGGC CCCGCCGACC AGAGCGCCCT CGACGGCGTG TTCCGCGAAC TCGACGAGAT CCGCCGCCTC GTGGCGTCCC TGGCAGGGCC GGAGCGCATC CAGAGCCTCG CCCAGGGCGT GCAGGCGATC AGCGCCCAGA TCACGCAGCT CCAGCGGGAC GAGGATGCGG GCATCGCCAC CCTCAAGCCG CTGCTGGAGG AGATCCGCGG CGAGCTGAAG GCGCCCGATT CCTCCCGCGA GCTTCCCGGC GCGCTTCTCG GACGTTTCGA GGCGCTCGCG CAGCGACTCG ACGGGGCCGA GTCCGGCTCG GTCGGCGAGC TGATCGAGCG GCTCGAAGGC GTGGCCGAAA AAGTCGATCG CGTCAGCGCG GGCGGCAGCG GGCTCGATGC CCTGGAGCGC CACGTTCTCG CCCTGGCGAG CCGGCTCGAA GCGCCGCGCG ACACCGATCC GGCCGTGGCG CGCCTCGAGC GCTCCATGGG CGACCTGCTC GCTCAGGTCA CGGCCCTGCG CAACGGAACC GATCTGGAGG CCACCGTCGC GCAGGCGGTC CGTGAGGCCG TGGCGGGCTC GACCGCCCCG CTCGCGGCCG GCGGCGGTTT CGAACTGCTG CGGGCCGATC TCGCCGAGAT GCGGGCCAAC CAGAAGGGCG CGGACCAGCG CCTCCAATCG ACGATGGAAG GCGTCCAGTC GGTGCTGATG CGGCTGAGCG AGCAGCTCGA CCGCACCATG ACCTCGTCCG CGGCCCTCAC CGCCGCCGCC 
CCGCAGGAGC GCGCGCCCGT CGTGCTGTCC TCGGCCGAGC GCGTCTCGCA CGAGCGCCCC GCCGCCCCGA AGACGGTCGC CAAGCCGTCA TCCGAGCCGT CGTCCCAGAA CCTTGCCCGT CCGAACCGCC CCGCAACCTC CGACGAGGCC GGCGGGACGG AGGCGAGCCG CCTGTCCGAC GAGCTGCTGG AGCCCGGTGC CGGCCGGCCC GGATCCGGCC GCCCGGCCGC GCCGGAGGCG AGCCCCGCCG CGACCGGCGG CGCCGACATC AAGACCAGCT TCATCGCCGC CGCGCGGCGC GCGGCGCAGG CAGCGCAGGC CGAATCGGCC ACCGAGGCGC CGCTGACCGC GCGGCTGCGC GACAAGGTCG CTCCGGCCCG CATGCCGGGC GCGGAAACGA CGCCCCTTTC GCGGATCCGC GGTGCGCTCG ACAGCCGCCG CCGCACGCTC CTGCTCGGCC TCGCGGCCGT GGTGCTGGCG CTCGGCGCCT ACCAAGCCTT CGTCGCGGGC AAGGGCACTC CGACCGGAAC TCCGACCGGC GACCCGGCCG CGCCGGAGGC CCGTCCGGTG GCGAGCACCG CCCCGGCGGC CTCCGCCGAC GTCGCCGCGA GCCGCACCGA GACCACGGCC GAGCCCGCTC AGGCGGCGTC GCAGGCCCAG GCTCCGTCCG AGACGTCCCC CCAGACCGGG ACATCCGCTC AGACGACGCC CGACCCGGCG ACGACCCAAT CCATCGCCGA GCCGAAATCC GCGCCGACCA AGCGCGGGCT GCCCCAGGTC GCGGGCATGA GCACGCTCGG CCCCGACCTC GCCGGCCTGC CGCCGGCCCT GGCCAAGCTC AAGCAGGATG CGCTCGACGG CGACGGCGCC GCGGTCTGGG AGATCGCCTC CCGCGAGGCC GAGGGCCGGG GCGTGACGCG CGACCTCGCG GTCGCCGCCA AGCTCTACGA GCGGCTCGCG AATGCCGGCT ACGCGCCGGC TCAGTTCAAG GTCGGCAACG CCTACGAAAA GGGCTCGGGC GTGGTCCGGG ACATCGAGAA GGCGAAGGCC TGGTACGGCC GCGCCGCGGA TCAGGGCAAC ATCCGCGCGA TGCACAACCT CGCCGTGCTG CATGCCGAGA ATCCGGCGGC CAACGGCAAG GCGGATTTCG TGACCGCGGC GAACGCCTTC CGCCGGGCGG CGGAACACGG GGTGCGCGAC AGCCAGTACA ACCTGGCCGT ACTCTACGCC CGCGGCCTCG GCGTCGGGCA GGATCTCGTC CAGTCCTATC TCTGGTTCTC GGCCGCTGCC ACGCAGGGGG ACCAGGAAGC GGGCCGCAAG CGGGACGAGG TCGCCGCCAA GCTCTCGCCG AAGGATCTCA CCGAGGCCAG GAGCCTCGCT GGGAGCTTCA AGGCGAAGGC CGTCGATCCG GCCGCGAACG AGGCCCCGTC CCAGAAGGCC ACCGCCGCAG CGGGGATGTC CCTGATGGGC GCGCCGTCGC CGGGCATGCC GACCGCCGCT TCCCCATCGG CGCAGAAGCG CTTCGGGGTC TGA
|
Protein sequence | MKQTAPISLD SFDPEVLAAA REVARRAGVP LESWIASVAT PDPSKPGPRR RRADATSPAR EAGAEPARQV GGRAPEKAPT PASRKDGPQH KRREATQGAS AAEASETASL EASLGAMMRR LDALDRSISE EREASKADAA RMIDEIEARL TTARQPVAPE MIAGRIADIE RKMGEIAGQL DTPRPLGRRG RPLATEVRDA VAEVRRRQRE LEDGIAEWNA ASAGELKDGP QDAPAALAVT PAEDESGQEP SPAIAELQRE TNRLRDALGS LATGRDVSEL ERTMQAVASD LQRARAPQEL AAIAAPVELM RLQVERIAKD VADNVHARIA GEVERLAGKV DAVLSGVLSG PADQSALDGV FRELDEIRRL VASLAGPERI QSLAQGVQAI SAQITQLQRD EDAGIATLKP LLEEIRGELK APDSSRELPG ALLGRFEALA QRLDGAESGS VGELIERLEG VAEKVDRVSA GGSGLDALER HVLALASRLE APRDTDPAVA RLERSMGDLL AQVTALRNGT DLEATVAQAV REAVAGSTAP LAAGGGFELL RADLAEMRAN QKGADQRLQS TMEGVQSVLM RLSEQLDRTM TSSAALTAAA PQERAPVVLS SAERVSHERP AAPKTVAKPS SEPSSQNLAR PNRPATSDEA GGTEASRLSD ELLEPGAGRP GSGRPAAPEA SPAATGGADI KTSFIAAARR AAQAAQAESA TEAPLTARLR DKVAPARMPG AETTPLSRIR GALDSRRRTL LLGLAAVVLA LGAYQAFVAG KGTPTGTPTG DPAAPEARPV ASTAPAASAD VAASRTETTA EPAQAASQAQ APSETSPQTG TSAQTTPDPA TTQSIAEPKS APTKRGLPQV AGMSTLGPDL AGLPPALAKL KQDALDGDGA AVWEIASREA EGRGVTRDLA VAAKLYERLA NAGYAPAQFK VGNAYEKGSG VVRDIEKAKA WYGRAADQGN IRAMHNLAVL HAENPAANGK ADFVTAANAF RRAAEHGVRD SQYNLAVLYA RGLGVGQDLV QSYLWFSAAA TQGDQEAGRK RDEVAAKLSP KDLTEARSLA GSFKAKAVDP AANEAPSQKA TAAAGMSLMG APSPGMPTAA SPSAQKRFGV
|
| |
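The gene and protein sequences above can be cross-checked by translating the DNA. The sketch below is an illustration only: it builds the codon table from the compact NCBI-style amino acid string, which is the standard assignment (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits). The helper name `translate` is hypothetical, and the function strips the spaces that separate the 10-base blocks in the record.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()      # remove block separators
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                       # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGAAACAGA CTGCCCCGAT CAGCCTCGAC"))  # MKQTAPISLD
```

The output matches the first block of the Protein sequence field. Passing the full Gene sequence field to the same function would be the natural way to compare against the whole protein entry.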