Gene Information |
Locus tag | Mchl_5674 |
Symbol | |
ID | 7119023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011758 |
Strand | - |
Start bp | 306752 |
End bp | 309673 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643528327 |
Product | WD-40 repeat protein |
Protein accession | YP_002424323 |
Protein GI | 218533508 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
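The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(306752, 309673)
print(length_bp, protein_length(length_bp))
```

With the values listed for Mchl_5674 this yields 2922 bp and 973 aa, which agrees with the Gene Length and Protein Length rows.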
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGGCG CGAAAGAACG GATCGGCATC GTGCGAACCA TCCTAAGCCC GGGCCGCTCG CCTAACGTTG CACTCTCCGC GTTCGACGCG TTCCTGTCGT ACAGCACCGC TCTGGACGGA CGCTTGGCGC CTGCCCTGCA GAGTGGCATC GAGCGGTACG CGAAGCCTTG GCACAAACTT CGCGCCTGCC GCGTCTTCCG CGACACGACC AACCTCAACG TTTCCCCGGA CCTCTGGCAG GCCATCGAAG TCGCGCTCGA TCGCGCACCG CGCCTGATAC TGCTGGCCTC TCCCGAGGCG GCGGGATCGA AATGGGTGCG CAAGGAGGTG CAGCACTGGC TTTCCCGGCA TGACCCATCG TCTCTTTTCC TGGTCCTTAC CGGTGGCGAG ATCGCGTGGG ACGACGAGGC GGCGGACTTC GATTGGGAAA GGACAGACGC GCTTCCGCCG ACGCTGGCAG GCGTGTTCCA ACGGGAGCCC TTGTTCGTTG ACATGCGTTG GGCGCGGGAT CCCGCCGTCG ATCTCTCGCT GACCAATCCG TTGTTCGCGC AAGCCGTGCT CTCGCTGGCT TCGCCCATTC GCGGTATGCC GATGGATGAA CTCGGAGGCG AGGCGGTACG TCAATTCCGG GCGACTCGAC GGCTTGCGAC GGCTGCGGTC GCGGTGGTGG TGTGCCTTGG CATCGCGGCT GCCGTCGCCG CATGGACGGC GGCGCGGGAG GGCGGTCGCG CGAACGAACG TGCCGAGGAG GCGCTTCGAC GCGAGTCGCT CCGCCTGGCA GCCGCTTCGC GTTCGGAAAC CGCTTCCGGA AACGCGACCG ACGGGCTGAC GCTTGCCCTG GCGGCCCTGC CGCAGACCTT CGAGAAGCAA GGCTGGCTGT CTTGGGCGAC AGCCCCGGCG CCACGACCCG TCGTTCCCGA CGCCTTGGGG GCTCTATCGG AGGCCATGAG CCTGCAGCGT GAATACAGGG TGCACGCTCC GCTGCCTGGA GCGAGCGACG GCCGGTACAT GGGAAGCGGC TTCCTCGACG GCGGTGCCAA GCTGTTCGGA TTCACGCGCG GTGGTCGGCT TCTCATCTGG TCGACGCTGG ACGGGACGGT GCTTCTCGAC CTGCAGAGCC CGTCCAGGCA GCCGAAGGAA TTCCTGCTCA ACGAAACCGG TACCGAGGCG GTGGTTGCCG ACGAGGGCGG CCAGATCACG TTCTGGCAGC CCGGACTACC CACTCCTCCG ACCACCCTGC CCCTGACCAC GCCCTCGCGC GAAGGCTGGC AGGATCGGCT CTTCGTGATC TGGTCGCAGA GCGCTCCAAG GCTCCTGCTC GCTCGGCCTG ATGAGAATCT CACGCTGTTT GAGCTGCCGT CGGGCCGGTC GCTCGCAATC CTACCCCGAA AGGGAGAGGA CGTGTTCACG ATTTCCAGCT CGACCGACCA AGCTTCTGCA CTCCTGCTGC TAAGGGTCGG TGGGAAAACC GAGCGTTACG ACGCAATAGA CGGCCGGCGC AGAGAGGACG TCAGGAGCGA TGTGCCTGAC GCCGACTTCG TGAGGCGGAG CCGCGATGGC GACATCCTCA TCTACGGAGA TTCCCTCAAG AGCGCGATTG TCGCGACGCG TCTACCGGAG GGCGGATACC AAACGCGGGG CGTGATCGAA CTTGCGCCGA GGGACCAGTT ACGTCTGTCG CCGGACGGCC GCTGGGCAGC TCGGGTGCAG CCGGACGGCA CCGTGCGCAT GTCCAATCCG CTCTCACGCG TTCGCGAAGC GTGGCTTTCG GCCGGCGACC AGGTCGCGGC AGCCGAGTTC 
GACAGCACCG GGACACGCTT CGTTGCCCGG CTTGAGAATG GTGCGATCGC GCTGTTCGCC GCCGGTCCGC ACGGCTGGGT GGACGGTTAT TTGGCTGCGC CCGGGCCGGG ATACGACGGG TTGATCGACC CGGATGGCCG TTCGATCACA ATGGGAACGA TCAAGGTCGC TGACCGTGGG CTTGGGGGAG AGGTCGTCAC GTGGGACCGG TTAACGGGCG AGGTGCGGCG CCGCCGAAAG ACCGGCAGGT GGCCCTTGCG AGTTCGCAGG ACCCGTACCG GGGCCGCACT GGCCGCCGTC CTGACCAACG GGGGCTTGCG TGTGTGGCTC GACGAAGCGA CGGAGCCTTC ATGGACCGGC GAGGTGGCCT CCGGGGAGGC TGACTTCATC GACCTTGCTT TCGTCGGCGA TGAGAGGCTT CTGGCGGCAA CGCGATCCGG GACGATCATC GACTTCGATC TCAGGACGGG ACGGGAGAGA AGGCGCGTCG CCTTCGGAGC GGACGACGTC ACGTCCTTCG CGGCTTCGCC TGGCGGTGAC ATCCTCGTCG CAACGTCCCC CAGTGTGGGA GCCATCCGAG GTGAGTTGGA TAGCTTCGAA CGGACGCGTC GGTCACTCGT CGGGGCGGAT AAGGGTGTTC GTCGTGCCGC GGTCAGTGCA TCCGCGATCC TGGTGGCCGA CCGTGAGGCG ATCCAAGCCT TCGACCTGGG GACGGGGCGA TTGCTAAGGT CCTGGGCCGT TCCCGGCGAG AGCATCAACT CGCTGGCCTT CAACGACGAC GGCACGCGCT TCGCCGCTTC CACCGAGAGC GGGACAGTCT ACGTTATGGA TTGGCACGAC TCGACGCGCG ACCTCGTGCT GCGAACCGGC AATGTCCGGA TCTACTCCGC AACGTTCGCT CCGGACGATG CCGGAATTCT CACCACGGGC CACGATGGTC GCGCCCGCTT TTGGCAGACA CGCCGAACCC CTGAAGAACT CGTGGCTTCG GCGCGTGGCC GGGTTTGGCG CTGCATGCCG CGAGACCGGG CCCTCAAGTT CGGTGCGGCC GACCTGATCA CCTCGTCCGT CGGAACAGGA TGCGCGTCGG TCGAAAGGCG TCGTGCGCCC GAACTCGATT GA
|
Protein sequence | MDGAKERIGI VRTILSPGRS PNVALSAFDA FLSYSTALDG RLAPALQSGI ERYAKPWHKL RACRVFRDTT NLNVSPDLWQ AIEVALDRAP RLILLASPEA AGSKWVRKEV QHWLSRHDPS SLFLVLTGGE IAWDDEAADF DWERTDALPP TLAGVFQREP LFVDMRWARD PAVDLSLTNP LFAQAVLSLA SPIRGMPMDE LGGEAVRQFR ATRRLATAAV AVVVCLGIAA AVAAWTAARE GGRANERAEE ALRRESLRLA AASRSETASG NATDGLTLAL AALPQTFEKQ GWLSWATAPA PRPVVPDALG ALSEAMSLQR EYRVHAPLPG ASDGRYMGSG FLDGGAKLFG FTRGGRLLIW STLDGTVLLD LQSPSRQPKE FLLNETGTEA VVADEGGQIT FWQPGLPTPP TTLPLTTPSR EGWQDRLFVI WSQSAPRLLL ARPDENLTLF ELPSGRSLAI LPRKGEDVFT ISSSTDQASA LLLLRVGGKT ERYDAIDGRR REDVRSDVPD ADFVRRSRDG DILIYGDSLK SAIVATRLPE GGYQTRGVIE LAPRDQLRLS PDGRWAARVQ PDGTVRMSNP LSRVREAWLS AGDQVAAAEF DSTGTRFVAR LENGAIALFA AGPHGWVDGY LAAPGPGYDG LIDPDGRSIT MGTIKVADRG LGGEVVTWDR LTGEVRRRRK TGRWPLRVRR TRTGAALAAV LTNGGLRVWL DEATEPSWTG EVASGEADFI DLAFVGDERL LAATRSGTII DFDLRTGRER RRVAFGADDV TSFAASPGGD ILVATSPSVG AIRGELDSFE RTRRSLVGAD KGVRRAAVSA SAILVADREA IQAFDLGTGR LLRSWAVPGE SINSLAFNDD GTRFAASTES GTVYVMDWHD STRDLVLRTG NVRIYSATFA PDDAGILTTG HDGRARFWQT RRTPEELVAS ARGRVWRCMP RDRALKFGAA DLITSSVGTG CASVERRRAP ELD
|
| |
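The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched as follows. This is a minimal illustration, not the database's own pipeline. It assumes that the sequence is shown in coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied despite the minus strand), and that translation table 11 assigns sense codons the same way as the standard code. Alternative start codons are not handled. All function names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order; table 11 is assumed
# to share these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that separate the 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
prefix = "ATGGATGGCG CGAAAGAACG GATCGGCATC"
print(translate(prefix))  # MDGAKERIGI
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence rows to be checked in the same way.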