Gene Information |
Locus tag | Mchl_2937 |
Symbol | |
ID | 7115744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 3089385 |
End bp | 3091190 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643525687 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_002421704 |
Protein GI | 218530888 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
|
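The coordinate and length fields above are mutually consistent, and a short sketch can check this. It assumes the coordinates are 1-based and inclusive (an assumption that matches the listed values) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3089385, 3091190)
print(length_bp)                  # 1806
print(protein_length(length_bp))  # 601
```

Both results agree with the Gene Length (1806 bp) and Protein Length (601 aa) fields.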
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.919612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00396871 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGCGTG CCCATACGAT GCGATTCGGC AGCGAGTTTG CCGAGGACGG TGTGCGGTTC GCCCTCTGGG CACCGACCGC CAAGGACGTG ACCCTCGTTG TCGACGGGAC GGACCATCCG ATCCCCGATG TCGGCGAGGG CTGGCGCCGC ACGACGCTGC CGGGGGTGAA GGCCGGCGCT CGCTATGGCT TCCGCATCGA TGGCGACCTC GTGGTGCCGG ACCCCGCCTC CCGTTTCCAA CCCGAGGACG TCTCAGCACC GTCCGAGGTG ATCGACCCCA ACGCTTATGC GTGGTCCGAC CAAGAGTGGA CGGGACGGCC CTGGGAAGAG GCGATCGTCT ACGAGGTGCA TGTCGGCACG GCGACGCCTG AGGGGACTTA TGCCGGGCTG GAGAAGCGCC TCGACGATCT GGTCGATCTC GGCGTCACGG CGATCGAGTT GCTGCCGCTC GCCGACTTCA AGGGAACGCG CAACTGGGGC TATGACGGCG TGCTGCCCTA CGCCCCGGAC TCGGCCTATG GCCGCCCGGA TGACCTTAAG CGCCTGATCG ACACCGCCCA TTCCAAGGGC CTGATGGTGC TGATCGATTG CGTCTACAAT CACTTCGGTC CGGCCGGGAA CTACCTGCAC GCCTACGCCA AGACCTTCTT CACCGAGCGT CACCAGACGC CGTGGGGTGC CGGCATCAAT TTTGACGGCC AGGAGTCGGG CCCGGTCGTG CGCGACTACT TCATCGAGAA TGCGCTCTAC TGGCTGAAGG AGTATCACTT CGACGGCATC CGCTTCGACG CGGTGCATGC GATCCTCGAT GATTCCGAGA AGCATTTCCT GACGGAACTC GGCGAGACGA TCCGCAAGAC CCTACCGGAT CGGCACGTCC ATCTGATCCT GGAGAACGAG GCCAATCAGG CCCGCTGGCT GGAGCGCGAC GACAGCGCGT CTCCCGTGCT TCACACTGCG CAATGGGCGG ACGACCTGCA CCATTGCTGG CACGTGCTTC TGACCGGCGA GGATGCCGGC TACTACGAAA GCTTCGCCGA CAAGCCAGTG GAGCACCTGG CCCGCTGCCT TGCAGAGGGT TTCGCCTACC AGGGCGAGCC CTTCCCGACC CTCGACAACC ATCCGCGCGG CGAGCCCTCG GCGCATCTGC CGCCTTCGGC CTTCGTCACC TTCCTCCAGA ACCACGATCA GGTCGGCAAC CGGGCGCTCG GCGAACGACT CAGCCATCTC GCCGACCCCA AGAAGCTGGC GCTCGCCCGC GCGGGGCTGC TGCTCGCGCC GCAGATTCCG ATGCTTTGGA TGGGCGAGGA ATGGTCGGCA TCCGCGCCCT TCCTGTTCTT CGTGGATTTC GCGCCCGACG AGGACTTGAA CAAGGCGGTG CGTGAGGGTC GCCGCCGCGA GTTCAAGAGC TTTGCCGCCT TTGCCGACGA CACCTCGGTG ATCCCCGATC CGACGGAGGC GCGAACCTTC GAGGATTCGA AGATCGATTG GGACGAGGCC GAGACCGAGC CGCATCGCGC CATTTGGGCC GACACGCGCA ACATCCTCCA GATCCGCCGG CAGAGCGTGG TGCCCCTGAC CAAGAGCCGC TACCTCGGGG CCGACGCAAA GATCCCGGCG CCGGGTGTCG TGGATTGTAC CTGGCGCTAC GCCGACGGCT GGCTGCGCTT TATCGCGAAT GTCGGTGACG CGGAATTCTC GGCGAGTGCC GATGGCGGCC AGGTGATCTG GTCGAGCGGG GCTGTCCGAC AGGCGATGGA GCTGCCGTCC TGGACGGGCG TCTTCCTGAT CGGTGGCACC 
AAGTGA
|
Protein sequence | MRRAHTMRFG SEFAEDGVRF ALWAPTAKDV TLVVDGTDHP IPDVGEGWRR TTLPGVKAGA RYGFRIDGDL VVPDPASRFQ PEDVSAPSEV IDPNAYAWSD QEWTGRPWEE AIVYEVHVGT ATPEGTYAGL EKRLDDLVDL GVTAIELLPL ADFKGTRNWG YDGVLPYAPD SAYGRPDDLK RLIDTAHSKG LMVLIDCVYN HFGPAGNYLH AYAKTFFTER HQTPWGAGIN FDGQESGPVV RDYFIENALY WLKEYHFDGI RFDAVHAILD DSEKHFLTEL GETIRKTLPD RHVHLILENE ANQARWLERD DSASPVLHTA QWADDLHHCW HVLLTGEDAG YYESFADKPV EHLARCLAEG FAYQGEPFPT LDNHPRGEPS AHLPPSAFVT FLQNHDQVGN RALGERLSHL ADPKKLALAR AGLLLAPQIP MLWMGEEWSA SAPFLFFVDF APDEDLNKAV REGRRREFKS FAAFADDTSV IPDPTEARTF EDSKIDWDEA ETEPHRAIWA DTRNILQIRR QSVVPLTKSR YLGADAKIPA PGVVDCTWRY ADGWLRFIAN VGDAEFSASA DGGQVIWSSG AVRQAMELPS WTGVFLIGGT K
|
| |
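The gene and protein sequences above can be related by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code and differ only in the permitted start codons, and it only translates the first 30 and last 12 bases of the gene sequence as a spot check. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for translation table 11, listed in TCAG codon order; '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCGGCGTG CCCATACGAT GCGATTCGGC"))  # MRRAHTMRFG
print(translate("GGCACC AAGTGA"))                     # GTK
```

The two outputs match the first ten and last three residues of the protein sequence listed above.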