Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_2940 |
Symbol | |
ID | 7115747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 3099811 |
End bp | 3103077 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643525690 |
Product | trehalose synthase |
Protein accession | YP_002421707 |
Protein GI | 218530891 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00795057 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGATC GCAGCGATCC GCAGTGGTAC CGTGACGCCA TCATCTACCA GATCCACGTC AAGTCGTTCT TCGACTCGTC GAATGACGGG ATCGGCGATT TCGAGGGGCT GACGCAGAAG CTCGATTATG TCCGCGACCT CGGGGTCACG GCGATCTGGC TGATGCCGTT CTATCCCTCG CCGCTGCGCG ACGACGGCTA CGACATCGCC GACTACCGCG ATGTGAACCC GTCCTACGGG ACGATGGAAG ATTTCCGCCG CTTCGTGGCT GCTGCACACG AGCGGGGGCT GCGCGTCATC ACCGAGTTGG TCATCAACCA CACCAGCGAT CAGCACCCCT GGTTCCAGCG CGCCCGCGAG GCGCCGGCCG GGTCGCCTGA GCGCGATTTC TACGTGTGGT CGGATACCGA CGAGAAGTAT TCCGATACGC GCATCATCTT CCTCGACACG GAAGTGTCGA ACTGGACCTG GGACCCGGTC GCCAAGCAGT ACTTCTGGCA CCGCTTCTAC AGCCACCAGC CCGACCTGAA CTTCGACAAC CCGGCCGTGC TGGAGGCCGT GATCGAGGTT ATGCGCTACT GGCTCGACAT GGGCGTTGAC GGGCTGCGGC TCGACGCGAT CCCCTACCTG ATCGAGCGCG ACGGCACGAA CTGCGAGAAC CTTTCCGAGA CCCACGACGT CATCAAGGCG ATCCGTGCGG CGCTCGACGC CGAGTATCCC GACCGGATGC TGCTGGCCGA GGCCAACCAG TGGCCGGAGG AAACCGCGCA GTATTTCGGC GAGGGCGACG AATGCCACAT GGCATTCCAC TTCCCGCTGA TGCCGCGGAT GTACATGGCG ATCGCCCGGG AGGACCGGCA CCCGATCACC GACATCATGC GCCAGACGCC CGAAATCCCC GAGGGCTGCC AGTGGGCGAT CTTCCTGCGC AACCATGACG AACTGACGCT CGAAATGGTG ACGGCCGAAG AGCGTGATTA CCTCTGGTCG TTCTACGCCG CCGAGCGGCG CGCCCGCATT AATCTCGGCA TCCGCCGCCG CCTCGCGCCG CTGCTGGAGA ACGACCGCCG TAAGATCGAG CTGATGAAGT CCCTCGTCCT GTCGATGCCC GGCACGCCGG TGCTCTACTA CGGCGACGAG ATCGGCATGG GCGACAACAT CTATCTCGGC GACCGCGACG GTGTGCGCAC GCCGATGCAG TGGTCGCCGG ATCGCAATGG CGGCTTCTCG CGGGCCACCC CGCAGAAGCT GTTCCTGCCC TCGATCCAGG ACCCGATCTA CGGCTACGAC GCGATCAACG TCGAGGCGCA GATCCAGGCG CAGACCTCGC TGCTGAACTG GACGCGGCGC ATGATCGCGA TCCGCAACAA CTCGGTCGCG CTTGGGCGCG GCGCGATCCA GTTCCTGTAT CCGGCGAACC GCAAGGTGCT CGCCTGGATT CGCGAGCACG AGAACGAGCG CATCCTCTGC GTCGCCAACC TCTCGCGGGC ACCGCAGGCG GTGCAGCTCG ATTTGTCGGA TCTGCGCGGC GCGATCCCGA TCGAGCTGAC CGGCGGCACG GCCTTCCCGG CGATCGGCGA ACTGCCCTAC CTGCTGACCC TGCCGGCCTA CGGTTTCTAC TGGTTCGCCC TCTCGGTCGC GAATACCGGC GAGATCGGCC CGCAGCCGCA GAGCCCGGAA CTGTTCACCC TGGTGCTCAC CGGCGGCATC GAGACCCTGA TCAAGGGCCG CGAGCGCACA GCCTTCGAGC GCACCGTGGC GCCGCCCTTT ATCGCCTCGC GCCGCTGGTT CGGCGCCAAG GGCACGCGCA TCAAGTCGGT GCAGGTCGAT GACAGCGCCA TCGTCAAGGA CGGCTCCGGC GAAGGCAAGT TCCTGCTGCC GCGGGTGGCC GTGACCCTCG CCAATGGCGA GCGCCAGGAC TACTTCGTGC CGCTCGCTGT CGAGGAGGGC CGCGAGGACG AGACTCTGCT GGGTCACGCG GTCGCCCGCG TGCGCCGCGG GCCCCGCACC GGCCTGCTCT ACGAGGCGGC GGCCGCCGCG CCGTTCGCCG TCGCCCTGAT CGAGGCGATG CGCGACGGCG TCACGATCCC ATCCGAGCGC GGGAGCCTCG TCTTCTCGAC GACCTCGGCT TACGATCCGG AGGTGCCGTT CGAGGCCGCC GACGTGCGCC GCCTCTCGGC CGAGCAGAGC AACACCTCGA TCGCCATCGG CGCGCGGATG ATGCTCAAGC TGCTGCGCCG CCTCCAGCCC GGCACGCATC CCGAGATCGA GATCGGCCGG TTCCTCACCG AGCAGGCCCA CTTCGCCAAC ACGCCGGCTT TGCTCGGCGT GGTGGAGCAT GTGGCGGCGG ATGGCACCCG CACCGCCCTG GCTCTGCTCC AGAAGTTCGT CCTCAACCAG GGCGACGCCT GGACGCTGAT GCTGGAAGGC CTGCGGCGCG ACTTCGAGAC CGTGGTGCTC GCACCCGAGA GCGAGGCGCC GGCGCCGGAG GATGCCTTCA ATGCGCATCT GCGCTGGGCG GAGCTGCTCG GCCAGCGGAC GGCCGAACTG CATCGGGCCT TCGCCATCGA CACCGACGAT CCGGCCTTCG CGGTCGAGCC CTTCGATGAG GCGGATCTCG CTTCGCTTGC GGATGATGCC CGCCATCAGG CCGCGCGTGC GTTCAAGGGG CTCGACGCGA TCACCGCTTG GTCGCCTGGC TCGGCCAGCG AGGCGCTCGC GTCCCGCCGT AGCGAGGTCG AGGCGCTCAT CACCGATCTG GCCTCCGGCC CGCTGCGGGG TGCCACGAAA ACCCGCATCC ACGGCGATTA CCATCTCGGT CAGGTGCTCG CCTCCGAGGG CGATCTCATC ATCGTCGATT TCGAGGGCGA GCCCTCGCGT CCGGTGGAGC AGCGCCGGGC CAAATCGACG CCGATGCGCG ACGTGGCGGG CATGCTCCGC TCCTTCGCCT ACGGCGCCGA AACGGTGGTG CGTGAGATCA CGGCTCGCTT CGGCGACAGC GAGGAACGGG CGCGCAACGC GGCGATCGCG TGGCGCGGCA TGATCGATGC AGCGTTCCTC GACGGCTACG GGCAAGCCGT GGCCGGCAGC CGGGCGGCGG TGGAGGATGC GGAGACGCAA TCCCGTCTGC TGCGTCTGAG CCTGCTGACC AAGGCGCTCT ACGAGGTCGA TTACGAAGTG AACAACCGCC CCGACTGGAT CGAAATCCCA GCACGTGGTG TTCTCAACAT ACTGGATGAA GCCAAGCGCG ACCGCGCTTC GGTGTGA
|
Protein sequence | MIDRSDPQWY RDAIIYQIHV KSFFDSSNDG IGDFEGLTQK LDYVRDLGVT AIWLMPFYPS PLRDDGYDIA DYRDVNPSYG TMEDFRRFVA AAHERGLRVI TELVINHTSD QHPWFQRARE APAGSPERDF YVWSDTDEKY SDTRIIFLDT EVSNWTWDPV AKQYFWHRFY SHQPDLNFDN PAVLEAVIEV MRYWLDMGVD GLRLDAIPYL IERDGTNCEN LSETHDVIKA IRAALDAEYP DRMLLAEANQ WPEETAQYFG EGDECHMAFH FPLMPRMYMA IAREDRHPIT DIMRQTPEIP EGCQWAIFLR NHDELTLEMV TAEERDYLWS FYAAERRARI NLGIRRRLAP LLENDRRKIE LMKSLVLSMP GTPVLYYGDE IGMGDNIYLG DRDGVRTPMQ WSPDRNGGFS RATPQKLFLP SIQDPIYGYD AINVEAQIQA QTSLLNWTRR MIAIRNNSVA LGRGAIQFLY PANRKVLAWI REHENERILC VANLSRAPQA VQLDLSDLRG AIPIELTGGT AFPAIGELPY LLTLPAYGFY WFALSVANTG EIGPQPQSPE LFTLVLTGGI ETLIKGRERT AFERTVAPPF IASRRWFGAK GTRIKSVQVD DSAIVKDGSG EGKFLLPRVA VTLANGERQD YFVPLAVEEG REDETLLGHA VARVRRGPRT GLLYEAAAAA PFAVALIEAM RDGVTIPSER GSLVFSTTSA YDPEVPFEAA DVRRLSAEQS NTSIAIGARM MLKLLRRLQP GTHPEIEIGR FLTEQAHFAN TPALLGVVEH VAADGTRTAL ALLQKFVLNQ GDAWTLMLEG LRRDFETVVL APESEAPAPE DAFNAHLRWA ELLGQRTAEL HRAFAIDTDD PAFAVEPFDE ADLASLADDA RHQAARAFKG LDAITAWSPG SASEALASRR SEVEALITDL ASGPLRGATK TRIHGDYHLG QVLASEGDLI IVDFEGEPSR PVEQRRAKST PMRDVAGMLR SFAYGAETVV REITARFGDS EERARNAAIA WRGMIDAAFL DGYGQAVAGS RAAVEDAETQ SRLLRLSLLT KALYEVDYEV NNRPDWIEIP ARGVLNILDE AKRDRASV
|
| |