Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_5544 |
Symbol | |
ID | 7119326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011758 |
Strand | - |
Start bp | 168076 |
End bp | 169326 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643528212 |
Product | glycosyl transferase family 28 |
Protein accession | YP_002424208 |
Protein GI | 218533393 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00888029 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGGTTCC TGCTCGCGGT GCTGGGCACA CATGGAGACG TTCTGCCCTT CCTCGCCCTC GGGAGCGCGT TGCTGCGGCG GGGGCATGAG GTCGCGCTCA GCGCTCCTGC TCCATTCGAG AAACACGCGG ATCGCGCCGG GCTGTCCTTC CACGCGATCG GTACCCAGGC CGACTTCGAT CGCGTCGTGC GCGAGCCGGA ACTCTGGCAC CCGCGCCGCG GCATTGAGAC CGCGTTCCGG TATGGCCTGG ATTTCGCGCC GGACGTGTTC CGATGGATCG AGGCCAACAG CGCGAAGCCC TGCATCGTCA TCGCGTCCCC GAGCAGCTTC GGGGCGCGGA TTGCGCAGGA TCGCTTCGGC CTACCGTTGG TGACGCTGCA CGTCATGCCG CTCCTGATCG AGAGCCGGTA CGATCCGCCC AGACTGCCGG GCCTCCCGCT TCCCGATTTT CTCCCCGCTC GGTTTCGGCA TTGGGTTGGC CGGGGAGCAG ACAAGTACGT GATCGATCCA GCCGCCTTGC CGAGGCTCAA CGCGTTTCGA GCTAGCCTCG ACCTGCCGCC CGTGCGGCGC CTGCGTCATT GGTGGAACAG CCCGACCCAG GTGCTCCTGA TGTTTCCGGA GTGGTTTGCT CCCCCTCAGC CGGATTGGCC GAAGCAGGCC GTACAAGTCG GCTTCCCGAT GTCGGACCGC TTCGGCGACG TTGGAGAGCT CAGCCCGGAG CTGGCCGCCT TTCTCAACGC TGGCGAGCCG CCGTTGGCTT TCACCTACGG TTCTGGGATG AGGCAAGGGC AGGCCTTCTT CGAGACAGCC GTTGCCGCGT GCGCGCGTCT CGGCCACCGC GGCGTACTTC TGGCGCCTCA GACCGGACAG GTCCCCGCAG GTCTGCCAAC GAGCATCCTG CATCTGCCCT ACGCTCCGTT CAGCAAGCTT CTTCCGCACT GCTCCGCACT TGTTCACCAT GGCGGGATCG GAACGGTCGC GCAGGCGCTG GCCGCAGGCA TCCCCCAGCT GGTCGTGCCG GTCGCCTTCG ACCATTTCGA CGAGGCGCGG CGCCTGAAGC ATCTAGGTCC GGGTAGGGCG CTGAGCCGAC GCCGCTTCAC CCCGGCTCGG GCTGCGCGTG AGATCCGCCG CATGCTGAGG GATCCTAAGG TCCAGGAAGC ATGTAACCAG GCTAAATGCC GGTTGATCGA CGAGGATGGC GTGCAAGCAG CCTGTGACGC TGTCGAGCGC CTTTTGGCCA TCCGCCCGTG A
|
Protein sequence | MRFLLAVLGT HGDVLPFLAL GSALLRRGHE VALSAPAPFE KHADRAGLSF HAIGTQADFD RVVREPELWH PRRGIETAFR YGLDFAPDVF RWIEANSAKP CIVIASPSSF GARIAQDRFG LPLVTLHVMP LLIESRYDPP RLPGLPLPDF LPARFRHWVG RGADKYVIDP AALPRLNAFR ASLDLPPVRR LRHWWNSPTQ VLLMFPEWFA PPQPDWPKQA VQVGFPMSDR FGDVGELSPE LAAFLNAGEP PLAFTYGSGM RQGQAFFETA VAACARLGHR GVLLAPQTGQ VPAGLPTSIL HLPYAPFSKL LPHCSALVHH GGIGTVAQAL AAGIPQLVVP VAFDHFDEAR RLKHLGPGRA LSRRRFTPAR AAREIRRMLR DPKVQEACNQ AKCRLIDEDG VQAACDAVER LLAIRP
|
| |