Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_4972 |
Symbol | |
ID | 6131674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 5450376 |
End bp | 5451713 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641645108 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001771734 |
Protein GI | 170743079 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGGT CAGGTCTTCG CCTGCTCTCG GTGAACAACT ACTTCTACCG CCGCGGCGGG GCCGACGTCG TCTTTCTCGA GCAGAACGAG TTGTTCGAGG CGGCGGGTTG GGAGGTTGCG CCCTTCGCGA TGCAGCATCC TCAGAATCTG CCCACCCCAT GGGCGTCGTA CTTCGTGGAC GAGATCGAGC TCGGACGCCG TTACGGGGTG GGGCGGAGCA TCACGAATGC CGGCCGCGTG ATGTATTCCC TGCAGGCCCG GTCCCGGCTG GCGAAGCTGC TCGATCGGTT CCGGCCGGAC ATCGCCCACA TCCACAACAT CTACCACCAT CTCTCGCCCT CGATCCTGCC GCTGATGCGG GCCCGGGGGA TCCCGATCGT GATGTCGGTG CACGACATGA AGCTGGTCTG CCCCGCCTAC ACGGCGTTCC GCGACGGCCA GGTCTGCCGC GCGTGCCAGG GCGCGAAACT GTACAATGCC GTGCGCTACC GCTGCCTCAA GGGGTCTCGC CTGCTCTCCG GCTTCGTCGC GCTCGAAACC TCCCTGCATC GGCTGCTGGG CCTCTACGCG TCGAACGTGA GCCGCTTCGT CGTACCCAGT CGCTACTACC GCGACATACT GTTGGAGGCG GGCTGGCCGA GCGAGAAGGT CGCCCACGTT CCGAACTTCG TCGATCCGTC CCACTACGAC CCGTCCCCCG AGATCGGCGA CCGGTTCGTC TATTTCGGTC GCCTCGACCG GCTCAAGGGC ATCGAGACGC TGCTGCGCGC GGCGGCGCAG GCCGGCGTTC CGCTGACCCT GGCGGGGCGC GGCCCCGACG AGGACAGCTT CCGGCAACTC GCCGAGTCGC TCGGCAGCGA CGTGCACTTC GCCGGTCACC TCGACCGGCC GGGCCTGGCC GCCCTCCTGC GGACCGCGCG CGCGGCCGTG CTGCCGTCCG TGTGGAACGA GAATGCCCCG GTCTCCGTGC TGGAGGCCTA CGCGGCGGGC CGGCCCGTCA TCGCCAGCCG GATCGCGGGC ATCCCCGAAC TGATCCGGGA GGGCGAGACG GGCGTGCTCG TGCCGCCCGG CAACGTCGCG GCGCTCGCCG ACGCCCTGTC GGATTTCGCC GCGATGCCGC CGGGACGGGT CGCCGCGCTG GGCGCCGCGG CGCGCGCCTG GGCGGCCCGC GACTTCTCGC CCGAGGCCTA CCGGTCGCGC GTGCTCGGCC TCTACGCCGA GCTCGGTGTC AGCGAGGGGA CGGGAGCCCG GGGGGCCTCC AGTGAGGCGG GTCCCGGCCA GGGCCACGGC CTCGCCGGCG CGGGCTCAGC ATCGTTCGCC GGAGGGGTCG CCGGCTAA
|
Protein sequence | MSRSGLRLLS VNNYFYRRGG ADVVFLEQNE LFEAAGWEVA PFAMQHPQNL PTPWASYFVD EIELGRRYGV GRSITNAGRV MYSLQARSRL AKLLDRFRPD IAHIHNIYHH LSPSILPLMR ARGIPIVMSV HDMKLVCPAY TAFRDGQVCR ACQGAKLYNA VRYRCLKGSR LLSGFVALET SLHRLLGLYA SNVSRFVVPS RYYRDILLEA GWPSEKVAHV PNFVDPSHYD PSPEIGDRFV YFGRLDRLKG IETLLRAAAQ AGVPLTLAGR GPDEDSFRQL AESLGSDVHF AGHLDRPGLA ALLRTARAAV LPSVWNENAP VSVLEAYAAG RPVIASRIAG IPELIREGET GVLVPPGNVA ALADALSDFA AMPPGRVAAL GAAARAWAAR DFSPEAYRSR VLGLYAELGV SEGTGARGAS SEAGPGQGHG LAGAGSASFA GGVAG
|
| |