Gene Information |
Locus tag | Mrad2831_4799 |
Symbol | |
ID | 6140867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium radiotolerans JCM 2831 |
Kingdom | Bacteria |
Replicon accession | NC_010505 |
Strand | - |
Start bp | 5122582 |
End bp | 5125374 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641630509 |
Product | cellulose synthase catalytic subunit (UDP-forming) |
Protein accession | YP_001757442 |
Protein GI | 170751182 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03030] cellulose synthase catalytic subunit (UDP-forming) |
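
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the reported Gene Length counts the terminal stop codon; both assumptions fit the numbers listed here but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length(5122582, 5125374)
print(length)                  # 2793
print(protein_length(length))  # 930
```

Both results match the Gene Length (2793 bp) and Protein Length (930 aa) rows.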
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.105872 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGCG CGACGCCCCT CGGCGGCCCG GCGCCCGCCC AGGCCCTTCC AACCGCCATC CCGGGCGCCA TCACGGGCGC CCCGGCGGCG CGCCCGCGCC TCCTGATGGC CCTCCTGTGG GCGACCTGGG CGCTCTGCGC CGCCCTCGCC CTCGGCTTCC TGACGCAGCC GGTGGGCCTG ACGGCCCAGG GCGTGCTCTG CGCCGCCGCC GGAGCCGGGA TGCTCGCGCT CTGGCTGCTG TTCCCCCGCC GGGGCCTCGC CCGGATGGCC TTCCTGGCGC TCGGCACGGC CGTGGTCATC CGTTACGCCT ACTGGCGGGT GACCGGCACG CTGCCGACCC TCGACGATCC CGTCAGCTTC GGGCTCGGCG TCGTGCTGGC GCTGGCGGAA CTCTACTGCG TGCTGATCCT CACGGTCAGC CTGATCATCA ACGTCGCGCC GCTGGCCCGC GGCGCCGCGC CGGTCCTGCC CGAGGCGGAC CTTCCCACCG TCGACATCTT CATTCCGTCC TACAACGAAT CCGCCGAGAT CCTGGGCCTC ACGCTGGCGG CGGCGCGCAA CCTCGACTAC CCGGCCGGCC GCGCCACCGT CTGGCTGCTG GACGACGGCG GCACGGACCA GAAATGCGCC GACCCGGACC CCGCCAGGGC CGGCGCGGCG CGGGCCCGCC GCGCCGCGCT GCAGGCGCTC TGCGCCGGGC TGGGCGTCCG CTACCTGACC CGGGCGCGCA ACGCGCACGC CAAGGCCGGC AACCTCAACA ACGGCCTGAC CCAGGCCCGC GCCGACCTCG TTCTGGTGCT CGACGCCGAC CACGCGCCGT TCCGGCCGTT CCTGCGGGAG ACCGTGGGCC TGTTCGCCCG CGACCCGAAG CTGTTCCTGG TGCAGACCCC GCACGTCTTC ATCAACCCGG ACCCGATCGA GCGGAACCTG CGGACCTTCA CCCGGATGCC GTCCGAGAAC GAGATGTTCT ACGGGGTCAC GCAGGCCGGC CTCGACAAGT GGAACGGCTC GTTCTTCTGC GGCTCGGCCG CGCTCCTCCG GCGGAGCGCC CTCGACGCGG TCGGCGGGTT CTCGGGCGTC ACGATCACGG AGGATTGCGA GACCGCCTTC GAGCTGCACG CCCGCGGATG GACCAGCGCC TATGTCGACC GGCCGCTGAT CGCCGGCCTC CAGCCCGAGA CCTTCGCCGA CTTCATCGGC CAGCGGGCGC GCTGGTGCCA GGGCATGTTC CAGATCATGC TCCTGAAGAA TCCGCTGTTC AAGCGCGGCC TGAAGCCGAT CCAGAGGCTC TGCTACCTGT CGAGCATGAC CTTCTGGTTC TTCCCGCTGC CGCGCCTGAT CTTCATGCTC GCGCCGCTGC TGCACATCTT CTTCGATGTG AAGATCTTCG TCTCCTCGAT CGACGAGGCG CTGGTCTACA CGGCGACCTA CGTGGTCGCC AACATGATGA TGCAGAACTA CCTCTACGGG CACGTGCGCT GGCCGTGGGT CTCCGAACTC TACGAGTACG TGCAGGGCGT CTACCTCGCC CGCTCCATCG TCTCGGTGGT GCTGTCGCCC CGCAAGCCGA GCTTCACCGT CACCAACAAG GGCCTCGGGC TCGACCGCGA CCACCTGTCG GGGCTGGCGT GGCCGTTCTT CGCGATCTTC GGGGCGCTGG CGGCGGGCTG CGCGACGGCC GCGTGGCGCT ACCTCTACGA GCCGGGCGTC ACCAGCCTGA TGCTGGTCGT CGGGCTCTGG TGCCTGTTCA ACCTCGTCAT CGCGGGCGCG GCCCTCGGCG TCGTGGCCGA GCGCCGTCAG 
ACCGAGCGCA GCCACAGCCT GCCGGTGAAC CGGCGCGCCG TCGCGTCGGT CGGCGGCGCC GTGTTCGAGG TCGTGGTGGA GCGCGCCTCC GCCGAGGGCT GCCGCCTGCG CCGCTGCGAC GGGTCGACCT GGCCGGCCGC GGCCGAGGCC GGGAGCCCGG GCCGGATCGT CCTCGCCGAG GGGTCCGGCG CCGTTCTCGC CTTCCGCCCG CGCGCCGGCC TGTCGGGCGA CACCTGGGAG GTCGCGTGGG AAGTCGCGCC TGCGGAAGCC GGCGCCCCGC TGTTCCGCGG GCTGGCCGAG CTGATCTACG GCGACGTCTC CGCCCTGCAG GCGTTCCTGT CGGGCCGGCG CCGGCCGAAG GACCTGCTCT CCGGAAGCCT CCGCTTCCTG GCCTGGGGCA TCACCGAGCC GGTCCGGGCC GTCACCTACG CGCTCCGGGA CCGCGGCGCC GCTCCGGCGG CCGCTCCGGA GGTCCCGGCC GCGATCGTCC CGGCGCCGGC GGCCCCCGCG GAGATCCCGC AGGCGGCGTC GATCCCCCTG CCGGTCGTCG ACGTGCCCGT CGCCGAGCGG GCTGCCGCGG CCGCCGCCCC GGCGGTGCCG GCGCCCGTCG CGGTGACCGC TCCGGTCGGC GCCCCGGCGA CCGCCGAGCC GGCGCCGGCC AAGCCCGTGC CCGCGCCCGC GCCCGCGCCC GCGCCGGCGC CGACCCCGCT CCCGGCGGCC GAGCCGGACG TCTCCGTCGC GATGCCGCTG ACGGAAGCGG CGTGGAGGCA GGCGGTCGCC AGGCTCGCCG CGGCCGAGGC CCCCGCCGCG GCATCCGGCA CGGACGCTCC CGAGCCGCGC CTCGACCCGG CCGCCTGGCT CGCGGGGATC TTCGCCCTCG CCGGCGCCGA GCGCCCCGAG CCGGCCCGCG CCGCCGACAT CGTCCGCGGC CCCGACTTCC GCGCGCTCGT CCGCGCGCCC GCCGATCCCG CCGCCATCCG CTCCGCCGCC TGA
|
Protein sequence | MARATPLGGP APAQALPTAI PGAITGAPAA RPRLLMALLW ATWALCAALA LGFLTQPVGL TAQGVLCAAA GAGMLALWLL FPRRGLARMA FLALGTAVVI RYAYWRVTGT LPTLDDPVSF GLGVVLALAE LYCVLILTVS LIINVAPLAR GAAPVLPEAD LPTVDIFIPS YNESAEILGL TLAAARNLDY PAGRATVWLL DDGGTDQKCA DPDPARAGAA RARRAALQAL CAGLGVRYLT RARNAHAKAG NLNNGLTQAR ADLVLVLDAD HAPFRPFLRE TVGLFARDPK LFLVQTPHVF INPDPIERNL RTFTRMPSEN EMFYGVTQAG LDKWNGSFFC GSAALLRRSA LDAVGGFSGV TITEDCETAF ELHARGWTSA YVDRPLIAGL QPETFADFIG QRARWCQGMF QIMLLKNPLF KRGLKPIQRL CYLSSMTFWF FPLPRLIFML APLLHIFFDV KIFVSSIDEA LVYTATYVVA NMMMQNYLYG HVRWPWVSEL YEYVQGVYLA RSIVSVVLSP RKPSFTVTNK GLGLDRDHLS GLAWPFFAIF GALAAGCATA AWRYLYEPGV TSLMLVVGLW CLFNLVIAGA ALGVVAERRQ TERSHSLPVN RRAVASVGGA VFEVVVERAS AEGCRLRRCD GSTWPAAAEA GSPGRIVLAE GSGAVLAFRP RAGLSGDTWE VAWEVAPAEA GAPLFRGLAE LIYGDVSALQ AFLSGRRRPK DLLSGSLRFL AWGITEPVRA VTYALRDRGA APAAAPEVPA AIVPAPAAPA EIPQAASIPL PVVDVPVAER AAAAAAPAVP APVAVTAPVG APATAEPAPA KPVPAPAPAP APAPTPLPAA EPDVSVAMPL TEAAWRQAVA RLAAAEAPAA ASGTDAPEPR LDPAAWLAGI FALAGAERPE PARAADIVRG PDFRALVRAP ADPAAIRSAA