Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_0415 |
Symbol | |
ID | 4615298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 459008 |
End bp | 460984 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639790090 |
Product | cellulose synthase (UDP-forming) |
Protein accession | YP_936422 |
Protein GI | 119866470 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.269998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGCACC AACTGCCCGA AATCTCCCCC GGCAAATACC CGTCGCCCTG GTTGCGGCTG CTGATCCTCT GCACCGCGCT GTTGGGCATC AACTACATCG TGTGGCGCTG GTTCGGGTCG ATCAACTGGG CCGCCTGGTG GATCGCGGTA CCGCTGGTGA TCGCCGAGAC CTACAGCGTC ATCGACTCGC TGCTGTTCGC GATGACGATG TGGAAGATGT TGCGGCGCAA CCCACCTCCG CCGCCGCCCG ACGACGCGAC CGTCGACGTC TTCATCACCA CCTACAACGA GCCGATCGAC ATGGTGCTGG AGACGGCCGA GGCCGCCCAG CGGATCCGCT TCCCGCACTC GACCTGGATC CTCGACGACG GCGACCGCCA CGACCTGGCC GAAGCGGCCG CCGAGCGCGG CATCGGCTAC ATCACCCGAT CGTCGAGTTG GACCCCCGAC AAACCGCGCC ATGCCAAGGC GGGCAACCTC AACAACGCGC TGTTCGAGAC CCACGGCGAG TTCATCCTCG TACTCGACGC CGACCAGGTG CCCGAACCCG AGATCCTCGA CAGGACCCTG GGCTACTTCC GCGATCCGCA CATGGCGCTG GTGCAGACGC CGCAGTACTT CCACAACGTC CCGTTCAGCG ACCCGCTGGG TAGCCAGGCG CCGCTGTTCT ACGGGCCGAT CCAACAGGGT AAGGACGGGT GGAACGCGGC CTACTTCTGC GGGTCGAATG CGGTGTTGCG CCGGGAAGCG CTGATGCGAT TGGGGATTCG CGGATACGTG CGCGCCGTCG AGGAGGGCGT CCGGCGGACG CTCTATGCGG CCCGCAAGAT GATCAGGACC GCGCGTAAAC AACCGGGCGC CGACCAGCCC GAGGTGCAGG AGGCGCTGGA GTCGGTGCTG CAGGCGGTGC GCGACGCGCG CCGTCAGTTG CGGGACAAGC GGGCCCTGGC CGACATCACC TTCGACTTCC AGCAGCGCGT CGACGCGGCC GCACGCACGG TGGTCGACGC CGACATCACC GCCATGCGGG CCGATCTCGA GGTGATCACC GCGCTGAGCG AACACCCCGA GGCCACCGCC ACGACAGTGG TGTTCGACGA CGAAGCGCTG GAGTCGCTGG CGGGCCGGGA GTGGTCCCCG CTCGGCGCGA TCGAGTCGAT CGGCGCGATG ATCCGCGCCG TCGACGTGGG CCGCGACGAC GAGGCGCAAC CCATGCTGCC GATGGCCACC ATCTCGGTCA CCGAGGACAT GGCCACCTGT ATGCGGCTGC ACGCACTGGG CTGGCGCTCG GCCTACCACC ACGAGGTCCT CGCCCGCGGT CTGGCGCCCG ACGACGTGCG GACCATGCTC ACCCAGCGGC TGCGCTGGGC GCAGGGCACC ATCCAGGTGA TGCTGCGGGA GAACCCGTTC GTGCAGAAGG GACTCTCGAT CGGCCAGAAG CTCATGTACT GGGCGACCAT GTACAGCTAT CTGGCCGGGT TCGCCGCGCT GGCCTACATC GCCGCCCCGG CGATCTACCT GATCTTCGGC ATCATGCCGG TGACCGCGTA CAGCTGGGAC TTCTTCGGGC GGCTCATCCC GTTCCTCGTG CTCAACCAGC TGATGTTCAT CATCATCAGC CGCGGCACCC CGACCTGGCG CGGCCAGCAG TACAGCCTCG CGCTGTTCCC GGTGTGGATC CGGGCCTGTT ACACGGCGTT CCTCAACGTG GTGTTCGGGC GACCGCTGGG CTTCGCGGTC ACCCCGAAGA CCAGACAGGA GGCGACGGCG ATCCCGTGGC ACCTGGTGAA GTGGCAACTC GCCGCGATGG CCATGCTGGT TGTCGCATCG ATCATCGGCA TCGTGCAGCT GTACTTCGGT GCGATCTCCG TGCTCGGTGT CGGTGTGAAC CTCTTCTGGG TGATATTCGA CCTGTTGATT CTGAGCGTGG TGATCCAGGC GGTGCGCTTC CGCGGGCACC AGGACGAAGG AGTGTGA
|
Protein sequence | MRHQLPEISP GKYPSPWLRL LILCTALLGI NYIVWRWFGS INWAAWWIAV PLVIAETYSV IDSLLFAMTM WKMLRRNPPP PPPDDATVDV FITTYNEPID MVLETAEAAQ RIRFPHSTWI LDDGDRHDLA EAAAERGIGY ITRSSSWTPD KPRHAKAGNL NNALFETHGE FILVLDADQV PEPEILDRTL GYFRDPHMAL VQTPQYFHNV PFSDPLGSQA PLFYGPIQQG KDGWNAAYFC GSNAVLRREA LMRLGIRGYV RAVEEGVRRT LYAARKMIRT ARKQPGADQP EVQEALESVL QAVRDARRQL RDKRALADIT FDFQQRVDAA ARTVVDADIT AMRADLEVIT ALSEHPEATA TTVVFDDEAL ESLAGREWSP LGAIESIGAM IRAVDVGRDD EAQPMLPMAT ISVTEDMATC MRLHALGWRS AYHHEVLARG LAPDDVRTML TQRLRWAQGT IQVMLRENPF VQKGLSIGQK LMYWATMYSY LAGFAALAYI AAPAIYLIFG IMPVTAYSWD FFGRLIPFLV LNQLMFIIIS RGTPTWRGQQ YSLALFPVWI RACYTAFLNV VFGRPLGFAV TPKTRQEATA IPWHLVKWQL AAMAMLVVAS IIGIVQLYFG AISVLGVGVN LFWVIFDLLI LSVVIQAVRF RGHQDEGV
|
| |