Gene Information |
Locus tag | Mmcs_3115 |
Symbol | |
ID | 4111947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 3298419 |
End bp | 3299684 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638032245 |
Product | glycosyl transferase family protein |
Protein accession | YP_640278 |
Protein GI | 108800081 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
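The length fields in the Gene Information table can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; both assumptions agree with the values listed but are not stated in this record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(3298419, 3299684)
print(gene_len)                     # 1266
print(protein_length_aa(gene_len))  # 421
```

Both results match the Gene Length (1266 bp) and Protein Length (421 aa) fields for Mmcs_3115.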
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.448802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAATTCG TGGTGGCCGT CCACGGCACC CGTGGCGATG TCGAACCCTG TGCGGCCGTC GGGCTCGAAC TCGCCCGGCG CGGGCACGAG GTGCGGACTG CCGTGCCGCC CAACCTCATA TCCTTCGTCG AGGAATGCGG ACTCGGTGTC CCGGTGTCCT ACGGCGTCGA TTCGCAGCAA CAACTCGACG CGGACATCTT CCGCGAGTGG TACCGGCTGC GGAACCCGAT GACGGTGCTG CGCGAAGCGC GCGAGTACGT CGTCGAGGGA TGGGCGGAGA TGAGCCGTTC GCTCGACGCG CTTGCCGACG GCGCCGACCT GATCCTCACC GGTACGACGT ACCAGGAACT CGCCGCCAAT GTGGCTGCGG CGCACGCTAT CCCGCTGGCC GCGCTGCACT ACTTCCCGGT GCGCCCGAGC ACCAAGTCGC TGCCGGTACC CGTGCCGTCC GCGGTGGTCG GCCCGGTCTG GGCCGTCGGT GAGTGGGCGC ACTGGCGGGT GCTCAAACAG GCCGAGGACG AGCAACGGCG CGAGCTGGGC CTGCCTCCGG CGAGCACCCG CGCGGTGCGT CGCATGCTCG ACGACGGCGC GTTGGAAATC CAGGCCTACG ACCGGGTTTT CTTTCCCGGA CTGGCCGAGG AGTGGGGTCC GCAGCGGCCC CTGGTCGGCG GTATCACCCT CGAGAAGAAC ACCGACGCCG ACGACGATGT GGTCTCCTGG ACAGCCGCCG GGACACCGCC CGTCTACTTC GGATTCGGCA GCATGCCGGT GAAGTCGCCC GCCGACGCGG TGGCGATGAT AGAAGCGGCG TGCGCCGATC TCGGCGAGCG GGCGCTGATC TGCTCGGGAG TGTGGGACGT CGACGAACTG CCGCACGCTG CGCACGTGAA GATCGTGCGG AGCGTCAACC ACGCGGCGGT CTTCCCGTTG TGCCGCGCCG TGGTTCACCA CGGTGGCGCG GGTACGACGG CGGCCGGTGT CCGCGCCGGT GTTCCCACGC TGGTGCTGTG GGTGGGTGCC GAACAACCGA TCTGGGGTTC GCGGGTCAAA CACCTCGGTG TGGGTGATTA CCAACGGTTC TCGTCCACCA CACGCAAATC GCTGCGTCGC GCCCTGAGCA GGGTGCTGGG ACCGCGATAC GTCGAGCGCG CACGCGAGGT CGCCGCAGCG ATGACGAAAC CGGCCTCGAG TGTGGGTACC GCGGCCGACC TTCTCGAAGA TGCGGCGCGT CAGGAGCGCC GACATGGTCA GACGATCTCG CCGTAG
|
Protein sequence | MKFVVAVHGT RGDVEPCAAV GLELARRGHE VRTAVPPNLI SFVEECGLGV PVSYGVDSQQ QLDADIFREW YRLRNPMTVL REAREYVVEG WAEMSRSLDA LADGADLILT GTTYQELAAN VAAAHAIPLA ALHYFPVRPS TKSLPVPVPS AVVGPVWAVG EWAHWRVLKQ AEDEQRRELG LPPASTRAVR RMLDDGALEI QAYDRVFFPG LAEEWGPQRP LVGGITLEKN TDADDDVVSW TAAGTPPVYF GFGSMPVKSP ADAVAMIEAA CADLGERALI CSGVWDVDEL PHAAHVKIVR SVNHAAVFPL CRAVVHHGGA GTTAAGVRAG VPTLVLWVGA EQPIWGSRVK HLGVGDYQRF SSTTRKSLRR ALSRVLGPRY VERAREVAAA MTKPASSVGT AADLLEDAAR QERRHGQTIS P
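The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this and also shows how a GC fraction can be computed from a sequence written in space-separated blocks of ten. It assumes the standard codon assignments, which table 11 shares with table 1, and the table 11 start codon set as listed by NCBI; the helper names are hypothetical, and ambiguous bases such as N are not handled.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons for translation table 11 (assumed from the NCBI listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAATTCG TGGTGGCCGT CCACGGCACC"))  # MKFVVAVHGT
# GC fraction of the first 10 bases only (4 of 10 are G or C).
print(gc_content("GTGAAATTCG"))  # 0.4
```

Passing the full gene sequence to the same functions should allow the listed protein sequence and GC content to be checked in the same way.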