Gene Information |
Locus tag | Mmcs_5121 |
Symbol | |
ID | 4113950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 5415397 |
End bp | 5417187 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638034279 |
Product | trehalose synthase-like protein |
Protein accession | YP_642281 |
Protein GI | 108802084 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase |
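
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag Mmcs_5121.
start_bp, end_bp = 5415397, 5417187

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(f"Gene length: {length_bp} bp")
print(f"Protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1791 bp) and Protein Length (596 aa) fields listed in the record.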
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCACA GCAGCGGCTC ACCCGCCCAC CCCGATCACG ATCCGGCCGA GGGCAGCCAC ATCGAGGACG GGGTGGTCGA ACATCCGACC GCAGGGGACT TCGGCCACGC GCGGATGGTC CCCGAGGACC GGACGTGGTT CAAGCGGGCC GTGTTCTACG AGGTGCTCGT GCGTGCGTTC CACGATTCGG ACGCCGACGG TTCCGGTGAC CTGCGCGGGC TGACCGAACG ACTGGACTAC CTGCAGTGGC TCGGCGTCGA CTGTCTGTGG CTGCCGCCGT TCTACGATTC ACCGCTGCGC GACGGTGGAT ACGACATCCG CGACTTCTAC AAGGTGCTGC CCGAATTCGG CACCGTCGAG GACTTCGTCA CGCTGCTCGA CGCCGCCCAC CGCCGCGGCA TCCGGGTGAT CACCGACCTG GTGATGAACC ACACCTCGGA CTCCCACCCG TGGTTCCAGG AGTCGCGCCG CGACCCGGAC GGACCCTACG GCGACTTCTA CGTCTGGAGC GACACCAGCG ACAGGTACGC CGACGCGCGG ATCATCTTCG TCGACACCGA GGAGTCCAAC TGGACCTTCG ACCCGGTGCG GCGGCAGTTC TATTGGCACC GCTTCTTCTC CCACCAGCCG GATCTGAACT ACGACAACCC GGCCGTGCAG GAGGCGATGC TCGACGTGCT GCGCTTCTGG CTCGACCTCG GCATCGACGG GTTCCGGCTC GACGCCGTGC CGTACCTGTT CGAACGCGAG GGCACCAACT GCGAGAACCT GCCGGAGACC CATGCGTTCC TGCGGCACTG CCGCAAGGTG ATCGACGACG AGTATCCGGG CCGGGTGCTG CTGGCCGAGG CCAACCAGTG GCCGGCCGAC GTGGTCGCGT ACTTCGGTGA CCCGGACACC GGCGGCGACG AGTGCCATAT GGCGTTCCAT TTCCCGCTGA TGCCAAGGAT TTTCATGGCC GTCCGGCGCG AGTCGCGGTT CCCGATCTCC GAGATCCTCG CGCAGACACC GGAGATCCCG GATATGGCGC AGTGGGGGAT CTTCCTGCGC AACCACGACG AGTTGACCCT CGAGATGGTC ACCGACGAAG AACGTGACTA CATGTACTCC GAATACGCCA AAGACCCACG GATGAAAGCG AATGTCGGCA TCCGGCGGCG TCTGGCACCA CTACTGGAGA ACGACCGCAA TCAGATCGAA TTGTTCACCG CGCTGCTGCT CTCACTCCCC GGGTCACCGG TGCTGTACTA CGGCGACGAG ATCGGCATGG GCGACATCAT CTGGCTCGGT GACCGCGACG GTGTCCGCAC CCCGATGCAG TGGACGCCGG ACCGCAACGC GGGCTTCTCG AAGGCCACGC CCGGCCGCCT GTATCTGCCG CCCAACCAGG ACGCCATCTA CGGTTACCAG GCGGTGAATG TCGAAGCGCA GCGGGACAGT TCGAATTCGC TGCTGAACTG GACGAAGACC ATGCTCGGGG TGCGCAGACG CCACGACGCG TTCGCGATCG GCGCGTTCCG CGAACTCGGC GGGTCGAACC CGTCGGTGCT GGCGTTCGTG CGTGAGACCG CCACCGACAC GGTGCTCTGC GTCAACAACC TGTCCCGCTT CCCGCAGCCC ATCGAACTGA ATCTGCAGCA GTGGAACGGT TTCACGCCGG TCGAGATGAC CGGCTACGTC GACTTCCCGA GTATCGGGGC GCTGCCCTAC CTGCTGACCC TGCCCGGCCA CGGGTTCTAC TGGTTCCAGC TACGCGCCCC CGACCCCGAA CCCGAAGGAG TGCAGCCATG A
|
Protein sequence | MDHSSGSPAH PDHDPAEGSH IEDGVVEHPT AGDFGHARMV PEDRTWFKRA VFYEVLVRAF HDSDADGSGD LRGLTERLDY LQWLGVDCLW LPPFYDSPLR DGGYDIRDFY KVLPEFGTVE DFVTLLDAAH RRGIRVITDL VMNHTSDSHP WFQESRRDPD GPYGDFYVWS DTSDRYADAR IIFVDTEESN WTFDPVRRQF YWHRFFSHQP DLNYDNPAVQ EAMLDVLRFW LDLGIDGFRL DAVPYLFERE GTNCENLPET HAFLRHCRKV IDDEYPGRVL LAEANQWPAD VVAYFGDPDT GGDECHMAFH FPLMPRIFMA VRRESRFPIS EILAQTPEIP DMAQWGIFLR NHDELTLEMV TDEERDYMYS EYAKDPRMKA NVGIRRRLAP LLENDRNQIE LFTALLLSLP GSPVLYYGDE IGMGDIIWLG DRDGVRTPMQ WTPDRNAGFS KATPGRLYLP PNQDAIYGYQ AVNVEAQRDS SNSLLNWTKT MLGVRRRHDA FAIGAFRELG GSNPSVLAFV RETATDTVLC VNNLSRFPQP IELNLQQWNG FTPVEMTGYV DFPSIGALPY LLTLPGHGFY WFQLRAPDPE PEGVQP
|
| |