Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_3997 |
Symbol | |
ID | 4611937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 4212650 |
End bp | 4213915 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639793681 |
Product | glycosyl transferase family protein |
Protein accession | YP_939979 |
Protein GI | 119870027 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG2246] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.254168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCTCA TGACAGAACT CGCCGTGGAA CTCGACGCTG ATCGCGCCGG CTTCGGGCCC CGTCCCGACG TCGTGGAGGC GGCCCGCGCC GCCGGTGTGC CGGTGCTCGA CGTCGTCGTA CCGGTCTACA ACGAGCAGGC CGCGCTGGCG GCGTCGGTGC GCCGCCTGCA CCGGCATCTG CACGACCACT TCCCGTTCCC GGCCCGCATC ACCATCGCCG ACAACGCCAG CGCGGACGCC ACCCCGCGGA TCGCCGCGCA GCTGGCCGCC GAACTGCCCG ATGTGCGGGT GGTGCGGCTC GAGGAGAAGG GCCGCGGTCG TGCCCTGCAC GCGGTCTGGT CGCAGTCCGA CGCACCGGTG CTGGCCTACA TGGACGTCGA CCTGTCCACC GATCTGGCCG CCCTCGCACC GCTGGTCGCC TCCCTGATCT CCGGGCACTC CGACCTAGCG ATCGGCACCC GGCTGAGCCG CGGTTCGCGC GTGGTGCGCG GCGCCAAGCG CGAGTTCATC TCGCGGTGCT ACAACCTGAT CCTGAAATCG ACTCTGGCCG CGGGCTTTTC CGATGCCCAG TGCGGGTTCA AGGCGATTCG CGCCGACGTC GCCCGTCAGC TGCTGCCGTA CGTCTCCGAC ACCGGATGGT TCTTCGACAC CGAACTGCTG GTCCTGGCCG AACGCAGCGG TCTGCGCATC CACGAGGTCC CGGTCGACTG GGTCGACGAC CCCGACAGCC GCGTCGACAT CGTCGCCACC GCGACCGCCG ACCTCAAGGG AATCGGTCGA CTGCTGCGCG GATTCGCGAA CGGGTCGATC CCGGTGCAGT TGCTCGCCGA CCAGTTGGCG CCGTCGCGGT CGGCGGCGGC ACCCCGATCG CTGCTGCGCC AGGCCGTCCG GTTCGGGGCG GTGGGTGTGG TGTCCACGCT GGCTTATCTA CTGCTGTTCA TGTTGACCCG CGGCTGGCTC GGCGCTCAGG CCGCCAACCT GATCGCGCTG GCGGTGACGG CGGTCGGCAA CACGGCGGCC AACCGGCGCT TCACCTTCGG CGTCGCGGGT CGCCGCGGCG CCGCGCGCCA CCACTTCGAA GGTTTCATCG TGTTCGCGAT CGCGCTCGGC ATCACCAGCG GGTCGTTGGC TCTGCTGCAC TCCGTCTCCG CCGAACCGCA CCGGTTGGTC GAACTCGGCG TCCTGGTGGC CGCCAACCTC GCCGCGACCG TCATGCGGTT CGTTCTGTTG CGCGGCTGGG TCTTCCACCC GCGGCGCACC CGCTGA
|
Protein sequence | MVLMTELAVE LDADRAGFGP RPDVVEAARA AGVPVLDVVV PVYNEQAALA ASVRRLHRHL HDHFPFPARI TIADNASADA TPRIAAQLAA ELPDVRVVRL EEKGRGRALH AVWSQSDAPV LAYMDVDLST DLAALAPLVA SLISGHSDLA IGTRLSRGSR VVRGAKREFI SRCYNLILKS TLAAGFSDAQ CGFKAIRADV ARQLLPYVSD TGWFFDTELL VLAERSGLRI HEVPVDWVDD PDSRVDIVAT ATADLKGIGR LLRGFANGSI PVQLLADQLA PSRSAAAPRS LLRQAVRFGA VGVVSTLAYL LLFMLTRGWL GAQAANLIAL AVTAVGNTAA NRRFTFGVAG RRGAARHHFE GFIVFAIALG ITSGSLALLH SVSAEPHRLV ELGVLVAANL AATVMRFVLL RGWVFHPRRT R
|
| |