Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_4034 |
Symbol | |
ID | 4112864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 4300445 |
End bp | 4301371 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638033177 |
Product | LmbE-like protein |
Protein accession | YP_641195 |
Protein GI | 108800998 |
COG category | [S] Function unknown |
COG ID | [COG2120] Uncharacterized proteins, LmbE homologs |
TIGRFAM ID | [TIGR03445] 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGCCG ATTCGGCGGT CGCGTGTGTC ATCTCGGCGC TACGGGCCGT TGGCGTAGAT TCACCGCTGA TGTCAGAACT CGTCCCTCGC CTGCTGTTCG TCCACGCCCA TCCGGACGAC GAGACGCTCA CGACCGGCGG CACCATCGCT CATTACGTCC GCCGCGGTGC GGACGTTCGC GTGGTCACCT GCACGCTCGG CGAGGAAGGT GAGGTGATCG GGGAGCAGTA CGCCCAGCTC GCTGTCGACC ACGCCGACCA GTTGGGCGGT TACCGGATCG CCGAACTCAC CGCCGCGCTC GCCGCGCTGG GAGTGGATGC ACCGCATTTT CTCGGTGGCC CCGGCCACTG GCGCGACTCC GGGATGGCCG ACACCCCGGC GCGCCACCAG CCGCGCTTCG TCGACGCCGA CATGGCCGAG GCCGCCGGCC TGCTCGCCGC GATCCTCGAC GACTTCCGCC CGCACGTGGT GGTCACCTAC GACCCCGACG GAGGGTATGG ACACCCGGAC CACGTCCAGA CCCACCGCGT CACCACCGCG GCCGTCGAGC GTGCGCAGTG GCAGGTGCCC AAGTTCTACT GGACCGTCAT GTCCAGGAGC GGGATGGGCG ACGCCTTCGC CGTTGCCCGC GACGTCCCCG AGGAGTGGTT GCAGGTCAGC GTCGACGACG TGCCTTTCCT CTATACCGAC GACCGGATCG ACGCCGTCGT CGACGTCAGC GACAGCATCG AGGCGAAGGT CGCCGCCATG CGCGCCCACG CCACGCAGAT CTCGGTCGCG GCCAACGGCC AGTCCTGCGC GCTGTCGAAC AACATCGCGA TGCCGATCCC CGGCGTGGAG CACTACGTGC TCGTCTCCGG TGCGCCGGGC CCACGCGACG CGCGTGGCTG GGAAACCGAC CTGCTCGCCG GCGTGAACCT GGCGTAG
|
Protein sequence | MFADSAVACV ISALRAVGVD SPLMSELVPR LLFVHAHPDD ETLTTGGTIA HYVRRGADVR VVTCTLGEEG EVIGEQYAQL AVDHADQLGG YRIAELTAAL AALGVDAPHF LGGPGHWRDS GMADTPARHQ PRFVDADMAE AAGLLAAILD DFRPHVVVTY DPDGGYGHPD HVQTHRVTTA AVERAQWQVP KFYWTVMSRS GMGDAFAVAR DVPEEWLQVS VDDVPFLYTD DRIDAVVDVS DSIEAKVAAM RAHATQISVA ANGQSCALSN NIAMPIPGVE HYVLVSGAPG PRDARGWETD LLAGVNLA
|
| |