Gene Mjls_5066 details

Gene Information

Locus tag: Mjls_5066
Symbol:
ID: 4880764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5312717
End bp: 5313622
Gene Length: 906 bp
Protein Length: 301 aa
Translation table: 11
GC content: 71%
IMG OID: 640142376
Product: short chain dehydrogenase
Protein accession: YP_001073321
Protein GI: 126437630
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
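
The gene length and protein length listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the helper names `gene_length` and `protein_length` are illustrative, not part of any database API):

```python
# Coordinates copied from the Gene Information table.
START_BP = 5312717
END_BP = 5313622


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 906
print(protein_length(length_bp))  # 301
```

Both values agree with the table: 906 bp gives 302 codons, and dropping the stop codon leaves 301 residues.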


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.727463
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.567593
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTGC TCGACGGCCG CGTGGTCATC GTGACGGGTG CGGGCGGCGG CATCGGCCGT 
GCGCACGCAC TGGCATTCGC CGCCGAAGGC GCGCGCGTGG TGGTCAACGA CATCGGTGTC
GGCCTGGACG GATCGCCGGC CGGCGGCGGC AGCGCCGCGC AGAGCGTCGT CGACGAGATC
ACCGCCGCCG GTGGGGAAGC CGTCACCAGC GGTGCCAACG TCGCGGACTG GGCGCAGGCC
GAGGGACTGA TCCAAACGGC GGTCGACTCG TTCGGCGGAC TCGACGTCCT GGTCAACAAC
GCCGGCATCG TGCGGGACCG GATGTTCGCC AACACCAGCG AAGAGGAGTT CGACGCGGTC
ATCGCGGTGC ACCTCAAGGG GCATTTCGCC ACCATGAAGC ACGCTGCGGC GTACTGGCGC
GCACAGTCCA AGGCCGGGAA GACCGTGGAC GCCCGCATCG TCAACACCAG TTCCGGTGCC
GGCCTGCAGG GCAGCGTCGG ACAGGCCAAC TACAGCGCCG CCAAGGCGGG TATCGCGGCC
ATGACGCTGG TGGCCGCCGC CGAGATGGGC CGCTACGGCG TCACCGTGAA CGCCATTGCG
CCGTCGGCGC GGACCCGGAT GACCGAGACG GTGTTCGCCG AGATGATGTC CACCCAGGGC
AACGACTTCG ACGCCATGGC GCCGGAGAAC GTCTCCCCGC TGGTCGTGTG GCTGGGTAGC
ACCGAGTCCC GCGACATCAC CGGGCAGGTG TTCGAGGTCG AAGGCGGCAA GATCCGCGTG
GCCGAGGGGT GGGCCCACGG GCCGCAGGTC GACAAGGGCG CCCGTTGGGA CCCCGCCGAA
CTCGGACCCG TCGTCGCGGA TCTGCTGGCA AAGGCGCGGC CGCCGGTGCC GGTCTACGGC
GCCTGA
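
A GC content figure like the one in the Gene Information table can be recomputed from a sequence printed in this blocked layout. A minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper, and the database's own rounding is not known:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 4 G + 2 C out of 10 bases.
print(gc_content("ATGGGACTGC"))  # 60.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.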
 
Protein sequence
MGLLDGRVVI VTGAGGGIGR AHALAFAAEG ARVVVNDIGV GLDGSPAGGG SAAQSVVDEI 
TAAGGEAVTS GANVADWAQA EGLIQTAVDS FGGLDVLVNN AGIVRDRMFA NTSEEEFDAV
IAVHLKGHFA TMKHAAAYWR AQSKAGKTVD ARIVNTSSGA GLQGSVGQAN YSAAKAGIAA
MTLVAAAEMG RYGVTVNAIA PSARTRMTET VFAEMMSTQG NDFDAMAPEN VSPLVVWLGS
TESRDITGQV FEVEGGKIRV AEGWAHGPQV DKGARWDPAE LGPVVADLLA KARPPVPVYG
A
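
The protein sequence should correspond to the gene sequence translated with translation table 11. A minimal sketch of that translation, assuming the standard amino acid assignments (which table 11 shares with the standard code). It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative:

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...), paired
# with the standard amino acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence codon by codon; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First seven codons and last two codons of the gene sequence.
print(translate("ATGGGACTGC TCGACGGCCG C"))  # MGLLDGR
print(translate("GCCTGA"))                   # A*
```

The first seven residues match the start of the listed protein sequence, and the final codon TGA is the stop codon, which is not counted in the 301 aa protein length.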