Gene Namu_4041 details

Gene Information

Locus tag: Namu_4041
Symbol:
ID: 8449660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4454923
End bp: 4455864
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 73%
IMG OID: 645043086
Product: short chain dehydrogenase
Protein accession: YP_003203322
Protein GI: 258654166
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
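
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4454923
end_bp = 4455864

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 942 313
```

Under these assumptions the results agree with the listed Gene Length (942 bp) and Protein Length (313 aa).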


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.138801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0368212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAAGA CCATCGACAT CACCGTCCCC GACCTGAGCG GGCGGCGCGC GGTCGTCACC 
GGGGCCAGTG ACGGCCTCGG CGTCGGCCTG GCCGGCCGGT TGGCCGCGGC CGGCGCCGAG
GTGATCATGC CCGTGCGCAA CCAGCGCAAG GGCGAGGCGG CGATCGACCG GATCCGGCGG
TCGGCACCGG ATGCCACCGT GTCGCTGCGC GAGCTGGACC TGTCGAGCTT GGATTCGGTG
GCTGAACTCG GCCGGACGCT GACCCAGGAG GATCGGCCGA TCCACCTGCT GATCAACAAC
GCGGGGGTGA TGACCCCGCC GGAACGGCAG AACACGGCCG ACGGCTTCGA GCTGCAGTTC
GGGTCCAACC ACCTGGGCCA CGTCGCCCTG GTCGCGCACC TGCTGCCGCT GCTGCGGGCG
GGGCAGGCCC GGGTCACCTC ACAGGTCAGC GTCGCGGCGG CCCGGGGGTC CATCAACTGG
GACGACCTGA ACTGGGAACG GTCCTACGAC GGGATGAAGG CCTACCGCCA GTCCAAGATC
GCGCTCGGAC TGTTCGGGCT GGAGCTGGAC CGGCGCAGCC GAGCCGCCGG CTGGGGCATC
AGCAGCAACC TGGCGCACCC CGGGGTCGCC CCGACGAACC TGCTGGCCGC CCGACCCGAG
GTGGGCCGGG CCAAGGACAC CCTGGGCGTG CGCGTCATTC GCGCCCTGTC CGCGCGCGGG
CTCCTGGTCG GCACGGTCGC CAGCGCCGCG CTCCCGGCGG TGTACGCGGC CACCTCGCCC
GACGCCCAGC CGGGGCGGCT GTACGGGCCC GGCGGGCTGG GCCACCTGGG CGGTGCGCCG
GCGGAGCAGA AGCTCTACCC CACCCTGCGC GGCGACGAGC AGGCCGATCG CATCTGGCGG
GTCTCACAGG AGCTGACCGC GGTGCCGTTC CCGCAGGACT GA
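
A GC content figure like the one listed for this gene could be recomputed from the sequence text. The following is a minimal sketch, not the database's own method: the helper name `gc_content` is hypothetical, and it assumes GC content means the fraction of G and C bases among all bases, after removing the spaces and line breaks used in the display above.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first displayed line only; pass the full gene sequence
# to compare against the GC content field of the record.
first_line = "ATGACGAAGA CCATCGACAT CACCGTCCCC GACCTGAGCG GGCGGCGCGC GGTCGTCACC"
print(f"{gc_content(first_line):.0%}")
```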
 
Protein sequence
MTKTIDITVP DLSGRRAVVT GASDGLGVGL AGRLAAAGAE VIMPVRNQRK GEAAIDRIRR 
SAPDATVSLR ELDLSSLDSV AELGRTLTQE DRPIHLLINN AGVMTPPERQ NTADGFELQF
GSNHLGHVAL VAHLLPLLRA GQARVTSQVS VAAARGSINW DDLNWERSYD GMKAYRQSKI
ALGLFGLELD RRSRAAGWGI SSNLAHPGVA PTNLLAARPE VGRAKDTLGV RVIRALSARG
LLVGTVASAA LPAVYAATSP DAQPGRLYGP GGLGHLGGAP AEQKLYPTLR GDEQADRIWR
VSQELTAVPF PQD
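
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: it uses the standard codon assignments, which translation table 11 shares with the standard code for internal codons, and it does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional NCBI layout: bases ordered T, C, A, G,
# with the first base varying slowest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGACGAAGA CCATCGACAT CACCGTCCCC"))  # MTKTIDITVP
```

The printed residues match the first ten residues of the listed protein sequence. Passing the full gene sequence allows the same comparison over the whole protein.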