Gene Namu_2340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2340 
Symbol 
ID8447951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2581351 
End bp2582385 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content75% 
IMG OID645041461 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_003201705 
Protein GI258652549 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0000105685 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00137453 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGGCGG TGGTGTTCGC CGGGGACGGC CAGGTGCGGG TCGATTCGGT GCCCACGCCG 
ACGATCTCGC AGCCGCAGGA CGCGCTGGTC CGGGTCCGCC GGGCGGCCGT CTGCGGCACC
GACCTGCACG CCGTCGCCCA CCCCGACGGG CTGCCGGTCG GGACCGTGCT CGGGCACGAG
TTCGCCGGCG AGGTGATCGA GGTCGGCCCG GCCGTGCAGA CCCATCAGCC CGGGACCATC
GTGTACGGCT CGGATTTCAC CGCCTGCGGG CACTGCTGGT GGTGCCGGGC CGGCGACCAC
TGGGAGTGCC CGCAACGACG GTTCTTCGGC ACCGGCACCG CCTTCGGCCC GGCCCTGCCG
GGCGCGCAGG CGCAGTTGCT ACGGGTCCCG TTCGCCGACA CCGCGCTGCG TGCGGTGCCG
GCCGGGCTGA GCCTGGACGC GGCCGTGTTC CTGGGCGACA CGTTGGCTAC CGGCTACGCC
GCGGTGAGCC GGGCGCAGCT GCGCGCGGGG GGCACCGTCG CGGTCCTGGG CGGCGGCCCG
GTCGGCCAGT TGATCAGCCT GGCCGCCCAG GCATGCGGAG CGGGCGTCGT CGTGGTGGTC
GAACCGGTGG CCGATCGGCG GGAACTGGCC GCCGCGCAGG GCGCGGTGGT CGCCGAACCG
GAACTGGCCC GGACCCTGAT CGACCGGGTC ACCGACGGGC GGGGCGCCGA TGCGGTCATC
GACGCGGTCG GCGGGCCGCG GGCCCTGGAC ACCGCCTGCG CGCTGGTCCG GCGCCGCGGT
TCGGTGATCT CGGTCGGCGT GCACCGGGAC CTGGCCTGGT CGCTGCCGGT GGCCCGGGCC
TTCGCCGACG AGCTGACCCT GCGCTTCGTG ATCGGCGATG CGATGCGTGA CGGCGACGCC
CTGGTGGACC TGGTCCGTTC GGGCGCGATT GACCCCACGG TGCTGGTCTC GGACACGGTC
GGTCTCGACG ACGTGCCCGA GGCGTACCGC CGGATGGCGG ATCGACGTAC GCTCAAGACA
CTCATCGCGG TGTGA
 
Protein sequence
MKAVVFAGDG QVRVDSVPTP TISQPQDALV RVRRAAVCGT DLHAVAHPDG LPVGTVLGHE 
FAGEVIEVGP AVQTHQPGTI VYGSDFTACG HCWWCRAGDH WECPQRRFFG TGTAFGPALP
GAQAQLLRVP FADTALRAVP AGLSLDAAVF LGDTLATGYA AVSRAQLRAG GTVAVLGGGP
VGQLISLAAQ ACGAGVVVVV EPVADRRELA AAQGAVVAEP ELARTLIDRV TDGRGADAVI
DAVGGPRALD TACALVRRRG SVISVGVHRD LAWSLPVARA FADELTLRFV IGDAMRDGDA
LVDLVRSGAI DPTVLVSDTV GLDDVPEAYR RMADRRTLKT LIAV