Gene Noca_1166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1166 
Symbol 
ID4599341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1238590 
End bp1239645 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content69% 
IMG OID639775761 
Productalcohol dehydrogenase 
Protein accessionYP_922368 
Protein GI119715403 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.343531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAGA TCATGCAGGC CCTGGTGGTG CGCGAGCCCA ATGTGCTCGA GATCGCCGAG 
GTGCCTGTGC CCGAGCCCGG TCGCAACGAG GTCTTGGCGC GAGTCCGATC CGTCTCGATC
TGCGGCACCG ACGCCCACCT CATCAACGGT GACTACCCCG GGTTCTGGCC GCCGCAGTTC
CCCTTCACCC CCGGCCACGA ATGGGCGGGT GACGTCGTGG CCCTCGGTGA GGGTGCGGAC
ACCTTCGGCT GGCGGGTCGG CGACCGGGTC GCGGGCACGA GCCACAGTGC TTGCGGCGCC
TGCCAGAAGT GCGTCGAAGG CCAGTACAAC CTGTGTGAGA ACTACGGCAG GCCGGCGCTC
CATGCTCAGT ACGGACACAA CGCGCAGGGG GTCAACGCCA CCTATGCCGT CCACAACGTC
AAGTCGATCT TCCGGCTGCC CGATGAGGTG AGCTTCGATG TCGGTGCCCT GGCTGACCCG
GCCAGCATCG CCCTGCACGT AGCACGTCGC GGCAACATCA AGCCGGGGGA CACCGTTGCG
GTGACCGGTG CCGGCGCGAT CGGCCTCCTC GCGGCTGACG CCGCACTGAT CGATGGGGCC
GCGCGGGTGA TCGTGGTGGG GCGCGGCCAC CGTCTGGAGC GGGCTGCTGC GCTGGGGTTC
GAGACCGTCG ACACGCGCAC AGGCGACCCG GTCGCCGAGG TGCGTGGACT GACGAGTGGC
CTCGGCGCCG ACGTGGTCCT GGAATGTGCG GGCGTCCCCG AGACCCTGAT CTGGGGGATG
GCGATGCTGC GCCGCGGCGG TCGCTGCGCG ATGGTCGGCA TCCCGACCGA GGACGTCAGC
CTGAAGTGCC AGCCCCTCGT GCTCGATGAG CTGGAGCTGG TCGGCTCACG CGCCTCTGCC
GGTGAGATGC GCCGCGTCCT GCCCTTCATC GCCAACGGGA GGATGCGGGC CGAGGAGCTC
ATCACCCACC ACTTCCCGCT CTCGGAGTAC GACAAGGCGC TGGCCACGTT CAACGACCGT
GCCAGCGGTG CGATCAAGAT CATCGTCAAC CCCTAG
 
Protein sequence
MTQIMQALVV REPNVLEIAE VPVPEPGRNE VLARVRSVSI CGTDAHLING DYPGFWPPQF 
PFTPGHEWAG DVVALGEGAD TFGWRVGDRV AGTSHSACGA CQKCVEGQYN LCENYGRPAL
HAQYGHNAQG VNATYAVHNV KSIFRLPDEV SFDVGALADP ASIALHVARR GNIKPGDTVA
VTGAGAIGLL AADAALIDGA ARVIVVGRGH RLERAAALGF ETVDTRTGDP VAEVRGLTSG
LGADVVLECA GVPETLIWGM AMLRRGGRCA MVGIPTEDVS LKCQPLVLDE LELVGSRASA
GEMRRVLPFI ANGRMRAEEL ITHHFPLSEY DKALATFNDR ASGAIKIIVN P