Gene Noca_3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3651 
Symbol 
ID4595763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3875558 
End bp3876655 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content73% 
IMG OID639778259 
Productalcohol dehydrogenase 
Protein accessionYP_924838 
Protein GI119717873 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR03451] mycothiol-dependent formaldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.198781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAGG TCAAGGCCGT GATCGCCCGC GGCAAGGGGC AACCCGTCGA GGTGACCACC 
ATCAACGTGC CCGACCCGGG CCCGGGCGAG GCACTCGTCC AGGTGCAGGC CTGCGGGGTC
TGCCACACCG ACCTGCACTA CCGCGAGGGC GGCATCAACG ACGACTTCCC GTTCCTCCTC
GGTCACGAGG CCGCCGGCGT CGTCGAGGCG GTCGGCCCGG ACGTGACGGC CATCGCCCCG
GGCGACTTCG TGATCCTGAA CTGGCGGGCC GTGTGCGGCG AGTGCCGGGC CTGCGAGCGC
GGCGAGCCGT GGTACTGCTT CGCGACCCAC AACGCCACCC AGCGGATGAC CCTCGCCGAG
GGTCCCGACG CCGGCACCGA GCTCGCGCCG GCCCTCGGGA TCGGCGCGTT CGCCGAGAAG
ACCCTGGTCG CGGCCGGCCA GTGCACGAAG GTCGACCCGT CGGCCCGGCC GGCCGCCGTC
GGGCTGCTCG GCTGCGGGGT GATGGCGGGG ATCGGAGCCG CGATCAACAC CGGCGCGGTC
ACCCGCGGGA AGTCCGTCGC GGTCATCGGC TGCGGCGGCG TCGGCGTGGC CGCGATCGCC
GGCTCGGCGC TCGCCGGGGC CTCGCCGATC ATCGCGGTCG ACATCGACGC CCAGAAGCTG
GAGGCCGCGC GCCGGATGGG CGCCACCCAC GTCGTCGACT CCAGCCGGAC CGACCCGGTC
GCGGCGATCC AGGAGCTCAC CGGCGGCTTC GGCGCGGACG TCGTCATCGA GGCCGTCGGC
CGCCCGGAGA CCTGGAAGCA GGCGTTCTAC GCCCGCGACC TGGCCGGCAC GGTGGTGCTG
GTCGGCGTAC CGACGCCCGA GATGAAGGTC CCGGACCTCC CGCTCATCGA CGTCTTCGGC
CGGGGCGGGT CGCTGAAGTC GAGCTGGTAC GGCGACTGCC TGCCCAGCCG CGACTTCCCG
ATGCTCGTCG ACCTCTACCA GCAGGGCCGG CTGGACCTGG ACGCCTTCGT CAGCGAGGAG
ATCGGCATCG GCGACGTCGA GGCGGCGTTC GAGCGGATGC ACGAGGGCGG CGTGTTGCGC
TCGGTGGTGA TCCTCTGA
 
Protein sequence
MQQVKAVIAR GKGQPVEVTT INVPDPGPGE ALVQVQACGV CHTDLHYREG GINDDFPFLL 
GHEAAGVVEA VGPDVTAIAP GDFVILNWRA VCGECRACER GEPWYCFATH NATQRMTLAE
GPDAGTELAP ALGIGAFAEK TLVAAGQCTK VDPSARPAAV GLLGCGVMAG IGAAINTGAV
TRGKSVAVIG CGGVGVAAIA GSALAGASPI IAVDIDAQKL EAARRMGATH VVDSSRTDPV
AAIQELTGGF GADVVIEAVG RPETWKQAFY ARDLAGTVVL VGVPTPEMKV PDLPLIDVFG
RGGSLKSSWY GDCLPSRDFP MLVDLYQQGR LDLDAFVSEE IGIGDVEAAF ERMHEGGVLR
SVVIL