Gene Noca_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1999 
Symbol 
ID4598315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2140340 
End bp2141527 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content76% 
IMG OID639776603 
Productalcohol dehydrogenase 
Protein accessionYP_923196 
Protein GI119716231 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0396672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCGC TGGAGATGTT CCGGTCGGTC TCGCGGACCA TGGCCGGCAA GGCGATCGGC 
GGCCGGATGC CCGGCGTGCT GTCCGGCTTC GCCGCGCCGC TGCGGCTGGT CACCATCGAC
GAGCCGCGGG TCGAGCGGCC CGGCTGGGCG CGGCTGCGCA CCCGGCTCTC CGGCATCTGC
GGCTCCGACC TGGGCGCGCT CTCGGGCCGC ACCAGCCTGT ACTTCTCCGC GGTCGTGTCG
CTGCCGTTCG TGCCCGGGCA CGAGGTGGTC GCCGAGCTGC TGGACGACTG CGAGGACCTG
CCGGCCGGCA CCCGCGTCGT GGTCGACCCC GTGCTGGGCT GCGCCGCGCG GGGTGTCGAG
CCGTGCGAGG CGTGCGCGGC CGGCGCGACC AACCGGTGCG CCCGGATCAC CGTCGGGCAC
CTCTCACCCG GCCTGCAGAC CGGGTTCTGC CACGACACCG GCGGCGGCTG GGGCCAGCAG
CTGGCCGCCC ACCGCAGCCA GCTCCACCCG GTCCCGGAGG GCTTCTCCGA CGAGCAGGCG
ATCCTGATCG AGCCGGTGGC CTGCGCCGTG CACACGGCGC TGCGCGCCGG GGTCGCCGCC
GGCGACCGGG TGCTGGTCAG CGGGGCCGGG TCGGTCGGGC TGTTCGCCAC GCTCGCGCTG
CGCGAGCTCA CCGAGGCGGG CGAGATCATC GTGGTCGCCA AGCACCCCCA CCAGCGCGAG
CTGGCCCGCG AGCTGGGCGC GACCGAGGTC GTCGCGCCGG GCGAGGTGCT GCGGCGGGTA
CGCCGCTCCA CGGGCGCCTT CCAGCTCGAG CCGGAGTTCT CCACGCCGTA CCTCCTCGGC
GGCGTCGACG TCGCCGTCGA CGCGGTCGGG AGCAAGCAGT CGCTGGAGAG TGCCCTCCAG
GCCACCCGGG CCGGGGGCCG GGTGGTGCTG TCCGGCATGC CCGCCGCCGC CGACCTGTCC
GCCGCCTGGT TCCGCGAGCT CGAGGTGGTC GGCACCTACG CCTCGTCCCG CTCCGACGAC
GCGTTCGGGA GGGCGACCGA GCTGGTCGCC ACCGACGCCG TCCAGCAGCT TGCCAAGAGC
GTCGCCAGCT ATCCGCTGCA CCGGTGGCGC GAGGCGCTCG ACCACGCCCA CTCGGCCGGC
CGGCTCGGCA CGGTCAAGGT GGCCTTCGAC CCCCGCTCGT CCCACTGA
 
Protein sequence
MLALEMFRSV SRTMAGKAIG GRMPGVLSGF AAPLRLVTID EPRVERPGWA RLRTRLSGIC 
GSDLGALSGR TSLYFSAVVS LPFVPGHEVV AELLDDCEDL PAGTRVVVDP VLGCAARGVE
PCEACAAGAT NRCARITVGH LSPGLQTGFC HDTGGGWGQQ LAAHRSQLHP VPEGFSDEQA
ILIEPVACAV HTALRAGVAA GDRVLVSGAG SVGLFATLAL RELTEAGEII VVAKHPHQRE
LARELGATEV VAPGEVLRRV RRSTGAFQLE PEFSTPYLLG GVDVAVDAVG SKQSLESALQ
ATRAGGRVVL SGMPAAADLS AAWFRELEVV GTYASSRSDD AFGRATELVA TDAVQQLAKS
VASYPLHRWR EALDHAHSAG RLGTVKVAFD PRSSH