Gene Noca_4173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4173 
Symbol 
ID4596687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4406983 
End bp4407990 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content76% 
IMG OID639778779 
Productshort chain dehydrogenase 
Protein accessionYP_925357 
Protein GI119718392 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGCC GCGTGGTCGT GGTGACCGGC GCGAGCGGCG GCATCGGCCG GGCCTGCGCC 
CGGGCGTTCG CGGCCCGCGG CGACGACGTG GCGCTGCTGG CCCGCGGGGA GACCGGGCTG
GAGGCGGCCG CCGCCGAGGC GACCGACGCC GGCGTCCGGG CGCTGCCGGT CGAGGTCGAC
ATGGCCGACG CGGCCGCGGT CGAGGCGGCC GCACTCCGGA TCGAGGCCGA GCTCGGCCCG
ATCGAGGTGT GGGTCAACGT CGCGTTCACC TCGGTGTTCG CGCGGTTCGT GGACATCGCG
CCCGAGGAGT TCGCCCGGGT GACCGAGGTC AGCTACCTCG GCTACGTCAA CGGCACCCGC
AGCGCGCTGC GCCGGATGAC GGCCCGCGAC CGCGGCACGA TCGTCCAGGT CGGCTCGGCG
CTGGCCTACC GGGGCATCCC GCTGCAGTCG GCGTACTGCG GCGCCAAGCA CGCGATCCAG
GGGTTCCACG AGTCGCTGCG CACCGAGCTG CTGCACGACG GCAGCCGGGT GCACGTGACG
ATGGTGCAGA TGCCGGCGGT GAACACCCCG CAGTTCGACT GGGTGCGCTC CCGGCTGCCG
CGGCACGCGC GGCCGGTGCC GCCGATCTAC CAGCCCGAGT TGGCCGCGGA CGCGGTCGTG
TACGCCGCCG ACCACCCGAG CCGGCGCGAG TACTGGGTCG GGGAGACCAC GGCGCTCACC
CTGCTCGCGA ACGCCGTCGC CCCGGGGTTG CTGGACCGCT ACCTGGCCCG CACCGGCTTC
AAGAGCCAGC AGGCCGACCG GCGCCGCGAC CCCGACCAGC CCGAGAACCT GTGGGCGCCG
GCCGACGGCG CGGCCGGCGC GGACTTCGGC GCGCACGGCG ACTTCGACGC CCGCTCGCAC
CGCCGCTCGC CGCAGGTGTG GGCCTCCCAG CACCACGGAC TGCTCGGCGC AGCCGCCGCC
GGCGGGATCG CGCTGGCCGG CGCGCTCGCC CGCAGGCGGG CCTCGTGA
 
Protein sequence
MTGRVVVVTG ASGGIGRACA RAFAARGDDV ALLARGETGL EAAAAEATDA GVRALPVEVD 
MADAAAVEAA ALRIEAELGP IEVWVNVAFT SVFARFVDIA PEEFARVTEV SYLGYVNGTR
SALRRMTARD RGTIVQVGSA LAYRGIPLQS AYCGAKHAIQ GFHESLRTEL LHDGSRVHVT
MVQMPAVNTP QFDWVRSRLP RHARPVPPIY QPELAADAVV YAADHPSRRE YWVGETTALT
LLANAVAPGL LDRYLARTGF KSQQADRRRD PDQPENLWAP ADGAAGADFG AHGDFDARSH
RRSPQVWASQ HHGLLGAAAA GGIALAGALA RRRAS