Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4434 |
Symbol | |
ID | 4596953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4687184 |
End bp | 4688134 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 639779045 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_925618 |
Protein GI | 119718653 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.79228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGC CATGGAGGAC CGCGGCCACC TGGTCGCTCG CGGACATCCC GCCCCAGGAC GGGAGGACGG TGCTGGTCAC CGGCACCACG GTCGGCGGGC TGGGGCAGTT CACCGCGCTC GAGCTGGCCC GCCGCGGCGC CCGGGTGGTG CTCGCCGGCC GGACCGAGCA GCGGCTGGAG GAGACCCGGG CGGCGATCAC GACCGAGGTC CCGGCCGCCG CGCTGGAGAC CCTGGTCGTC GACCTGGCCG ACCTGGCGTC GGTACGCCGG GCCGCGGTCC AGGCGGCCGG CCTCGGCCCG ATCGACGTGC TGGTCAACAA CGCCGGCGTG ATGGGGACGC CGTACCACCG CACCGGGGAC GGGCTCGAGC TCCAGCTCGC GACCAACCAC TTCGGGCCGT TCCTGCTCAC CGGCCTGCTG CTGCCGCAGC TCGTCGCGAG CGGAGCCGGC ACCGTCGTCA CGGTCTCCTC GCAGATGCAC CGGGTCGCGC GCTCCGCGCC GCTGGACGAC CCCCGCAGCC AGCACGGCCG CTACCAGCGC TGGCCGACCT ATGCCCGGTC CAAGCTGGCC AACCTGCTGT TCACCTACGA GCTGGACCGG CGGGCCCGCC GCGCCGAGCT GCCGGTCCGC GCACTCGCCG CGCACCCCGG CTTCGCGGCC ACCCACCTCG CGGCGAACGG GCAGTACGGC CGCGCCCGCG GCGGCCGGGC CACGATCCTG GACGCGGCGA TCAAGGCGAT CTCGCCCAAT CGCGCCGACG AGGGCGCCTG GCCGACCCTG ATGGCCGCGA CCGCGGACCT GCCGGGCGGC ACCTACTGCG GGCCGAGCGG GTTCGCGGAG GGCGGCGGCG TCCCGCACGT GACCACCAGC AACAAGCTGT CCCACGACCA GGCCGCCCAG CGGCGGCTGT GGGAGCTCAG CGAGCAGACC ACCGGCATCC GCTACCCCTG A
|
Protein sequence | MTTPWRTAAT WSLADIPPQD GRTVLVTGTT VGGLGQFTAL ELARRGARVV LAGRTEQRLE ETRAAITTEV PAAALETLVV DLADLASVRR AAVQAAGLGP IDVLVNNAGV MGTPYHRTGD GLELQLATNH FGPFLLTGLL LPQLVASGAG TVVTVSSQMH RVARSAPLDD PRSQHGRYQR WPTYARSKLA NLLFTYELDR RARRAELPVR ALAAHPGFAA THLAANGQYG RARGGRATIL DAAIKAISPN RADEGAWPTL MAATADLPGG TYCGPSGFAE GGGVPHVTTS NKLSHDQAAQ RRLWELSEQT TGIRYP
|
| |