Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4196 |
Symbol | |
ID | 5703850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4763803 |
End bp | 4764567 |
Gene Length | 765 bp |
Protein Length | 254 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273615 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001538968 |
Protein GI | 159039715 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00656937 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACGCCG AGCCCCGTAC AGACCTCGCC GCACACGTCG TTGTCGTCAC CGGCGCCGGA TCTGGCATCG GCCGCGCCAC AGCACACACG TTCGCCCGCG CAGGCGCGCG GGTGCTGGGC ATCGGCCGCC GTAAGGATGC CCTTGAGGAA ACCGCCGCCG GACACCCCGA GATCGCCGTC CATCCGGCCG ACCTGAACGA CCCCGGCGCA CCGCAGGAAG CGGTTGACGC CGCAGTCGAC CGCTGGGGAC AACTGGACAT CCTGGTCAAC AACGCCGGAG CTACCAAGAT CATGCCGCTG GCACACACCA CGGCTGCGGG CATCGCCGCC CTGTTCGACC TCAATGTCGT CGCGCCCAGC CTGCTGGCGC ACGCGGCACT ACCGCACCTG CGCCGGAGCC GCGGCTCGAT CGTCAACGTC TCCAGTACCT ACGGCCACCG ACCCCTGCCA GGCGCCGCTC ACTACGCGGC CTCCAAAGCC GCCCTCGAAC AGCTCACCCG TAGTTGGGCG ATGGAACTCG CGGCTGACGG TATCCGCGTC AACGCCCTCG CTCCCGGCCC GACCGAAAGC CAGGCGCTGG CCGCCGCCGG ACTTCCCGAC CCGGCCATCG AAGAGATCAA ACGTGAAGAG GCCGGACGTA TCCCCCTCGG TCGCCGCGGT GATCCCGAAG AGGTCGCCAC CTGGATCCTG CGCCTGGCCG ACCCGGCCAC CACCTGGCTG ACCGGACAGG TCCTCACTGT CGACGGCGGA CTCGAGCTGA CGTGA
|
Protein sequence | MNAEPRTDLA AHVVVVTGAG SGIGRATAHT FARAGARVLG IGRRKDALEE TAAGHPEIAV HPADLNDPGA PQEAVDAAVD RWGQLDILVN NAGATKIMPL AHTTAAGIAA LFDLNVVAPS LLAHAALPHL RRSRGSIVNV SSTYGHRPLP GAAHYAASKA ALEQLTRSWA MELAADGIRV NALAPGPTES QALAAAGLPD PAIEEIKREE AGRIPLGRRG DPEEVATWIL RLADPATTWL TGQVLTVDGG LELT
|
| |