Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3002 |
Symbol | |
ID | 5707612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3409991 |
End bp | 3410791 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641272449 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001537817 |
Protein GI | 159038564 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000526062 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGACCACGC AGCCAACGGG CTCCCCCCGG GTCATCGTCG TCACGGGTGG GGCCGCCGGA ATCGGCCGGG CAGTCGTGGA GCGCCTGGCA GCAACTGGTG ACACGGTCGT CGCCGTTGAT CGCGACGAGG ACGCCCTGGG CGAGCTCGAC GACTCCTCCG GCACGTTCTC AGGGCGGGTG GTCCCGCACG TCGCGGATGT CGCCGACGAC TCGTCCCTCG CCGCGCTGGG CGTCCGCCTA CGCTCCGACT TCACACGCCT CAACGGGCTC GTCTGTGCCG CAGGGATCCA GCGCTACGGG ACCGTCGTGG AGACGACCGC CGACCTGTTC GACGAGATCA TGGGCGTCAA CCTGCGGGGC ACGTTCCTCA CCTGCCACCA CCTACTGCCA CTGATGCAGG GCGGCGGGGG GCAGGTCGTC ATCGTGGCGT CCGTACAGGC ATACGCCGCC CAGCACGGCG TCGCCGCCTA CGCGGCGACC AAGGGCGCAC TGCTGTCCCT CGCCAAGGCC ATGGCCGTGG ACCACGCGAC GGAGGGCATC CGGGTCAACG CGGTATGCCC TGGTTCGGTG GACACGCCAA TGCTGCGGTG GGCCGCCACC CTGTTCAGCG GGGACCAGCC AACCGACGAC GTGCTCGCCG ACTGGGGGCG CGCCCACCCT CTCGGACGGG TGGCGCAGCC AGCGGAGGTC GCCGACGTCG CCGCCTTCCT GCTCAGCGAC CAGGCGAGCT TCGTCACGGG CACGGACATT CGCGTCGACG GCGGGCTGAC CGCAGGGCTC CCCGTCGCTT TGCCGAAGTA G
|
Protein sequence | MTTQPTGSPR VIVVTGGAAG IGRAVVERLA ATGDTVVAVD RDEDALGELD DSSGTFSGRV VPHVADVADD SSLAALGVRL RSDFTRLNGL VCAAGIQRYG TVVETTADLF DEIMGVNLRG TFLTCHHLLP LMQGGGGQVV IVASVQAYAA QHGVAAYAAT KGALLSLAKA MAVDHATEGI RVNAVCPGSV DTPMLRWAAT LFSGDQPTDD VLADWGRAHP LGRVAQPAEV ADVAAFLLSD QASFVTGTDI RVDGGLTAGL PVALPK
|
| |