Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2544 |
Symbol | |
ID | 5706398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2897358 |
End bp | 2898299 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641272007 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001537377 |
Protein GI | 159038124 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.638785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000422955 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCGTAGGT TCGACTTCAG CGCGGCGACC GCCGTGGTCA CCGGCGCTGC CAGCGGTATC GGCGCAGCCC TCGCCCATGG CCTGGCCGCC CGCGGTAGCG ACCTGGTCCT GCTCGATCGC GACGCCGCGC GCCTGGCCAC CGTCGCGGAC GCGATCCGTG CTGGGCACCC CGATCGGCGC GTCGATCGGG TCGTCGTTGA CCTTGCCGAC GCGGCGGCCA CAGCTCGGGC CGCCGCGCAG GTTCGCGCCC GCCATCCGCG GATCCGGCTG CTGGTCAACA ACGCAGGCGT GGCCTTGGGC GGTCGGTTCG ACCAGGTGAC CCTGGACGAG TTCCAGTGGG TGGTCGAAAT CAACTTCCGG GCGGTCGTGC AGCTCACGCA CGCGTTGCTG CCTGCCCTGA AGGCAGAGCC CGGTTCCCAC CTGGTGAACG TCTCCAGCGT GTTTGGGCTG ATCGCGCCGC CTGGGCAGGC CGCCTACTCG GCGACCAAGT TCGCCGTCCG TGGCTTTACC GAGGCCCTGC GCCACGAACT GATCGCCGAT GGTATCGGTG TCACGTCCGT GCACCCTGGG GGCATCGCCA CCCGGATCAC CGAGAACGCG CGTATCGGCA GTGGTGTCCG TCGGGATGAC TACGAGGAGG GCCGGCGGAA GTTCGACCGT CTGCTCAGCA TCCCACCTGC CCGGGCCGCC GGGGTCATCC TGCGTGGCGT GGAACGCCGC CGGCCTCGCG TCCTTGTCGG CTGGTCGGCG AAGCTGCCCG ACCTGATGGC CCGGATCGCT CCGGGATCGT CCGGGACGCT GCTACGGGCC GGGATCGGCC GGGGTACCGG TGCGCCGGTT CGCCGGCTGA CCACCGTGGC CGCGCCCCCG GAGGAGGCCG TGCCCCCGGC GGTGGCGAAC GACCGGCGCT CCGAGGGAGT GTCGACGGCG GACGAGGCAT GA
|
Protein sequence | MRRFDFSAAT AVVTGAASGI GAALAHGLAA RGSDLVLLDR DAARLATVAD AIRAGHPDRR VDRVVVDLAD AAATARAAAQ VRARHPRIRL LVNNAGVALG GRFDQVTLDE FQWVVEINFR AVVQLTHALL PALKAEPGSH LVNVSSVFGL IAPPGQAAYS ATKFAVRGFT EALRHELIAD GIGVTSVHPG GIATRITENA RIGSGVRRDD YEEGRRKFDR LLSIPPARAA GVILRGVERR RPRVLVGWSA KLPDLMARIA PGSSGTLLRA GIGRGTGAPV RRLTTVAAPP EEAVPPAVAN DRRSEGVSTA DEA
|
| |