Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3542 |
Symbol | |
ID | 5703923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4086063 |
End bp | 4087061 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641272969 |
Product | aldo/keto reductase |
Protein accession | YP_001538335 |
Protein GI | 159039082 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00316128 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAATTTC GACACCTGGG CCGTTCCGGC CTGATGGTCA GCGAGATCTC GTACGGTAAC TGGCTCACCC ACGGCTCCCA GGTGGAGGAG GAGTCGGCCT TCGCCTGCGT CCGGGCCGCC CTGGACGCCG GCATCACCAC CTTCGACACC GCGGACGCGT ATGCGAGCAC CCGCGCGGAG GACGTCCTCG GTCGCGCCCT GCAGAACGAA CGGCGGGCCG GAGTCGAACT GTTCACCAAG GTGTTCTTCC CGACCGGTCC GGGTCACAAC GACCGTGGCC TGTCCCGTAA GCACATCATG GAGTCGATCG ACGGTTCGCT GCGTCGGCTG CGCACCGACT ACGTCGACCT CTACCAGGCG CACCGCTACG ATCACAGCAC TCCGATCGAG GAGACGATGG AGGCGTTCGC CGACGTCGTC CGCTCCGGGA AGGCCCTCTA CATCGGGGTC TCCGAATGGA CGGCGACGCA GCTGCGCCAA GCCCACCAGC TCGCCCGTGA GCTGCGGATT CCGCTGATCT CCAACCAACC GCAGTACTCG ATGCTGTGGC GGGTCATCGA GGCCGAGGTC ATACCGGCCA GCGAGGAGTT GGGCGTCGGC CAGATCGTCT GGTCCCCGAT GGCCCAGGGC GTCCTGTCCG GCAAGTACCG GCCGGGCCAC CCCCCGCCGA CGGGTTCCCG GGCCACGGAC GAGAAGTCCG GCGCGAACTT CATCGCCAAG TGGCTGACCG ACGACGTGTT GACCCGGGTG CAGCAGCTCA AGCCGCTCGC CGAGCAGGCG GGGCTGAGCC TGGCCCAGCT GGCCATCGCC TGGGTGCTGC AGAACCCGAA CGTCTCCTCG GCGATCGTCG GCGCGTCCCG GCCCGAGCAG GTGAACGACA ACGTCAAGGC AGCCGGAGTG CGGCTGGACG CCGACCTGCT CAAGGCGATC GACGACGTCG TCGAGTCGGT CGTCGAGCGG GATCCGGCCC GTACCGAGTC CCCCGCGCGA CGGCCCTGA
|
Protein sequence | MEFRHLGRSG LMVSEISYGN WLTHGSQVEE ESAFACVRAA LDAGITTFDT ADAYASTRAE DVLGRALQNE RRAGVELFTK VFFPTGPGHN DRGLSRKHIM ESIDGSLRRL RTDYVDLYQA HRYDHSTPIE ETMEAFADVV RSGKALYIGV SEWTATQLRQ AHQLARELRI PLISNQPQYS MLWRVIEAEV IPASEELGVG QIVWSPMAQG VLSGKYRPGH PPPTGSRATD EKSGANFIAK WLTDDVLTRV QQLKPLAEQA GLSLAQLAIA WVLQNPNVSS AIVGASRPEQ VNDNVKAAGV RLDADLLKAI DDVVESVVER DPARTESPAR RP
|
| |