Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3557 |
Symbol | |
ID | 5705050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4105014 |
End bp | 4106021 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641272984 |
Product | aldo/keto reductase |
Protein accession | YP_001538350 |
Protein GI | 159039097 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.729958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0108203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA CACGGACACT GGGCCACAGC GGAATCGAGG TCAGTGCGAT CGGAATGGGT TGCTGGGCGA TCGGGGGGCC GCTGTGGGGC GACGGCGGGC AGCCGTTCGG CTGGGGCGAC GTCGACGACG ACGAATCGGT GCGCACCGTC CACGCCGTAC TCGACCACGG CGGGACCTTC TTCGACACCG CCAGCAACTA CGGCGCCGGG CACAGTGAGC GGATCCTCGG CCGCGCCCTC GCCGGCCGCC GGGACCAGGT GGTGATCGCC ACCAAGTTCG GCAACTGCTT CGAGGAGACG ACCCGCCGGT GGACCGGGAC CGACCACCGT CCCGAGCACG CCGTGACGAG CCTGGAGGCG TCGCTGCGCC GCCTCGGGAC CGACCACGTC GACCTCTACC AGTTGCACCT CAACGAACTG CCGACGTCCG CCGCGCTCGA CCTGGTCGAC ACGCTGGAGG ACCTGGTCAG CAACGGCAAG ATCCGGGCGT ACGGCTGGAG CACCGACAAT CCCGAGTCGG CGGCGGCGTT CGCGGCGGCC GGCCCGCACT GCGCCACCGT CCAGCACGAC CAGTCGGTGT TGGCGGACAA CGCGGCAGTG CTGGCTATCT GCGACACGTA CGACCTGGCG AGCATCAACC GGGGCCCGCT GGCGATGGGT CTGCTCACCG GCTCGACCCG GGCGGTCGGC TCCGACGACA TTCGCGGAGT GGCTCCACCG TGGCTGGTCT GGTTCACCGA CGGCCAACCC ACACCGCGGT GGTCTCGGCG CGTGGCGGAG ATCCGGGACG TGCTCACCGC CGACGGTCGC ACCCTGGCGC AGGGCGCGCT GGGCTGGTTG CTGGCCCGCA GCCCGCGGAC CGTCCCGATC CCGGGCTGCC GCACCGTCGC CCAGGCAGCG GAGAACATCG GCACGCTCAC CCGTGGTCCG CTCCCAACGG ACGCGTACGC CGAGGTCGAG CGGCTGCTGT CGGATCTTCG GCAAACGCCA GCCGAACCGG TCAGGTGA
|
Protein sequence | MTMTRTLGHS GIEVSAIGMG CWAIGGPLWG DGGQPFGWGD VDDDESVRTV HAVLDHGGTF FDTASNYGAG HSERILGRAL AGRRDQVVIA TKFGNCFEET TRRWTGTDHR PEHAVTSLEA SLRRLGTDHV DLYQLHLNEL PTSAALDLVD TLEDLVSNGK IRAYGWSTDN PESAAAFAAA GPHCATVQHD QSVLADNAAV LAICDTYDLA SINRGPLAMG LLTGSTRAVG SDDIRGVAPP WLVWFTDGQP TPRWSRRVAE IRDVLTADGR TLAQGALGWL LARSPRTVPI PGCRTVAQAA ENIGTLTRGP LPTDAYAEVE RLLSDLRQTP AEPVR
|
| |