Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3074 |
Symbol | |
ID | 5210042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3862887 |
End bp | 3863888 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596665 |
Product | aldo/keto reductase |
Protein accession | YP_001277387 |
Protein GI | 148657182 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00558733 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00303424 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGTACC GTCTGTTTGG TCGCACCGGC GTGCGTGTGG CGCCATTGTG CATTGGCGCG ATGAACTTCG GCAATCCAAC CGACGAAGCG GAAGCGCTGC GCATCATTGA TCGCGCCCTC GACGCCGGGA TCAATATGTT CGACACCGCC AACAGTTACA ACAACGGGGA GAGTGAACGC ATCATCGGGC GCGCCCTGGC GCGCGACGGA AAGCGTGACC GGGTGTTCCT CGCCACCAAG GGACATTTTC CCGTCGGACC CGGACCCAAT GATCGGGGCA ATTCGCGCCT GCACCTGATG CGCGCCTGTG AGGACAGCCT CCGCCGCTTG CAGACCGATC ATATCGATCT TTATCAGATC CATCGTCCCG ATCCCGCCAC ACCGGTCGAA GAGACCCTGG CAGCGCTGAC CGATCTGGTG CGTCAGGGAA AGGTGCGCTA TGTCGGGTGT TCGACCCACC CCGCCTGGCG AGTGATGGAA GCGCTGATGG TAAGCGAGTT GAAGGGGTAT GTGCGCTACG TCTCGGAGCA ACCGCCCTAC AACCTGCTTG ATCGACGCAT CGAAAATGAA CTGTTGCCGC TCTGTCAAAC GTATGGTCTG GCAATTATTC CGTGGGCGCC GCTGGCTCAA GGGGTGCTGG CGGGGCGCTA TACCGATATT GCTGCGCCTC CGCCCGACTC GCGCGTCGCC CTGCGCGGCG GCATCTATGC CGAACGAGTC ACTGCGCGCG GCATCGAGGT CGGGCGCGCC TTTGCCGGGC TTGCGCGCGA GCATGGTCTC ACACCTGCGC AGCTTGCTCT GCTGTGGGTC AAGGATCAAC CCGGCATTAC GGCGCCGATC TTCGGTGTGC GCACCATTGC GCAACTGGAA GAAGCGCTGC CGGTGCTGGA GATGACGTTG AGCGATGATC TGCGCGTCGC GTGTGATGCG CTCGTGCCGC CGGGCAGCGC GGTCGTCGAT TTCCACAACA CATCGGGCTG GATGAAGATG CGACTTCCGT AA
|
Protein sequence | MEYRLFGRTG VRVAPLCIGA MNFGNPTDEA EALRIIDRAL DAGINMFDTA NSYNNGESER IIGRALARDG KRDRVFLATK GHFPVGPGPN DRGNSRLHLM RACEDSLRRL QTDHIDLYQI HRPDPATPVE ETLAALTDLV RQGKVRYVGC STHPAWRVME ALMVSELKGY VRYVSEQPPY NLLDRRIENE LLPLCQTYGL AIIPWAPLAQ GVLAGRYTDI AAPPPDSRVA LRGGIYAERV TARGIEVGRA FAGLAREHGL TPAQLALLWV KDQPGITAPI FGVRTIAQLE EALPVLEMTL SDDLRVACDA LVPPGSAVVD FHNTSGWMKM RLP
|
| |