Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2731 |
Symbol | |
ID | 5209700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3394142 |
End bp | 3396331 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640596331 |
Product | short chain dehydrogenase |
Protein accession | YP_001277053 |
Protein GI | 148656848 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only [S] Function unknown |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG3347] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.550366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACT CAACGCGCTT CCGCCATGTC CATTATGGTT GGGACGCCGC CTATGCCGCC ACACTCGATC TGGTCGGTCG TCTGGTGTAT CGTTCTAACC TGTTGGGAAG CGATCAGCGC ATCACAAACA CCGGCGGCGG GAACACGTCC GCCAAGATAA CCGGGCGCGA TCCATTGACC GGCGAGCCGG TCGATGTGTT GTGGGTGAAG GGTTCGGGCG GCGATCTCCG CACCAGCACG CGCGCCAATT TTGCATCGCT GGACCTGGGA AAACTGCACA CCCTGCGCTC TATCTATCTG CGCGACCCGG CACGCGGACC GAAGTCTGCA ATCGAAGATG CGATGGTCGA TCTCTACCCA CACTGCACCT TCAACCTCAA TCCACGCGCT TCTTCGATTG ATACGCCGCT CCATGCGTTC ATTCCCTACC GCCATGTCGA TCACATGCAC CCGAATGCGG TCATCGCAAT TGCCGCTGCA CGCCACGGTG AGCGTCTGAC GCGCGAGATT TATGGCGATG AGGTCATCTG GACGCCGTGG CAGCGCCCCG GTTTCGATCT TGGATTGACA CTCGAACGCA TCTGCCGCGA GCATCCGCAG GCGAAAGGCG TTCTTCTCGG CGGTCACGGG CTGATCAACT GGGCGGATGA CGATCAGGAG TGCTACGAGC GCACGCTCGA CCTGATCGAG CGTGCGGCGC GCTATATCGA AGCGCACGAC CGGGGTGAAG CAACATTTGG CGGACCGAAG TATGCCGCCC TGCCGTTCGA CCAGCGGCGC GCGATCTTCG CACGCATTCT GCCCTGGTTG CGGGGACAGA TCAGCCAGCA GCGTCGGTTC ATTGCGACCA TTCAGGATGA TGACGCCACT CTGCGCTTTG TCAACAGCGT CGATGCCCCG CGCCTGGCTG AACTGGGCAC AAGTTGCCCC GACCACTTCC TGCGCACCAA GATCAAGCCA CTCTATGTCG ACTGGAACCC GCACAGCGAG ACCATTGATG ATCTGAAACG CAAGCTGTCC TCCGGGCTTG AGCGGTATCG CGCCGATTAC GCCCGCTACT ACGAAACCTT CCGCCGACCC GACTCGCCGC CGATGCGCGA TCCCAATCCG ACGGTCATCC TCATCCCTGG CTTGGGGATG ATCGCCTGGG GCAAGGACAA GAGCGAGTCG CGGGTGACGG CGGAGTTTTA CACGGCTGCC ATCGAAGTGA TGCGCGGCGC GGAGGCGATT GACGAGTATA CCGCATTGCC GCTGCAAGAA GCGTTCGACA TCGAATACTG GCGCCTGGAA GAGGCAAAAC TGCGCCGGAT GCCGCCGGAG AAGGAACTGG CGCGCCAGGT GATCGCCGTT GTCGGGAGCG GCAGCGGTAT TGGACGCGAA GTGGCGCTGC GTCTGGCAAA CGAAGGTGCG CACATCGTCT GTGTCGATAA AGATGGCGCG GCAGCGCAGG CGACCGCGCA GATGATAATC GAACGGCATG GCATGGGGAT CGGGGTTGCT GGCAGTGACA TTTCCGCCTG CGGACCGGCG ATTGGGCTGA CTGCCGACAT TACTGATCGC GCCAGTGTGC AGGCGATGAT CCAGCAACTG CTGCTTGCCT ATGGCGGGCT TGACGCTGTT GCCGTCACTG CTGGCATTTT CGTCGCACCC GATGCCAGCG GGCGCATCCA CGATGAGCAC TGGGGGTTGA CCTTTGCCAT CAACGTTACC GGGCACTACA TCGTTGCCGA CGAAGCAGCG GCGATCTGGC GCGCACAGGG CTTGCCCGCC AGTCTGGTGC TGACCACCTC GGTCAATGCG GTGGTGGCAA AGAAGGGATC ACTGGCTTAC GACGCCAGCA AAGCGGCAGC CAACCATCTC ATCCGCGAAC TGGCTATCGA ACTGGCGCCG CTGGTACGGG TCAACGGTGT CGCTCCGGCA ACCGTCGTGC AGGGGAGCGG TATGTTCCCC CGCGAACGGG TGATCGCCTC ACTCACAAAG TATGGCATCC CCTTCGCACC AGATGAACCA ACCGAGTCGC TGACGGCAAA ACTGGCTCAG TTCTACGCCG ATCGTTCGCT CCTTAAGCGA CCAGTGACGC CAGCCGATCA GGCGGAAGCG TTCTTCCTCC TGCTCACCCG GCGTCTGGGG CAAACGACTG GACAGATCAT CACGGTTGAT GGCGGATTGC ATGAGGCATT CCTGCGCTGA
|
Protein sequence | MSNSTRFRHV HYGWDAAYAA TLDLVGRLVY RSNLLGSDQR ITNTGGGNTS AKITGRDPLT GEPVDVLWVK GSGGDLRTST RANFASLDLG KLHTLRSIYL RDPARGPKSA IEDAMVDLYP HCTFNLNPRA SSIDTPLHAF IPYRHVDHMH PNAVIAIAAA RHGERLTREI YGDEVIWTPW QRPGFDLGLT LERICREHPQ AKGVLLGGHG LINWADDDQE CYERTLDLIE RAARYIEAHD RGEATFGGPK YAALPFDQRR AIFARILPWL RGQISQQRRF IATIQDDDAT LRFVNSVDAP RLAELGTSCP DHFLRTKIKP LYVDWNPHSE TIDDLKRKLS SGLERYRADY ARYYETFRRP DSPPMRDPNP TVILIPGLGM IAWGKDKSES RVTAEFYTAA IEVMRGAEAI DEYTALPLQE AFDIEYWRLE EAKLRRMPPE KELARQVIAV VGSGSGIGRE VALRLANEGA HIVCVDKDGA AAQATAQMII ERHGMGIGVA GSDISACGPA IGLTADITDR ASVQAMIQQL LLAYGGLDAV AVTAGIFVAP DASGRIHDEH WGLTFAINVT GHYIVADEAA AIWRAQGLPA SLVLTTSVNA VVAKKGSLAY DASKAAANHL IRELAIELAP LVRVNGVAPA TVVQGSGMFP RERVIASLTK YGIPFAPDEP TESLTAKLAQ FYADRSLLKR PVTPADQAEA FFLLLTRRLG QTTGQIITVD GGLHEAFLR
|
| |