Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3667 |
Symbol | |
ID | 3837123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 4213824 |
End bp | 4214678 |
Gene Length | 855 bp |
Protein Length | 284 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637827791 |
Product | Short-chain dehydrogenase/reductase SDR |
Protein accession | YP_428748 |
Protein GI | 83594996 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.847413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATG ACGTGCTACA ACCGCCCCAG GACCAAGACA GCCAGCCCGG TCATGAACTG GAGATGACGC CCCAACCCGA CTGCCAGCCC CGCCACCCCG GTTCGGACCG GCTGGCCGGC AAGGTCGCCT TGATTTCGGG CGGCGACAGC GGCATCGGCC GCGCCGTCGC CCTCGCCTTC GCCCGCGAAG GCGCCAAGAT CGCCCTGCTC TATCTCGACG AACACGACGA CGCCCGGGAA ACCGAGCGCT TGGTGATGGC CGAGGGACGC GAATGTCTGG TGATGCCGGG CGATGTCGGC GACGAGGCGA TCTGCATCGA CGCGGTCGCC CAAGTGATCG GCAATTTCGG CGCTCTCGAT ATCCTGGTCA ACAACGCCGC CGAACAACAC GAGGTCGAGG ACCTGAGCGA CCTGACGGCC GGACAACTGG AAAAGACCTT CCGCACCAAT GTCTTCGGCT ACATCTATCT GGCCAAGGCC GCCCTTCCCC ACCTGCCCAA GGGCGGGTCG ATCATCAATA CCACCTCGAT CACCGCCTAT CAGGGCCACC GCACGCTGAT CGACTACAGC GCCACCAAGG GGGCGATCGT CGCCCTCACC CGCAGCCTGT CGCAATCGTT GCTCGATCGC GGCATCCGGG TCAACGCGGT GGCCCCCGGG CCGATCTGGA CCCCGCTGAT CCCGGCCAGC TTCAGCCCCG ATCACGTCGC CAGTCATGGC GCCTCGGTGC CAATGGGCCG CGCCGGCCAG CCCAATGAGG TCGCCCCGGC CTATGTGTTC CTGGCCAGCG ACGACGCCTC CTATATCAGC GGTCAGGTGA TCCACCCCAA TGGCGGGGCG ATCATCGGGT CTTGA
|
Protein sequence | MSDDVLQPPQ DQDSQPGHEL EMTPQPDCQP RHPGSDRLAG KVALISGGDS GIGRAVALAF AREGAKIALL YLDEHDDARE TERLVMAEGR ECLVMPGDVG DEAICIDAVA QVIGNFGALD ILVNNAAEQH EVEDLSDLTA GQLEKTFRTN VFGYIYLAKA ALPHLPKGGS IINTTSITAY QGHRTLIDYS ATKGAIVALT RSLSQSLLDR GIRVNAVAPG PIWTPLIPAS FSPDHVASHG ASVPMGRAGQ PNEVAPAYVF LASDDASYIS GQVIHPNGGA IIGS
|
| |