Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0025 |
Symbol | |
ID | 6407666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 25778 |
End bp | 26653 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642709932 |
Product | short chain dehydrogenase |
Protein accession | YP_001989063 |
Protein GI | 192288458 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCAC TGCAAGGCAA GACGCTGTTC ATCACCGGGG CGAGCCGGGG AATCGGGCTC GCGATCGCCC TGCGGGCGGC GCGGGATGGA GCCAATGTGG CGATCGCCGC CAAGACCACC GAGCCGCATC CGAAGCTGAA GGGCACGATC TACACCGCGG CCGAGGAGAT CGTCGCGGCG GGCGGCAAGG CGCTGCCGAT GGTCTGCGAC ATCCGCGACG AGGCGCAGGT GATCGACGCG ATCGGCAAGA CGGTGGCGGA GTTCGGCGGT ATCGACATCT GCGTCAACAA CGCCAGCGCG ATCAGCCTGA CGAATTCGCA GGCGACCGAC ATGAAGCGCT ACGACCTGAT GATGGGAATC AACACCCGCG GGACCTTCAT GGTGTCGAAA TATTGCATCC CGCACCTGAA GAAGGCGGCC AATCCGCACA TCCTGATGCT GTCGCCGCCG CTCGACATGA AGGCGAAGTG GTTCGCAGCG TCGACTGCCT ACACCATGGC GAAGTTCGGC ATGAGCATGG TGGCGCTCGG TCTCTCGGGC GAACTGAAGC ACGCCGGCGT TGCGGTGAAT GCGCTGTGGC CGCGCACCAC GATCGCGACC GCCGCGGTCG GCAATCTGCT CGGCGGCGAC GCGATGATCC GCGCGAGCCG CACGCCGGAG ATCATGGGTG ACGCCGCCCA TGCGATCCTG GCCCGTCCCT CGCGTGAATT CACCGGCCAG TTCTGCATCG ACGACAGCGT GTTGTATGAA GCCGGCGTGC GCGACTTCGA GCCATATCGC GTCGATCCGA GCGTGCCGCT GATGTCGGAC TTCTTTGTTC CGGACGACAG CGTTCCGCCG CCCGGCGTGA CCGTCACGCC GCTGCCGATG GGGTAG
|
Protein sequence | MASLQGKTLF ITGASRGIGL AIALRAARDG ANVAIAAKTT EPHPKLKGTI YTAAEEIVAA GGKALPMVCD IRDEAQVIDA IGKTVAEFGG IDICVNNASA ISLTNSQATD MKRYDLMMGI NTRGTFMVSK YCIPHLKKAA NPHILMLSPP LDMKAKWFAA STAYTMAKFG MSMVALGLSG ELKHAGVAVN ALWPRTTIAT AAVGNLLGGD AMIRASRTPE IMGDAAHAIL ARPSREFTGQ FCIDDSVLYE AGVRDFEPYR VDPSVPLMSD FFVPDDSVPP PGVTVTPLPM G
|
| |