Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2521 |
Symbol | |
ID | 4076523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2661587 |
End bp | 2662363 |
Gene Length | 777 bp |
Protein Length | 258 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638007845 |
Product | short chain dehydrogenase |
Protein accession | YP_614515 |
Protein GI | 99082361 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCAT TGGTAACCGG AGCGGGTAAG CGTTTGGGGC GGGCAATGGC CTTGGCGCTT GCAGAGGACG GCTATGACGT TGCCGTCCAC TATGCCTCAT CGGAGCAAGA CGCCAACGAT GTCGTGGCGC AAATCGAGGC ACGAGGCCAG CGGGCCGTCG CGTTGCGAGC GGACCTGTTG CGAGATACAG AAACCGAGGC GTTGCTCCCG CAGGCAGCTG AAGCGCTCGG TGGGGACATC ACCTGTCTCA TCAACAACGC GTCAATTTTT GAACCTGATG ATCTCGCCTC CGTGACCCGC GACAGTTGGG ATCGTCATAT GCAGAGCAAT TTGCGAGCGC CTGTGCTGTT GCTTCAGGCG TTGGCCGCGC AGTCTCTGCC GGATCTCTCG GATGAGGCCG GCGAGCCCCG CGCTGCGGCT GTGGCGATCA ACATGGTTGA CCAGCGGGTC AACAAGCTCA CCCCGGACTT CCTGAGCTAC ACACTGGCGA AATCCGCGCT TTGGACCCTG ACGCAGACGG CAGCGCAGGC CTTGGCGCCA CGGATCAGGG TGAATGCAAT TGGACCGGGA CCGACGCTTC AGGGGCCGCG CCAGAGTGTT GCCGACTTTG CTGCGCAACG TCGCGCCACC CCGCTGCAGC GCGGCGCGGG AGAGCAGGAC ATCACCTCGG CGCTGCGCTA TCTGGTCGGC GCGCCGGCCA TTACCGGACA GCTCATTTGC GTCGATGGCG GGCAGCATTT GGCCTGGCAG ACGCCAGATG CGTTGCTACC GGAATAA
|
Protein sequence | MRALVTGAGK RLGRAMALAL AEDGYDVAVH YASSEQDAND VVAQIEARGQ RAVALRADLL RDTETEALLP QAAEALGGDI TCLINNASIF EPDDLASVTR DSWDRHMQSN LRAPVLLLQA LAAQSLPDLS DEAGEPRAAA VAINMVDQRV NKLTPDFLSY TLAKSALWTL TQTAAQALAP RIRVNAIGPG PTLQGPRQSV ADFAAQRRAT PLQRGAGEQD ITSALRYLVG APAITGQLIC VDGGQHLAWQ TPDALLPE
|
| |