Gene Information |
Locus tag | TM1040_0491 |
Symbol | |
ID | 4078237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 513650 |
End bp | 514654 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005787 |
Product | aldo/keto reductase |
Protein accession | YP_612486 |
Protein GI | 99080332 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
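The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bases."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(513650, 514654)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the result agrees with the listed Gene Length (1005 bp) and Protein Length (334 aa).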
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00588566 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.160086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGGC CAAACTCTGC GTTGCGCGCG CCCGGTCTGC GTGTCAGTCT CCGGCGCATG ACTGCTCAGA CTTGCCCCCC GCTCACCACC CTTGATGGCC ATGACGTTGG CCGGTTCGCC TTTGGCTGCA TGCAATTTGG CGGACGCGCC GATGCGCAGG CCTCTGCCGA AATGTATGAG GCCTGCCGCG CAGCAGGTCT GCGCCATTTT GATACCGCGT GGCTCTATAC CGAGGGCGCC AGCGAAGAGA TCCTCGGTCA GTTGATCGCC AAGGACCGCG AGAGCCTCTA TGTCGCGACC AAGGTTGGCT TCACCGGCGG CGCCAGCGCG GCGAATATGC GAGCGCAGTT CGATCAGTGC CGACAGCGCC TGAAGCTCGA TCAGGTGGAT CTTCTGTATT TGCACCGGTT TGACCCTGAA ACCCCGCTCG AGGACACGCT GACCTGTTTT GCCGAACTTA AACAGGAAGG GCACATCCGC CATGTTGGCC TGTCAAACTT CGCCGCCTGG CAGGTCATGA AAGCCGTTGC TCTCGCGGCG CGACTGGGCC TCCGGATTGA CGTGCTACAG CCGATGTACT CGCTGGTGAA ACGACAGGCC GAGGTGGAAA TCCTGCCGAT GTGTGCCGAC CAGGGGATTT TGCCCGTACC CTATTCGCCG CTGGGCGGCG GGCTCTTGAC CGGGAAATAT GCGCAAGGCG GCACAGGTCG GTTGAGCGAG GATGAAAACT ATCGTGCCCG CTATGGCCAG GATTGGATGC ACCGGACAGC CTCCGATCTT CTGCACTTGG CCGAGGATCT TGGCACCGAT CCCGCGACGC TGGCAGTCGC ATGGGCCGCA GGCCACCCCG CGCGCCCGGC TCCGATCCTC TCGGCACGTT CCGCAACCCA GCTTGCGCCC TCGCTCAAGG CCACGGAATT TGACATGTCT CCAGAACTTT ATGCGCGTAT CGAAGCCCTG AGCCCCCGCC CGGCCCCCGC CACGGACCGG CTCGAAGAAG CATGA
|
Protein sequence | MIRPNSALRA PGLRVSLRRM TAQTCPPLTT LDGHDVGRFA FGCMQFGGRA DAQASAEMYE ACRAAGLRHF DTAWLYTEGA SEEILGQLIA KDRESLYVAT KVGFTGGASA ANMRAQFDQC RQRLKLDQVD LLYLHRFDPE TPLEDTLTCF AELKQEGHIR HVGLSNFAAW QVMKAVALAA RLGLRIDVLQ PMYSLVKRQA EVEILPMCAD QGILPVPYSP LGGGLLTGKY AQGGTGRLSE DENYRARYGQ DWMHRTASDL LHLAEDLGTD PATLAVAWAA GHPARPAPIL SARSATQLAP SLKATEFDMS PELYARIEAL SPRPAPATDR LEEA
|
| |
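As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates a DNA string codon by codon. It assumes the listed gene sequence is already the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. The names `CODON_TABLE` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that split the sequence into blocks of ten bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATCAGGC CAAACTCTGC GTTGCGCGCG"))
```

Passing the full gene sequence field to `translate` in the same way is expected to reproduce the protein sequence field; only the first ten codons and the last five have been checked by hand here.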