Gene Information |
Locus tag | TM1040_0052 |
Symbol | |
ID | 4078715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 54785 |
End bp | 55993 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005339 |
Product | gamma-butyrobetaine,2-oxoglutarate dioxygenase |
Protein accession | YP_612047 |
Protein GI | 99079893 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | [TIGR02409] gamma-butyrobetaine hydroxylase |
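
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS includes its stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(54785, 55993)
print(length)                  # 1209
print(protein_length(length))  # 402
```

Both values agree with the Gene Length (1209 bp) and Protein Length (402 aa) rows of this record.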
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.543407 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGGTG GCCACCACCG GGTCGTGCCG CTGTGCGCTA AGGCGCAGAC CGCTCTTACA CACGACATAT CTAACGCCAG GAGGTTTCTC ATGGCACAGG CCGCTTTGCA GCCACAGAAC GCTTGCGTTC TCCTCTCGTT TTCCGACGGA ACAACGGCGC AGTACCCCTA TATCTGGCTG CGCGACAACG ACCCGGAAGG GTTTCACCCT GACACGCAGG AACGGATCAC CGATCTTTCT GCAATATCGC CAGACATTAC GGTGGCAGAT GTCGAGCTGA ACGACTCTCA GCTTCTCATC CACTGGGAAG GCGCTGATTC CGCCACCAGC CGCTTTGACC TTGATTGGTT GCGCAGCTAT GTGCCGGGCA CACGCACTGC GGACCCCGCC CGCACCGGGT TTCAGCACTG GCGCTGCGAC CTGGGCGCAG GTGGGATTCC GCGCGCCACA GCACAAGAGA TCCTGAGCTC AGATCTTGCC CTGCGGACAT GGCTGGAACA GACCCAAATC TATGGGATCT CCATCGTCGA GGGGCTTGCG GACAGCACCG AGGCGGGCAT GGATGTGGCA CGCCGTATCG GTTTTTTGCG CCAAACCAAC TTTGGCGTGA CCTTCGAGGT CAAATCCAAA CCCAACCCCA ACAATCTCGC CTATACCCCG ATCGCGCTGC CCCTGCATAC GGATCTGACC AACCAGGAAT TGCCGCCCGG GTTTCAGTTC CTGCACTGTC TTGCGAACGA GGCCAGGGGC GGTGGTTCTC TGTTTTGCGA TGGATATGCC ATTGCCGAGG ACCTGCGCCG GGATGATCCC GAGAGTTTTG AGCTTCTATC GACCGTCTCG GTGCCGTTTC GGTTCCACGA TCAGGACACC GACATCCGAA ACCGCAAAAA GGTCATCACG CTGGATGAGG ACGGGCGCGT GATCGAGATC TGTTTCAATG CCCATTTGGC GGATATCTTT GACCTAGAGC CCGCGCTGAT GCAGCGCTAC TACCGCGCAT ACCGGAAATT CATGATCCTG ACGCGCTCAA CCAACTACCT CGTGACGCTC AAGCTCAAAG GTGGCGAGAT GGTTGTGTTT GACAACAGGC GTGTCCTGCA TGGCCGCGAG GCCTTTGATC CTCAGACCGG GTATCGGCAC TTGCACGGAT GCTATGTGGA CCGCGGCGAG TTCGAGAGCC GACTGCGCGT TCTGCATCGC GGGCAGTGA
Protein sequence | MTGGHHRVVP LCAKAQTALT HDISNARRFL MAQAALQPQN ACVLLSFSDG TTAQYPYIWL RDNDPEGFHP DTQERITDLS AISPDITVAD VELNDSQLLI HWEGADSATS RFDLDWLRSY VPGTRTADPA RTGFQHWRCD LGAGGIPRAT AQEILSSDLA LRTWLEQTQI YGISIVEGLA DSTEAGMDVA RRIGFLRQTN FGVTFEVKSK PNPNNLAYTP IALPLHTDLT NQELPPGFQF LHCLANEARG GGSLFCDGYA IAEDLRRDDP ESFELLSTVS VPFRFHDQDT DIRNRKKVIT LDEDGRVIEI CFNAHLADIF DLEPALMQRY YRAYRKFMIL TRSTNYLVTL KLKGGEMVVF DNRRVLHGRE AFDPQTGYRH LHGCYVDRGE FESRLRVLHR GQ
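
The sequence fields are printed in space-separated groups of ten characters, so the grouping has to be removed before any length or composition check. The following is a minimal sketch under that assumption. `ungroup` and `gc_fraction` are hypothetical helper names, and only the first two groups of the gene sequence are used as sample input. How the database rounds its GC content value is not stated in this record.

```python
def ungroup(seq_field: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(seq_field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two groups of the gene sequence from this record.
head = ungroup("ATGACTGGTG GCCACCACCG")
print(len(head))          # 20
print(gc_fraction(head))  # 0.65
```

Applied to the full Gene sequence field, the same two calls can be compared with the Gene Length and GC content rows, and `len(ungroup(...))` on the Protein sequence field can be compared with the Protein Length row.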