Gene TM1040_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0052 
Symbol 
ID4078715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp54785 
End bp55993 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content59% 
IMG OID638005339 
Productgamma-butyrobetaine,2-oxoglutarate dioxygenase 
Protein accessionYP_612047 
Protein GI99079893 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID[TIGR02409] gamma-butyrobetaine hydroxylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.543407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGTG GCCACCACCG GGTCGTGCCG CTGTGCGCTA AGGCGCAGAC CGCTCTTACA 
CACGACATAT CTAACGCCAG GAGGTTTCTC ATGGCACAGG CCGCTTTGCA GCCACAGAAC
GCTTGCGTTC TCCTCTCGTT TTCCGACGGA ACAACGGCGC AGTACCCCTA TATCTGGCTG
CGCGACAACG ACCCGGAAGG GTTTCACCCT GACACGCAGG AACGGATCAC CGATCTTTCT
GCAATATCGC CAGACATTAC GGTGGCAGAT GTCGAGCTGA ACGACTCTCA GCTTCTCATC
CACTGGGAAG GCGCTGATTC CGCCACCAGC CGCTTTGACC TTGATTGGTT GCGCAGCTAT
GTGCCGGGCA CACGCACTGC GGACCCCGCC CGCACCGGGT TTCAGCACTG GCGCTGCGAC
CTGGGCGCAG GTGGGATTCC GCGCGCCACA GCACAAGAGA TCCTGAGCTC AGATCTTGCC
CTGCGGACAT GGCTGGAACA GACCCAAATC TATGGGATCT CCATCGTCGA GGGGCTTGCG
GACAGCACCG AGGCGGGCAT GGATGTGGCA CGCCGTATCG GTTTTTTGCG CCAAACCAAC
TTTGGCGTGA CCTTCGAGGT CAAATCCAAA CCCAACCCCA ACAATCTCGC CTATACCCCG
ATCGCGCTGC CCCTGCATAC GGATCTGACC AACCAGGAAT TGCCGCCCGG GTTTCAGTTC
CTGCACTGTC TTGCGAACGA GGCCAGGGGC GGTGGTTCTC TGTTTTGCGA TGGATATGCC
ATTGCCGAGG ACCTGCGCCG GGATGATCCC GAGAGTTTTG AGCTTCTATC GACCGTCTCG
GTGCCGTTTC GGTTCCACGA TCAGGACACC GACATCCGAA ACCGCAAAAA GGTCATCACG
CTGGATGAGG ACGGGCGCGT GATCGAGATC TGTTTCAATG CCCATTTGGC GGATATCTTT
GACCTAGAGC CCGCGCTGAT GCAGCGCTAC TACCGCGCAT ACCGGAAATT CATGATCCTG
ACGCGCTCAA CCAACTACCT CGTGACGCTC AAGCTCAAAG GTGGCGAGAT GGTTGTGTTT
GACAACAGGC GTGTCCTGCA TGGCCGCGAG GCCTTTGATC CTCAGACCGG GTATCGGCAC
TTGCACGGAT GCTATGTGGA CCGCGGCGAG TTCGAGAGCC GACTGCGCGT TCTGCATCGC
GGGCAGTGA
 
Protein sequence
MTGGHHRVVP LCAKAQTALT HDISNARRFL MAQAALQPQN ACVLLSFSDG TTAQYPYIWL 
RDNDPEGFHP DTQERITDLS AISPDITVAD VELNDSQLLI HWEGADSATS RFDLDWLRSY
VPGTRTADPA RTGFQHWRCD LGAGGIPRAT AQEILSSDLA LRTWLEQTQI YGISIVEGLA
DSTEAGMDVA RRIGFLRQTN FGVTFEVKSK PNPNNLAYTP IALPLHTDLT NQELPPGFQF
LHCLANEARG GGSLFCDGYA IAEDLRRDDP ESFELLSTVS VPFRFHDQDT DIRNRKKVIT
LDEDGRVIEI CFNAHLADIF DLEPALMQRY YRAYRKFMIL TRSTNYLVTL KLKGGEMVVF
DNRRVLHGRE AFDPQTGYRH LHGCYVDRGE FESRLRVLHR GQ