Gene TM1040_0333 details


Gene Information

Locus tag: TM1040_0333
Symbol:
ID: 4076229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 337542
End bp: 338690
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 63%
IMG OID: 638005628
Product: gamma-butyrobetaine,2-oxoglutarate dioxygenase
Protein accession: YP_612328
Protein GI: 99080174
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID: [TIGR02409] gamma-butyrobetaine hydroxylase
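
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene span includes the stop codon. The record does not state these conventions, but they are consistent with its figures. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 337542
END_BP = 338690


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate span."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose span includes one stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon is not translated


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1149
print(protein_length(length_bp))  # 382
```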


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.561528
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACAC TCTCCGTTGA CCCATCCGGA ACCTATCTGA GCCTTGAAAC CGCCACGGGT 
ACGCAGCGGT TCCATGCGAT CTGGCTGCGT GACAATGCGC AGGATGCCGA GACCCGCGCG
CCGGGGAACG GCCAGCGGTT GATCGCCCTG CGCGATATCC CCGCTGACAC AAGGATCGCC
ACCGCCCAGA TCAACCCGGA CGGGGTGGTG CAGGTGCGCT TTGCGCCAGA GGACAAGACC
GTTGACTATG ATCTTGCGTG GCTTGAGGCG CATTCCTACG ACCGCGCTGC CCCCAGCACC
CAAGGGTGGC TCGCCCAAGA AATCGAGCCA TGGGATGCGG GGCTGATGGG CAATGTGCCA
TGTGGGGATT TTGCGGCGCT CGCCGAGGGT GGTCAGGCGC TGCGTGACTG GCTTGGGCAG
ATTGTGCGCT ACGGCTTTGC CAAGCTCAAG AATGCCCCGG TGGAGCCGGG CGGGCTGTTC
AAGGTGGTCG ATCTCTTTGG CTTTGTGCGC GAGACCAACT ACGGGCGGCA TTTTGAGGTG
CGTACCGAGG TGAACCCCAC AAACCTTGCC TTCACCGGGC TCGGGCTTCA GGCGCATACG
GACAACCCCT ACCGCGATCC GGTGCCGACG CTTCAGGTGC TCTATTGCCT CGAGAGCTCT
GCCGCCGGAG GCGAGAACAT GGTGGTCGAC GGCTTTGCCG CCGCGCTGCG CCTGCGCGAA
GAAGACCCGG AGGGATTTGC GCTTTTGGCG ACGCATTGTG CAAAGTTTGA ATATGCGGGC
GAGGCGAGCG TGTGTCTGAC CTCTCGGCGT CCGATGATCG AGCTGGCACC GGATGGGGAG
CTGATCGGCA TTCGGTTCAA CAATCGCTCC TGCGCCACCA TCACGGATGT GCCCTTTGAC
AAGATGGAGG CGTATTATGC CGCCTACCGA CGCCTTGGCG AGATCATTGA TGATACGGCG
ATGGAGGTGA CGTTCCGGCT CGAGCCGGGT GAGGCCTTTG TGGTCGACAA CACCCGTGTT
CTGCACGCGC GCAAGGGCTA TTCGGGCGAG GGGACACGCT GGCTGCAAGG GTGCTACGCC
GACAAGGACG GGCTGCGCTC CGCCTATCAT GCGATGATGC GGGATGCGCG CGTGGAGGCT
GCGGAATGA
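
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any analysis. A minimal sketch of that cleanup, with a GC-fraction helper that could be compared against the GC content field, is given below. The function names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCGACAC TCTCCGTTGA CCCATCCGGA ACCTATCTGA GCCTTGAAAC CGCCACGGGT"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this row only
```

Passing the full 1149 bp sequence to `gc_fraction` gives a value for the whole gene rather than a single row.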
 
Protein sequence
MPTLSVDPSG TYLSLETATG TQRFHAIWLR DNAQDAETRA PGNGQRLIAL RDIPADTRIA 
TAQINPDGVV QVRFAPEDKT VDYDLAWLEA HSYDRAAPST QGWLAQEIEP WDAGLMGNVP
CGDFAALAEG GQALRDWLGQ IVRYGFAKLK NAPVEPGGLF KVVDLFGFVR ETNYGRHFEV
RTEVNPTNLA FTGLGLQAHT DNPYRDPVPT LQVLYCLESS AAGGENMVVD GFAAALRLRE
EDPEGFALLA THCAKFEYAG EASVCLTSRR PMIELAPDGE LIGIRFNNRS CATITDVPFD
KMEAYYAAYR RLGEIIDDTA MEVTFRLEPG EAFVVDNTRV LHARKGYSGE GTRWLQGCYA
DKDGLRSAYH AMMRDARVEA AE
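
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds a codon table in the usual TCAG ordering and translates until the first stop codon. It is an illustration under stated assumptions and is not the database's own pipeline. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons. Because this gene begins with ATG, alternative starts are not handled here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... (bases in TCAG order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence, copied from the record.
print(translate("ATGCCGACAC TCTCCGTTGA CCCATCCGGA"))  # MPTLSVDPSG
print(translate("GCGGAATGA"))                         # AE
```

The two outputs match the first ten and last two residues of the protein sequence listed in the record.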