Gene TM1040_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2006 
Symbol 
ID4077463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2111425 
End bp2112573 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content59% 
IMG OID638007321 
Productgamma-butyrobetaine,2-oxoglutarate dioxygenase 
Protein accessionYP_614000 
Protein GI99081846 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.949377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATC TCAGCTCTCC CCCCACGGTT GCGACCATTG CGGGCCTTGG TGACAAAGGT 
CTCGATATTA CCCTGGCCGA CGGCGCGACC CATTATTTCA ATTACTACTG GCTGCGCGAC
AACTGCCCCA GCTCCTTTAG CGCCATGACC CGCGAGCGCA GCTTTGACAT CTTTCATCTG
GAGACCGCCC CACGCGCAAG AACGGCCGAG ATTGACGGGG ACGCGCTGGT GATCGACTGG
CAGGACGAAG ACCACATCAC CCGCATGCCG CTCTCTTGGC TCAATGCCTA TGCGGGTGGG
CAGCGCCGCC CCGACCCAGC CGATCTGTCG CGCGTGGCCT GGTTTGGCGA TCACTACCCA
TCGGTGCCGC GGTTCTCGCA GCCCGATCTG GTCTCGGATG ACGCGACCCG CGCCAAATGG
ATCGAGGCGA TGCTGGTGCA TGGTTTCACG ATCGTGACCG ACATGCCCGA CAGCGATGCG
GCGCTCACCC AGACGGCAGA GCTCATGGGC TTTGTGCGGC CCACCTTCTT TGGCACCTAT
TTTGATGTCA AAACCCACAT CAACCCCACC AATACCGCCT ATACTGCGGG CGCACTAGAG
CTGCACACCG ACACCCCGGC CGAGGAATTT GCGCCGGGTA TCCAGTTCCT CCATTGCCGC
ATCAACACGG TTGACGGTGG CGAGAGCCTC TATGCCGATG GGGTGGCGGT GGCCAATGAC
TTTCGCAAGC GCGACCCAGA GGGCTTCAGG CTTCTCAGCG AAGTGCCGAT CCCGTTTTAC
TGCGAACACG ACACTTATGA TGCGCGCTCG CGCCAATATG TGATCGAGCT GGATCAACAC
GGCGAAGTCG AGGGGCTCAC GATCAGTCAG CATATGGCCG ATATTTTCGA CCTCGATCAG
AAACTGCTCG ATGACTACTA CCCCGCGTTC TGCCGCTTTG GTCGGATGCT GCAGGAAGAG
AAATACATGA TGCGCTTTTT GATGAAGGGC GGTGAATGCA TGGTCTTTGA CAACCATCGC
ATCGTGCATG GCCGCGCCGC CTATACCGCC TCCAGTGGTG ACCGGTATCT GCGCGGCTGC
TACGTGGATC GCTCCGAGAT GCGCTCCACC TATCGTGCAT TGGTCAGCGA AGGACGGTTC
AAGGCATGA
 
Protein sequence
MNDLSSPPTV ATIAGLGDKG LDITLADGAT HYFNYYWLRD NCPSSFSAMT RERSFDIFHL 
ETAPRARTAE IDGDALVIDW QDEDHITRMP LSWLNAYAGG QRRPDPADLS RVAWFGDHYP
SVPRFSQPDL VSDDATRAKW IEAMLVHGFT IVTDMPDSDA ALTQTAELMG FVRPTFFGTY
FDVKTHINPT NTAYTAGALE LHTDTPAEEF APGIQFLHCR INTVDGGESL YADGVAVAND
FRKRDPEGFR LLSEVPIPFY CEHDTYDARS RQYVIELDQH GEVEGLTISQ HMADIFDLDQ
KLLDDYYPAF CRFGRMLQEE KYMMRFLMKG GECMVFDNHR IVHGRAAYTA SSGDRYLRGC
YVDRSEMRST YRALVSEGRF KA