Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2006 |
Symbol | |
ID | 4077463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2111425 |
End bp | 2112573 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638007321 |
Product | gamma-butyrobetaine,2-oxoglutarate dioxygenase |
Protein accession | YP_614000 |
Protein GI | 99081846 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.949377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGATC TCAGCTCTCC CCCCACGGTT GCGACCATTG CGGGCCTTGG TGACAAAGGT CTCGATATTA CCCTGGCCGA CGGCGCGACC CATTATTTCA ATTACTACTG GCTGCGCGAC AACTGCCCCA GCTCCTTTAG CGCCATGACC CGCGAGCGCA GCTTTGACAT CTTTCATCTG GAGACCGCCC CACGCGCAAG AACGGCCGAG ATTGACGGGG ACGCGCTGGT GATCGACTGG CAGGACGAAG ACCACATCAC CCGCATGCCG CTCTCTTGGC TCAATGCCTA TGCGGGTGGG CAGCGCCGCC CCGACCCAGC CGATCTGTCG CGCGTGGCCT GGTTTGGCGA TCACTACCCA TCGGTGCCGC GGTTCTCGCA GCCCGATCTG GTCTCGGATG ACGCGACCCG CGCCAAATGG ATCGAGGCGA TGCTGGTGCA TGGTTTCACG ATCGTGACCG ACATGCCCGA CAGCGATGCG GCGCTCACCC AGACGGCAGA GCTCATGGGC TTTGTGCGGC CCACCTTCTT TGGCACCTAT TTTGATGTCA AAACCCACAT CAACCCCACC AATACCGCCT ATACTGCGGG CGCACTAGAG CTGCACACCG ACACCCCGGC CGAGGAATTT GCGCCGGGTA TCCAGTTCCT CCATTGCCGC ATCAACACGG TTGACGGTGG CGAGAGCCTC TATGCCGATG GGGTGGCGGT GGCCAATGAC TTTCGCAAGC GCGACCCAGA GGGCTTCAGG CTTCTCAGCG AAGTGCCGAT CCCGTTTTAC TGCGAACACG ACACTTATGA TGCGCGCTCG CGCCAATATG TGATCGAGCT GGATCAACAC GGCGAAGTCG AGGGGCTCAC GATCAGTCAG CATATGGCCG ATATTTTCGA CCTCGATCAG AAACTGCTCG ATGACTACTA CCCCGCGTTC TGCCGCTTTG GTCGGATGCT GCAGGAAGAG AAATACATGA TGCGCTTTTT GATGAAGGGC GGTGAATGCA TGGTCTTTGA CAACCATCGC ATCGTGCATG GCCGCGCCGC CTATACCGCC TCCAGTGGTG ACCGGTATCT GCGCGGCTGC TACGTGGATC GCTCCGAGAT GCGCTCCACC TATCGTGCAT TGGTCAGCGA AGGACGGTTC AAGGCATGA
|
Protein sequence | MNDLSSPPTV ATIAGLGDKG LDITLADGAT HYFNYYWLRD NCPSSFSAMT RERSFDIFHL ETAPRARTAE IDGDALVIDW QDEDHITRMP LSWLNAYAGG QRRPDPADLS RVAWFGDHYP SVPRFSQPDL VSDDATRAKW IEAMLVHGFT IVTDMPDSDA ALTQTAELMG FVRPTFFGTY FDVKTHINPT NTAYTAGALE LHTDTPAEEF APGIQFLHCR INTVDGGESL YADGVAVAND FRKRDPEGFR LLSEVPIPFY CEHDTYDARS RQYVIELDQH GEVEGLTISQ HMADIFDLDQ KLLDDYYPAF CRFGRMLQEE KYMMRFLMKG GECMVFDNHR IVHGRAAYTA SSGDRYLRGC YVDRSEMRST YRALVSEGRF KA
|
| |