Gene Information |
Locus tag | Rleg2_5751 |
Symbol | |
ID | 6977141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 154475 |
End bp | 155404 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393207 |
Product | Taurine dioxygenase |
Protein accession | YP_002278025 |
Protein GI | 209546135 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
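
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information table.
start_bp = 154475
end_bp = 155404
gene_length_bp = 930
protein_length_aa = 309


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Compare the derived values with the ones reported in the record.
print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions, both derived values agree with the reported 930 bp and 309 aa.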
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.278464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC CGGTTCTCGT CAATCAGATC ATTCCCGAGT CCGATGTCAT CCCGCTGACC GGCCGTGTCG GGGCCGAAAT CAAAGGTGTC CGCCTTGGCG GAGACCTTTC GGATGCAACG GTGGCAGCCA TCAACCAGCT TCTTCTGAAG CACAAGGTGA TCTTCTTCCG CGATCAGGGA CATCTTGAGG ATTCCGAACA GGAAGCCTTC GCGCGCCGCC TCGGCGATCT CGTGCCGCAT CCGACCCAGG GACCGGTCGC CGGCACGGCT TCCATCCTCA ATCTCGATTC CAGCCGCGGC GGCGGCCGGG CGGACCAGTG GCACACTGAC GTAACCTTCG TCGACGCCTA TCCCAAATTT TCCGTCCTGC GCGGCGTCGT CATTCCGGCA GCCGGCGGTG ACACCATCTG GTCCAACACC CATGCCGCCT ATGAAAGCCT GCCGGCGCCG CTGAAACTGC TGGCGGAGAA TTTGTGGGCC ATTCACAGCA ACGCCTATGA CTATGCCGCC GTGCGCCCTC GCGCCACCGC TGAAGAGAAG AAGCATTTCG AGGAGGTTTT CACCTCGACC ATCTACGAGA CCGAACATCC GGTCGTGCGT GTCCATCCGG AAACCGGCGA GAGATCGCTG CTGCTCGGCA ATTTCGTTCA GCGCCTTGTC GGCCTGTCGA AGAGCGACTC CGCCAAACTC TACGAGGTAT TCCAATCCTA CGTCACTGCC CCGGAAAATA CCGTGCGCTG GCGCTGGAGA GCAGGAGATG TGGCGATCTG GGACAACCGC GCCACTCAGC ACTATGCCGT CAACGATTAT GGCGACCAGC ACCGCGTTGT CCGGCGCGCC ACCGTCGACG GCGACGTCCC CGTCAGCATC GACGGCCGCC GCAGCATAAC CCACGTCAAA ACTGCCAAGC CGCAGGCAAA GGCGGCGTGA
|
Protein sequence | MSNPVLVNQI IPESDVIPLT GRVGAEIKGV RLGGDLSDAT VAAINQLLLK HKVIFFRDQG HLEDSEQEAF ARRLGDLVPH PTQGPVAGTA SILNLDSSRG GGRADQWHTD VTFVDAYPKF SVLRGVVIPA AGGDTIWSNT HAAYESLPAP LKLLAENLWA IHSNAYDYAA VRPRATAEEK KHFEEVFTST IYETEHPVVR VHPETGERSL LLGNFVQRLV GLSKSDSAKL YEVFQSYVTA PENTVRWRWR AGDVAIWDNR ATQHYAVNDY GDQHRVVRRA TVDGDVPVSI DGRRSITHVK TAKPQAKAA
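
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons; alternative start codons are not given special treatment here, which is harmless for this gene because it begins with ATG. All function names are hypothetical helpers, not part of any existing library.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same codon-to-amino-acid mapping for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAGCAATC CGGTTCTCGT CAATCAGATC"
print(translate(first_blocks))
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content listed in the record.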