Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4366 |
Symbol | |
ID | 8015140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4493783 |
End bp | 4494712 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826942 |
Product | Taurine dioxygenase |
Protein accession | YP_002978144 |
Protein GI | 241207048 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.105088 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC CAGTTCTCGT CAATCAGATC ATTCCCGAGT CCGATGTCGT GCCGCTGACC GGCCGCGTCG GAGCCGAAAT CAGGGGTATT CGCCTCGGCG GAGACCTTTC CGACGCGACG GTCGCAGCCA TCAACCAGCT TCTCCTGAAA CACAAAGTCA TCTTTTTCCG CGATCAGGAC CATCTTGGCG ATTCCGAACA GGAATCGTTC GCGCGCCGCC TGGGCGACCT TGTGCCTCAT CCAACTCAGG GTCCGGTGGC CGGCACGGCT TCCATCCTCA ATCTCGATTC CAGCCGCGGA GGCGGCAGGG CGGACCAGTG GCACACCGAC GTCACCTTCG TGGATGCCTA TCCCAAATTC TCGGTCCTTC GCGGCGTCGT CATTCCGGCG GCTGGAGGTG ATACGATCTG GTCCAACACC CATGCCGCAT ACGAAAGTCT GCCAGCATCG CTCAAATTGC TGGCGGACAA TTTGTGGGCC ATTCACAGCA ATGCCTATGA CTACGCAGCC GTGCGCCCTC GCGCCACCGC TGACGAGAAG AAGCATTTCG AGGAAGTTTT CACGTCGACC ATCTACGAGA CCGAGCATCC AGTCGTGCGT GTCCATCCCG AAACCGGCGA AAGATCGCTG CTGCTCGGCA ATTTCGTTCA GCGTCTCGTC GGCTTGTCGA AGAGCGACTC CGCAAAACTC TACGAGGTGT TCCAGTCCTA TGTTACCGCG CCGGAAAATA CCGTGCGCTG GCACTGGAGA GCCGGTGACG TCGCAATCTG GGATAATCGC GCGACCCAGC ACTACGCTGT CAACGACTAC GGCGACCAGC ACCGGGTCGT GCGCCGCGCC ACCGTTGACA GCGACGTCCC CGTCAGCGTC GACGGCCGCC GCAGCGTAAC CCACGTCAAG GTCGCCAAGC CGAAAGCAAA GGCCGCGTGA
|
Protein sequence | MSNPVLVNQI IPESDVVPLT GRVGAEIRGI RLGGDLSDAT VAAINQLLLK HKVIFFRDQD HLGDSEQESF ARRLGDLVPH PTQGPVAGTA SILNLDSSRG GGRADQWHTD VTFVDAYPKF SVLRGVVIPA AGGDTIWSNT HAAYESLPAS LKLLADNLWA IHSNAYDYAA VRPRATADEK KHFEEVFTST IYETEHPVVR VHPETGERSL LLGNFVQRLV GLSKSDSAKL YEVFQSYVTA PENTVRWHWR AGDVAIWDNR ATQHYAVNDY GDQHRVVRRA TVDSDVPVSV DGRRSVTHVK VAKPKAKAA
|
| |