Gene Information |
Locus tag | Rsph17025_0752 |
Symbol | |
ID | 5083516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 761894 |
End bp | 762853 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640482310 |
Product | taurine dioxygenase |
Protein accession | YP_001166963 |
Protein GI | 146276804 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
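The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes 1-based inclusive coordinates, which is an assumption, though End bp minus Start bp plus 1 does equal the listed gene length. It also assumes the gene length includes one terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 761894
END_BP = 762853


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 960
print(protein_length(length_bp))  # 319
```

Both results agree with the Gene Length (960 bp) and Protein Length (319 aa) fields.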
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCGA TTTCTCAGGA TGTGCCCAAC ATCCCCCACA GCGACGTGAC CCCCCTCGCA GGCCGCGTGG GGGCCATCGT CTCGAACATC CGCCTGTCGG GCGATCTTCC CGACAGCACC ATCGCCCGGC TGGAGCAGCT GCTTCGCCTT CACAAGGTGC TGTTCTTTCG CGACCAGTCC CATCTTGACG ACGCCGAACA GGAACGGTTC GGCGCCCGCT TCGGAGAGCC CTTTGCCCAT CCGACGCAGG GTGCGCTGAG CGGGACAGCC TCGGTCCTGG ACCTCGACAC CCGTCGGGAC CGGGAGCCGA AGGCGGGCGA AGCCGGCGGC GCGCGGGCCG ATCAGTGGCA CACCGACATC ACCTTCGTCG AAGCCTATCC GCGCATCACG ATCCTGCGCA GCGTCGTGGC CCCCGCCTCG GGCGGGGACA CGGTGTTTTC GAACACCGTG GCGGCCTATG AATCACTGCC CGAGCCGTTG AAGGCGCTGG CCGACCGGCT TTGGGCGGTT CATTCGAACG CTTACGATTA TGCGGCGGTG CGCCCGCACG CGACGGCCGA TGAACAGAAG CAGTTCGCGC GCCAGTTCAC GTCGACCGTC TTCGAGACCG AACATCCGGT GGTGCGCGTG CTGCCCTCGG GCGAGCGGAC GCTTCTGCTG GGCAACTTCG TCCAGCGGTT CACGGGGATC GCGCGGGCCG ACTTCCAGAA GCTGTTCGCC CTGTTTCAGG ACCACATCCA GGCGCAGGAA AACACCGTGC GATGGCGCTG GCAGGCCGGC GACGTGGCGC TTTGGGACAA CACCGCAACC CAGCATTACG CGGTGAACGA TTACGGCGAC CAGCACCGGG TGGTGCGGCG CGTGACCATC GCGGGCGACG TGCCCGTGTC GGTCGATGGT AAGCGGAGCG TGGCGCGCAG CCGCTTCGAG AAACCCGACG CGGCGATTGC CGCGGAATAG
|
Protein sequence | MTAISQDVPN IPHSDVTPLA GRVGAIVSNI RLSGDLPDST IARLEQLLRL HKVLFFRDQS HLDDAEQERF GARFGEPFAH PTQGALSGTA SVLDLDTRRD REPKAGEAGG ARADQWHTDI TFVEAYPRIT ILRSVVAPAS GGDTVFSNTV AAYESLPEPL KALADRLWAV HSNAYDYAAV RPHATADEQK QFARQFTSTV FETEHPVVRV LPSGERTLLL GNFVQRFTGI ARADFQKLFA LFQDHIQAQE NTVRWRWQAG DVALWDNTAT QHYAVNDYGD QHRVVRRVTI AGDVPVSVDG KRSVARSRFE KPDAAIAAE
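The sequences above are displayed in space-separated blocks of 10 characters. The following sketch shows one way to strip that grouping, translate a coding sequence, and compute GC content. It is a minimal illustration under stated assumptions: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, the codon assignments are those of the standard genetic code (translation table 11 uses the same amino-acid assignments for elongation codons), and alternative start codons permitted by table 11 are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; it differs in allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display grouping (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGACTGCGA TTTCTCAGGA TGTGCCCAAC"
print(translate(first_blocks))  # MTAISQDVPN
```

The printed residues match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.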