Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1876 |
Symbol | |
ID | 5733765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2232323 |
End bp | 2233372 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279020 |
Product | taurine catabolism dioxygenase TauD/TfdA |
Protein accession | YP_001544647 |
Protein GI | 159898400 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAT CAGACGATAA AAAGCCAAGC CTCAAAGGCC AATTAGGTGC AGGTCGCCGT TCTGTGAGCG TTTCCCAAGA AAGTCTGATC AAATCGCAGC CTTTGTTGAG CGATAGTGCC TTGCCACTGC TGGTCACTCC CGCTGTCGAA GGCCTGAATT TGATCACCTG GGCTAATAGC CATCGCCCAT TTATCGAAAC CAATTTGGCC CAGCATGGCG GCATTTTATT CCGCAACTTC AACATTAAAA CCCTCGAAGA ATTCCAACGT TTTGTTAGCG AAATGTCGAA TGAGCTTTTA GAATATACCT ATCGTTCAAC GCCGCGCAGC AAAGTCAGCG GCAATATTTA TACCTCAACT GAATACCCCG CCGATCAATC GATTCCATTG CATAACGAAA TGTCGTATAC CACCAGCTGG CCGATGAAAA TTTGGTTTTG CTGTTTGATT GCTCCCCAAC AACAAGGCGA AACCCCGATT GCCGATAGCC GCCGCATCTA TCAACGGCTT GATCCAGCGA TTCGCGATCG TTTTGCTGAG AAAAAAGTGA TGTATGTGCG CAACTATGGC GAGGGCATCG ATCTTTCATG GGAGAACGTG TTCCAAACCG ATAATAAAGC TGACGTTGAA GAATTTTGCC GACTTAACCA GATTGATTTT GAGTGGAAGA GCGGCAATCG TTTGCGCACG CGCCAAGTCT GCCAAGCTGT TGCCAAGCAT CCCAAAACCA ACGAAATGGT TTGGTTCAAT CAGGCCCATC TCTTCCATGT GACCAGTCTG CCGGCGGCTG TGCGCGATAT GTTGCTGGCA GAATTTAACG ATGAAGATTT GCCACGTAAC ACCTACTATG GCGATGGTTC GCCGATCGAA CCAGAGGTTT TAGCCGAAAT TCGCCATGTG CTGGATCAAG AAACTGTGAT GTTTCCATGG CAAGAAGGCG ATGTGCTGAT GCTCGACAAT ATGTTGGTGG CTCATGCTCG CTCGCCTTTT GTTGGCCCAC GCAAAATTGT GGTGGGTATG GCCGAATCGG TTGATGCAGC GGCGATCTAA
|
Protein sequence | MTTSDDKKPS LKGQLGAGRR SVSVSQESLI KSQPLLSDSA LPLLVTPAVE GLNLITWANS HRPFIETNLA QHGGILFRNF NIKTLEEFQR FVSEMSNELL EYTYRSTPRS KVSGNIYTST EYPADQSIPL HNEMSYTTSW PMKIWFCCLI APQQQGETPI ADSRRIYQRL DPAIRDRFAE KKVMYVRNYG EGIDLSWENV FQTDNKADVE EFCRLNQIDF EWKSGNRLRT RQVCQAVAKH PKTNEMVWFN QAHLFHVTSL PAAVRDMLLA EFNDEDLPRN TYYGDGSPIE PEVLAEIRHV LDQETVMFPW QEGDVLMLDN MLVAHARSPF VGPRKIVVGM AESVDAAAI
|
| |