Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_3171 |
Symbol | |
ID | 4028638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3536168 |
End bp | 3537463 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637968385 |
Product | taurine catabolism dioxygenase TauD/TfdA |
Protein accession | YP_575214 |
Protein GI | 92115286 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | [TIGR02409] gamma-butyrobetaine hydroxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAGGCC CATGTTCCCC TGCCCAGGAG GCCGCCATGA CATTCTCTCC CCCCACGACC GCCCCGCTCG ACGCCTTGCC GGCGACACCG GACTACGATG CCTGGCCTAT CGAGGTCGCC ATCCAGCACG TGAGCCACAC TCCCCGCCAG GTCGAAGTGC GCTGGGAAGA CGGTCGCGTC AGTCGATACC ACAGCATCTG GCTGCGCGAA AACGCCGCCG ACGACAGCAC CGTCAATCCG GCCACCCGCG AGCGCATCCT CGACCTGTCA CGCCTCGGGG CGTGGCCCAC GGTCAGCGAG GCGCGCCTCG ACGACGCCGG TGCGCTGGAG ATCGTCTTCG CCCCGGAGCA ACGCCGCCTG CGATTTCATC CCGGCTGGCT GCGAGCCCAC GATTACGCCA ACCTGGACAC GCCAGAAGCG CCACTGGTCC CCACGACGCT GTGGAAAGGC AGCGAGCGCG ACGCGCCCAC CACTCTCGAT GCGCATGACT GGCTAAGCGC CGACGACGAT CCCCTGGCAC CGGACGCCTG CCTCGAAAGC GCCCTCGAGG CCGTCATCGG CGAAGGCCTG GTGAGGCTAC GCAATCTGCC CACCGACCCC GGCAGCCTGG ATGCCATCGC CCGGCGCATC GGCCCGCCGC GCACCACCAA CTTCGGCACG CTGTTCGACG TGCGTGCCAA GCCCGACCCC GACTCCAATG CCTACACCTC GATCGCCCTG CCACCGCACG TCGATCTGCC GACCCGCGAG TACCAGCCCG GTCTGCAGTT GCTGCATTGC CTGGAAAACG ACACCGTCGG CGGCGATGCC GTGATGATGG ACGGTTTCGC CGTGGCCGAG GCCCTGCGCG AGCGTCATCC CGAACATTTC GCGACGCTCA CGCGCGTGCG CTGGTGCTAC GCCAACACGG CCCGCACCAC CGACCACGTA TGGTTCGACC CGATGATCAA GCTGGACGCC AACGGCCACT TCGACGAGGT ACGCATCGCC GACTTTCTGC GCGGCCCGCT GATGGCGCCG TTCGAGGACG TGGAGCCTGC CTATGCCGCC TTGATGGCGC TGCAGCGCCT GCTGCGCGAG CCGGAGTTTG CGTTGCGTTT CAGTTACGCG CCCGGCGACA TGGTGATCTT CGACAACCGC CGCTTGTTGC ACGCCCGCGA TGCCTTCGAT GTCGGCCAGG GCGGGCGCCG CTGGCTGCAG GGCTGCTACC TGGAGCGCGA CGAGGCCCGC TCGCGGCTGC GCATGTTGCG CCGTGCCCAC CGCCGGCAAT GCGTCGACGC CCTCGCGCAC GGCTGA
|
Protein sequence | MLGPCSPAQE AAMTFSPPTT APLDALPATP DYDAWPIEVA IQHVSHTPRQ VEVRWEDGRV SRYHSIWLRE NAADDSTVNP ATRERILDLS RLGAWPTVSE ARLDDAGALE IVFAPEQRRL RFHPGWLRAH DYANLDTPEA PLVPTTLWKG SERDAPTTLD AHDWLSADDD PLAPDACLES ALEAVIGEGL VRLRNLPTDP GSLDAIARRI GPPRTTNFGT LFDVRAKPDP DSNAYTSIAL PPHVDLPTRE YQPGLQLLHC LENDTVGGDA VMMDGFAVAE ALRERHPEHF ATLTRVRWCY ANTARTTDHV WFDPMIKLDA NGHFDEVRIA DFLRGPLMAP FEDVEPAYAA LMALQRLLRE PEFALRFSYA PGDMVIFDNR RLLHARDAFD VGQGGRRWLQ GCYLERDEAR SRLRMLRRAH RRQCVDALAH G
|
| |