Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0955 |
Symbol | |
ID | 8534097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1025227 |
End bp | 1026138 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 646383336 |
Product | Taurine catabolism dioxygenase TauD/TfdA |
Protein accession | YP_003262840 |
Protein GI | 261855557 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.3545 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGGTC TCGAAATGAA AAATAACGTA GCTGTTATAA GTCGCCAGGT TACGGAAATG ACAGATGATG AAATTCATAA CCTAAAGAAA ATTGTATTTG ATTCGGGAAT TGTTGTTTTA AAGGCACAGA ATGCGACAGC CTCTGATTTT GTGGATTTTG GTCGACGTAT TGGGGAGCTA AGTCCTTATT ACGAAGAGAT GTATCATCAC CCTAATCACA AGGAACTCTT CGTTTCATCT AATGTCCAGA CAGATGGGAA AGTGATTGGG GTGCCGAGAA CAGGGAAATT CTGGCATGCA GACTATGCCT TCATGGCAAA ACCCTTTGCA TTCACGATTA CCTATCCACA AGTAGTTTCG TCTCAAGAGC GAGGAACTTA TTTTATTGAT ATGGCCAGCG CTTATGAACG GCTTTCACCC GAGATGAAAA GGAAAATCGA AGGTGGGGTT GGTACGCACT CTGTTCGCCG GTATTTTAAG ATCAGACCAA CAGATGTGTA TCGACCCATT AGTGAGATTC TACATGAAAT TGATGCAAAG ACCCCAACGG TAACACACCC ACTTGTTGTT AATCATCCGG TAACGGGTGC GAAGATTTTA TATGTTAGTC GTGGATTCAC TGAAACTATA AGCCTAAAGG ATGATGACTT GGATGCAGAT GAAGTGTTGA AAGATTTACT TGTTGAGTCA GGGCAGTCTG ACGATACGTT TACTCACCCG GATATTCGCC AGATAAATAT TAACGAAGGT GATATTTTTT TGTGGGATAA CCGTCGTTAT GTTCATCATG CAAAGCATAA CGATAAGGTT GAGCCGACTA AGACTTACAG ACTTACCGCT TATGATGGAC TGCCGTTCAG TGCAGAGATT GACTTCGCAC TTGATAGTGT GAAGGAGGTG GGACTTGTCT AG
|
Protein sequence | MKGLEMKNNV AVISRQVTEM TDDEIHNLKK IVFDSGIVVL KAQNATASDF VDFGRRIGEL SPYYEEMYHH PNHKELFVSS NVQTDGKVIG VPRTGKFWHA DYAFMAKPFA FTITYPQVVS SQERGTYFID MASAYERLSP EMKRKIEGGV GTHSVRRYFK IRPTDVYRPI SEILHEIDAK TPTVTHPLVV NHPVTGAKIL YVSRGFTETI SLKDDDLDAD EVLKDLLVES GQSDDTFTHP DIRQININEG DIFLWDNRRY VHHAKHNDKV EPTKTYRLTA YDGLPFSAEI DFALDSVKEV GLV
|
| |