Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0653 |
Symbol | |
ID | 8806402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 693482 |
End bp | 694702 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Histone deacetylase |
Protein accession | YP_003459904 |
Protein GI | 289207838 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.625383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.171491 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGCCA CCCCCGAAGC CATCGATGCC TCCACCGGGG CCGCGGACCG GCACCCGGAG CCGGCCCGCC GACATGCCAC GCTGCTGCAT GCCCCGCGCT ATCGCGGGCA CAGCTACGGC GACAACCACC CGCTCGGGAT CCCGCGCGTC TCGCTGACCA TCGACCTGAT CGAGGCCTAT GACGCGATCA CGCCGCAGGA GATGGCCGTA TCGCGCAAGG CACAGCCGGC CGAGCTCGAG TGGTTCCACA CCCGCGAGTA CGTGGCCGCG ATGCAGCGCG CCGAGGCGCT GGGCAAGGTC TTCCAGCGCT ACCGCGACCG CCACAACATC GGCAACTTCG AGAACCCGTT CTTCGCCGGG TTCTTCGCGA CCCCGGCGAC CGCCACCTGG GGCTCCATCC AGGGCGCCGA GGTGGTGCTG GACGGGCGCA TGGCCTTCAA CCCGGCCGGT GGCATGCACC ACGCCGCCCC CGACCAGGCC CGTGGATTCT GCTACTTCAA CGACCCGGCG CTGGCGATCC TGCGCCTGCG TCAGGCCGGG CAGCGCGTGC TCTATCTGGA TATCGACGCC CACCATGGCG ACGGGGTCGA GGCCGCGTTC AGCGACGACC CGGACGTGCT CACCGTCTCC CTGCACATGG ACACCGAGTA CGGCTACCCC TTCGAGGGCG GGCGCATCGA GGACACCGGA CCGCTTGGCA ACGCCGTGAA CCTGCCGCTG CCCAAGGAGA CCAACGACAG CGAGTTTCGC GCCGCCTTCG CGGCCGTCTG GGAGGCCGTG CGCACCCGCT GGCAGCCGGA TGCGGTCGTC GTTCAGGCCG GTACCGATGC GCTGCTGCCG GACCCGCTCG GCAAGTTCGG CATCTCCACC CAGTGCTTCC TCGCCTGCGT CGACGACGTG ATCGAGACCA GCCCGCGGCA CGCCGACGGC ACACCGAGGC TGCTGGTCCT GGGCGGTGGC GGCTATCACC CGCTGGCGCT GGCACGCTGC TGGACCGGGG TCTGGGCCCT GCTGACCGGT CGCGAGCTCC CCGCTGCGAT CCCGGAGGCC GGCCAGGCCC TGCTGCGCGA GGTTGACTGG GACATGGACG AAGAAGAGGA ATGGTACGAG TCCCTCTTTA CCTCCCGTCT CGATGCGGAT CGCGCCGGTC CGGTGCGCGA CGAACTGCGC AATCGCCTTG AACGGCTGCT GGCAGGGCAC CCGTTGTTCA GTTGCTCGTA G
|
Protein sequence | MSATPEAIDA STGAADRHPE PARRHATLLH APRYRGHSYG DNHPLGIPRV SLTIDLIEAY DAITPQEMAV SRKAQPAELE WFHTREYVAA MQRAEALGKV FQRYRDRHNI GNFENPFFAG FFATPATATW GSIQGAEVVL DGRMAFNPAG GMHHAAPDQA RGFCYFNDPA LAILRLRQAG QRVLYLDIDA HHGDGVEAAF SDDPDVLTVS LHMDTEYGYP FEGGRIEDTG PLGNAVNLPL PKETNDSEFR AAFAAVWEAV RTRWQPDAVV VQAGTDALLP DPLGKFGIST QCFLACVDDV IETSPRHADG TPRLLVLGGG GYHPLALARC WTGVWALLTG RELPAAIPEA GQALLREVDW DMDEEEEWYE SLFTSRLDAD RAGPVRDELR NRLERLLAGH PLFSCS
|
| |