Gene Information |
Locus tag | Tcr_1785 |
Symbol | |
ID | 3761826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 1955929 |
End bp | 1957068 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637786528 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_392051 |
Protein GI | 78486126 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
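The coordinate and length fields above agree with each other, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the 1140 bp include the terminal stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(1955929, 1957068)
print(length, protein_length_aa(length))  # prints: 1140 379
```

The result matches the Gene Length (1140 bp) and Protein Length (379 aa) rows.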
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000000374535 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCCAGCA GAAGAGACTT TATTAAACAT TTATTAGCCC TGAGTGCACT TGGTACTGGC TCACTCAGTA CGGTCAGCTA CGGAAAAGCA ACCAGGCCTG GTCATTTTTC AACCGGCGTG GTCCTCGACC CTTTATTCTT TAAACACGAC ATGAGCGGCC ACCCTGAAAA TGCACAACGC TTAGTGGCTA TTAATAATGA AATGGAAAAA CAAGGCATCT GGCCTCAACT AACGCCCGTG GCAACACGAT TAGCCACCAA TGAAGAATTA TTACTTGCTC ATACACAAAG CTATATTGAT GAAATAGAAA TATTAAGCGA TTCTGGCGGC GGCTTTTACG AACCTTATCA GGGCGACACT TACTTAAATG CGTCCAGCTT TGACGCAGCC AAAATGGCCG CCGGCAGTAA CATCAACTTA AACCTCGCCA TTTATGATCG AAAGATCGAC CATGGTTTCG CCCTGCTTCG TCCGCCAGGC CACCATGCAT TGCAAAATAA AGCCATGGGG TTTTGCATTT TCAACTCCGA CATCATTGCC GCCCGCGCAT TACAAAAATA CCGCGGCGTA AAACGCATTG CCATTATTGA TTTCGATGTT CATCACGGCA ATGGCACCCA AGATTTATCG GATAACGATC CATCCATTAT GTCGATCTCG ATCCACCAAC ACCCTTTTTG GCCAATGACG GGCGGGCACA CATTTACCGG AAAAGACAAG GCAAAAGGCA CAGTCGTTAA TTGCCCATTC CCAAAAGGAG CTGGTGATCA AACCTACTTA AATGTCTACG ACCAAGTCAT TCACCCAAAA TTAGAAGCTT TCAAGCCAGA GCATATTATT GTCTTTGCGG GGTATGATGC GCACTGGCAA GATCCTTTAG CCCAGCATCA AGTCTCGGTA GCCGGGTTCA ATCAACTCGT AGACAAATGC CTCAAGTCTG CCAAAGAACT TTGCGGCGGT CGAATCAGTT TTTCTCTGGG CGGCGGCTAT AATTTAAACC CGTTAGCACA ATGCGCTGTC GGCACTTTCC ACACCTTATT AGGCAACCCT GAAAAAAACA TCGACTCTAT TGGAAAAGCA CCAACCCCTG AGGTCGATTA TCAACAACGG ATACAAGAAC TGGTCATACA CCACTTATAA
Protein sequence | MSSRRDFIKH LLALSALGTG SLSTVSYGKA TRPGHFSTGV VLDPLFFKHD MSGHPENAQR LVAINNEMEK QGIWPQLTPV ATRLATNEEL LLAHTQSYID EIEILSDSGG GFYEPYQGDT YLNASSFDAA KMAAGSNINL NLAIYDRKID HGFALLRPPG HHALQNKAMG FCIFNSDIIA ARALQKYRGV KRIAIIDFDV HHGNGTQDLS DNDPSIMSIS IHQHPFWPMT GGHTFTGKDK AKGTVVNCPF PKGAGDQTYL NVYDQVIHPK LEAFKPEHII VFAGYDAHWQ DPLAQHQVSV AGFNQLVDKC LKSAKELCGG RISFSLGGGY NLNPLAQCAV GTFHTLLGNP EKNIDSIGKA PTPEVDYQQR IQELVIHHL
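The relationship between the two sequence rows can be reproduced with a minimal sketch that strips the 10-character spacing, translates codon by codon, and computes the GC fraction. It uses only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and are not part of any database tooling. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. This gene starts with ATG, so no special handling of the first codon is needed here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence row, spacing kept as in the record.
print(translate("ATGTCCAGCA GAAGAGACTT TATTAAACAT"))  # prints: MSSRRDFIKH
```

Passing the full Gene sequence row to `translate` and `gc_content` allows a direct comparison against the Protein sequence and GC content rows.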