Gene Tcr_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_1785 
Symbol 
ID3761826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp1955929 
End bp1957068 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content46% 
IMG OID637786528 
Producthistone deacetylase superfamily protein 
Protein accessionYP_392051 
Protein GI78486126 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000374535 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGCA GAAGAGACTT TATTAAACAT TTATTAGCCC TGAGTGCACT TGGTACTGGC 
TCACTCAGTA CGGTCAGCTA CGGAAAAGCA ACCAGGCCTG GTCATTTTTC AACCGGCGTG
GTCCTCGACC CTTTATTCTT TAAACACGAC ATGAGCGGCC ACCCTGAAAA TGCACAACGC
TTAGTGGCTA TTAATAATGA AATGGAAAAA CAAGGCATCT GGCCTCAACT AACGCCCGTG
GCAACACGAT TAGCCACCAA TGAAGAATTA TTACTTGCTC ATACACAAAG CTATATTGAT
GAAATAGAAA TATTAAGCGA TTCTGGCGGC GGCTTTTACG AACCTTATCA GGGCGACACT
TACTTAAATG CGTCCAGCTT TGACGCAGCC AAAATGGCCG CCGGCAGTAA CATCAACTTA
AACCTCGCCA TTTATGATCG AAAGATCGAC CATGGTTTCG CCCTGCTTCG TCCGCCAGGC
CACCATGCAT TGCAAAATAA AGCCATGGGG TTTTGCATTT TCAACTCCGA CATCATTGCC
GCCCGCGCAT TACAAAAATA CCGCGGCGTA AAACGCATTG CCATTATTGA TTTCGATGTT
CATCACGGCA ATGGCACCCA AGATTTATCG GATAACGATC CATCCATTAT GTCGATCTCG
ATCCACCAAC ACCCTTTTTG GCCAATGACG GGCGGGCACA CATTTACCGG AAAAGACAAG
GCAAAAGGCA CAGTCGTTAA TTGCCCATTC CCAAAAGGAG CTGGTGATCA AACCTACTTA
AATGTCTACG ACCAAGTCAT TCACCCAAAA TTAGAAGCTT TCAAGCCAGA GCATATTATT
GTCTTTGCGG GGTATGATGC GCACTGGCAA GATCCTTTAG CCCAGCATCA AGTCTCGGTA
GCCGGGTTCA ATCAACTCGT AGACAAATGC CTCAAGTCTG CCAAAGAACT TTGCGGCGGT
CGAATCAGTT TTTCTCTGGG CGGCGGCTAT AATTTAAACC CGTTAGCACA ATGCGCTGTC
GGCACTTTCC ACACCTTATT AGGCAACCCT GAAAAAAACA TCGACTCTAT TGGAAAAGCA
CCAACCCCTG AGGTCGATTA TCAACAACGG ATACAAGAAC TGGTCATACA CCACTTATAA
 
Protein sequence
MSSRRDFIKH LLALSALGTG SLSTVSYGKA TRPGHFSTGV VLDPLFFKHD MSGHPENAQR 
LVAINNEMEK QGIWPQLTPV ATRLATNEEL LLAHTQSYID EIEILSDSGG GFYEPYQGDT
YLNASSFDAA KMAAGSNINL NLAIYDRKID HGFALLRPPG HHALQNKAMG FCIFNSDIIA
ARALQKYRGV KRIAIIDFDV HHGNGTQDLS DNDPSIMSIS IHQHPFWPMT GGHTFTGKDK
AKGTVVNCPF PKGAGDQTYL NVYDQVIHPK LEAFKPEHII VFAGYDAHWQ DPLAQHQVSV
AGFNQLVDKC LKSAKELCGG RISFSLGGGY NLNPLAQCAV GTFHTLLGNP EKNIDSIGKA
PTPEVDYQQR IQELVIHHL