Gene Rcas_1078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1078 
Symbol 
ID5538544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1397016 
End bp1398056 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content61% 
IMG OID640893213 
Producthistone deacetylase superfamily protein 
Protein accessionYP_001431196 
Protein GI156741067 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTTG TTGCGTCCGA CACCCATCTT CAGCACAATC CGCCCCACGA GTTCCTCGAT 
GGCGCGCTGA TTCCAATCTA CGAGACGCCG GAACGTGTGG CGATTATTCG CGCCGCGCTC
GAACGTGCTG CCATTGGTCC TGTCGTCGAG CCGCGCGCGT TCGGAATCGA ACCGGTGCGC
ACCGTTCACG ATGACGGATA TCTGACCTAT CTTGAGACGA TCTACGACCG CTGGGTTGTT
GCAGGCGGTG CGCCGGAAGC GGTCATTCCC GGAACGCTGG CAGTGCGCTG GATGTCGCGA
CCGCACGATC ACCCGCTGGC TGCACCCGGG TATTATACGT TCGATACGTG CGCTCCAATT
GTGGCAGGCA CTTATGCCGC CGCGCGTGCT GCCGCCGATG TAGCGCTGAC CGGCGCCGAG
TTGCTCCTGA TCGGTTCACG TTGTGCGTAT GCGCTCTGTC GCCCGCCGGG ACATCATGCC
GGGCGTGATC TGTGCGGCGG GTACTGTTTC CTCAACAATG CTGCGATTGC AGCGGCGTAT
CTGGTGCGCA ACGCACCAGA CGCGACGTGC GCCATTCTCG ACATCGATTT CCACCACGGC
AATGGCACAC AACAGATCTT TTACGAACGC AGTGATGTGC TGTTTGTCTC GATCCACGCC
GCACCGTCGT ACCAGTATCC CTTCTTTCTG GGCTACGCCG ATGAGCGCGG CGCCGGCGCC
GGTGAGGGGT ATAATCTGAA CCTGCCGCTG GATGCGGGCG TAGGGGACCG TGAGTATCTG
GTGGCGCTCG ACCGGGCGCT CGATGCGATT GCAGCGTTTG CGCCGCGTTT TCTGGTGCTT
TCGGCAGGGT TCGATACGTT CGAAGGCGAT CCGGTCGCCA ATCATGGCGA CAGTTTTGCG
CTTAGCATGG CAGTCTACCC TGAGATCGGG CGGCGCATTG CTGCGCTCAA TCTGCCGACA
CTGGTCGTTC AGGAGGGAGG ATACGCCATC GACGCACTCG GCGATAATGT CGTCGGGCTG
TTACAGGGTC TCGAGGAGTA A
 
Protein sequence
MIVVASDTHL QHNPPHEFLD GALIPIYETP ERVAIIRAAL ERAAIGPVVE PRAFGIEPVR 
TVHDDGYLTY LETIYDRWVV AGGAPEAVIP GTLAVRWMSR PHDHPLAAPG YYTFDTCAPI
VAGTYAAARA AADVALTGAE LLLIGSRCAY ALCRPPGHHA GRDLCGGYCF LNNAAIAAAY
LVRNAPDATC AILDIDFHHG NGTQQIFYER SDVLFVSIHA APSYQYPFFL GYADERGAGA
GEGYNLNLPL DAGVGDREYL VALDRALDAI AAFAPRFLVL SAGFDTFEGD PVANHGDSFA
LSMAVYPEIG RRIAALNLPT LVVQEGGYAI DALGDNVVGL LQGLEE