Gene Jann_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1010 
Symbol 
ID3933454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp965077 
End bp966177 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content63% 
IMG OID637903358 
Producthistone deacetylase superfamily protein 
Protein accessionYP_508952 
Protein GI89053501 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.942652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.837706 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG GCTTTTTCTG GGATGAGCGG TGTTTCTGGC ATTCGGGCGG GAATTATGCA 
GGTCTGTTGC AGGTCGGCGG CCTGATCCAA CCGGGCGGCG GGTTGCCCGA AAACCCCGAG
ACCAAGCGTC GCCTCGTCAA TCTGTTGCGC GTCACCGGGT TGTGGGAGGG GCTTGCGACC
CAACAGGCCA CCCCCGTGAC GCAAGACGAC CTCCTGCGCA TTCATCCGTC CGACTATCTC
GCCAGCTTCA GGGCCAAATC GGCGGAGGGC GGCGGAGAGT TAGGCCTGCG CACACCGTTC
GGCCCCGACG GGTATGACAT CGCCTGCGTG TCCGCCGGGC TGGCGAAGGC TGCGCTGTTT
GCCACGCTCA AAGGCGACGT GAAAAACGCC TATGCCCTGT CGCGGCCCCC CGGCCACCAT
TGCCTGCCGG ATTTTCCCAA TGGCTTCTGC CTGCTCGCCA ATATCGCCAT CGCGATCGAG
GCTGCCCAGA CCGTCAAGCT GACCGACCGT GTCGCGGTGA TTGACTGGGA CGTCCATCAT
GGCAACGGAA CCGAGGCGAT ATTCTACGAC CGTGACGATG TCCTGACGAT TTCATTGCAT
CAGGAGCGCA ACTACCCGCT CGACACCGGC GATTTTGAAG ATCGCGGATC CGAGGCGGGT
GTGGGCTACA ACCTCAACAT TCCACTGCCC CCCGGCACCG GCCACGCAGG GTATGAGGAG
GCGTTCGAGC GGCTGGTGAT CCCGTCCCTT CACGCCTTCC AACCCGATGC GATCATTGTG
GCCTGCGGCT ATGACGCATC CCTCGTCGAT CCGCTGAGCC GAATGATCGC AGGCGGCGAC
ACTTTTCGCG CGATGACAGA CATGACGATG GAAGCCGCGG ATGATCTGTG CGGGGGCCGC
CTGACCGTGG TGCACGAAGG CGGCTACTCC GAGGTTCACG TCCCCTTCCT TGGCCACGGC
GTGCTGGAAT CGATGTCCGG CAGCGACATC CATGCGCCCG ATCCGTTCCA GTACAAATTC
GAAGGCCAGC AACCGGGCGC ACGCTTCAAT CGCTACGTCT CGACCCTGAT CGCCGAGATG
GAAGATGCTC TGGGGCTGTA G
 
Protein sequence
MTTGFFWDER CFWHSGGNYA GLLQVGGLIQ PGGGLPENPE TKRRLVNLLR VTGLWEGLAT 
QQATPVTQDD LLRIHPSDYL ASFRAKSAEG GGELGLRTPF GPDGYDIACV SAGLAKAALF
ATLKGDVKNA YALSRPPGHH CLPDFPNGFC LLANIAIAIE AAQTVKLTDR VAVIDWDVHH
GNGTEAIFYD RDDVLTISLH QERNYPLDTG DFEDRGSEAG VGYNLNIPLP PGTGHAGYEE
AFERLVIPSL HAFQPDAIIV ACGYDASLVD PLSRMIAGGD TFRAMTDMTM EAADDLCGGR
LTVVHEGGYS EVHVPFLGHG VLESMSGSDI HAPDPFQYKF EGQQPGARFN RYVSTLIAEM
EDALGL