Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_1949 |
Symbol | |
ID | 5709853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 2026816 |
End bp | 2027856 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641276457 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001541755 |
Protein GI | 159042503 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGTGA CTGTGCATGA TGAGGGTTAT TTAAGGCATG AGACGCCTTC GAATCATCCT GAGTCCCCTA ATAGGGTATT GAGCGCGTTA AGTGGATTAA GTGGGTTAGC TAAGGTGATT AAACCCATGG TTAATGTTGA TTTAATTGAA ACAGCTGGGC TTGTTCACGA GTACCATTAC GTTGAGTACG TTACATCACT TTGCGCAACT GGGGGGTGGC TTGACCAAGA CACCTATGTA TCCAAGGGTA CTTGCGACGC CTTAAACTCC AGTGTTAACG CCTTAGCCAC AGCGGTTAAT CTACTTAAGT CAGGTGTAAG GTACGTTTAC CTACCCTTAA GGCCCCCTGG TCATCATGCT GGATTCAGGG GTAGGGCACT CATGGCTTCA ACGCAGGGAT TCTGCATACT TAATAACGCC GCAATAATCA GTAGTCTACT ACTTAAGGGT GGGGCCTCTA GGGTTGCTGT ACTGGATATT GATGCGCATC ATGGTAATGG TACACAGGAA ATATTCTACA ACACATCCAG GGTCTTCTAC ATTAGTACTC ACCAGGATCC AAGAACACTT TACCCGGGTA CTGGTTATGT TAATGAGACT GGGGTGGGTG ATGGGGAGGG CTTTAACATG AATATACCAT TACCACCAAT GACTGGTGAT GATTTATATA AGATTATCTT AAGGCCAATT GAGAATGCAT TAAGGGAATA TAAGCCAGAT TACGTAGTGG TGTCACTGGG CTTTGACGCA CATTATCTCG ATCCATTAAC TAACCTTAAC CTAAGCCTAA ACAGTTACAT TGAGGTATTC CTCATGATTA GAAGGCTCAT TAATGAGGGT GTTACCGGGG GTTCAGTTTA TGTCCTAGAG GGTGGTTATA ATTCTGATGT TATTAAACAG GGTTCAAGGG CCTTAGCCAT GGTAAGTAAT GGTGTGGATT CAATTAGAAT TGAGGATCCC ACTAGGACTG ATGCAGGTGT GGTTAAGTAC TTTAATAAAA TGATTGAGCA GCATAAGGCA TTATTCACCC AGTACTGGTA G
|
Protein sequence | MLVTVHDEGY LRHETPSNHP ESPNRVLSAL SGLSGLAKVI KPMVNVDLIE TAGLVHEYHY VEYVTSLCAT GGWLDQDTYV SKGTCDALNS SVNALATAVN LLKSGVRYVY LPLRPPGHHA GFRGRALMAS TQGFCILNNA AIISSLLLKG GASRVAVLDI DAHHGNGTQE IFYNTSRVFY ISTHQDPRTL YPGTGYVNET GVGDGEGFNM NIPLPPMTGD DLYKIILRPI ENALREYKPD YVVVSLGFDA HYLDPLTNLN LSLNSYIEVF LMIRRLINEG VTGGSVYVLE GGYNSDVIKQ GSRALAMVSN GVDSIRIEDP TRTDAGVVKY FNKMIEQHKA LFTQYW
|
| |