Gene Ccur_11040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcur_11040 
Symbol 
ID8375311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptobacterium curtum DSM 15641 
KingdomBacteria 
Replicon accessionNC_013170 
Strand
Start bp1258063 
End bp1260063 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content50% 
IMG OID644994026 
Producthistone acetyltransferase 
Protein accessionYP_003151477 
Protein GI256827518 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG1243] Histone acetyltransferase 
TIGRFAM ID[TIGR01211] histone acetyltransferase, ELP3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.142131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones132 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGC CGCTCACATC CCCTGGTACC GATACATCTC CTGTCACAGA GACATCCCCT 
GCTGTAGATA TGCTATCCAG CACAGATACG CCATGCGTTA CCGATACATC ACCTGGTGTA
GATGCGTCAA CTGTTATCAA TGCATCACCC ACAAACGAAA AACCCACTAT CGAGGGGCTT
ATCACAGATA TTATCCAAGC ACTGAAAAGC TCTCGTGAAC TTGATGCGCG AAAACTGGAA
TCTCTGATCG CACGCTATAA TCGCGCATGG CATAGCCCAG AGCGCCACTT TGCAAAACGA
CAGTTGCTAC CGGCCTATCA GCGCATGAAG CGCGAGCACG AAGATCTCTG GCGCACCTGG
CATATCAGCG AACAAGATGA GGCTTATCTT ATTGGTATTT TACAGATGAA ACCCCGCCGC
AGCGCATCAG GTGTGGCCAC CATCACTGTT ATTACTAAGC CACATGCCTG TGCGAGCGCG
TGTCTCTACT GTCCCAACGA TGTACGCATG CCAAAAAGTT ACCTCTTCAA AGAGCCTGCC
TGCCAACGTG CCGAACGTAA CTTCTTTGAC CCATACCTGC AGGTTGCAAG TCGCCTTGAA
GCGCTGCACG AGATGGGTCA TGCAACCGAC AAGATTGAGC TGATTGTTTT AGGGGGCACC
TGGAGCGATT ATCCACGCAG CTATAGAACC TGGTATGCTC GTGAACTGTT CCGCGCTCTC
AACGACAGCC CCACCGAGCG TCAAGAGTGC TGCAATACGC GACGCACTGC CTATAAGAAA
TCTGCTATCG GTTGCGAAGC TGCGGAACTT GCACAGCAGT GCGAACCCGT CCAAGCTCTT
GTTAACGCTG GACAGATTAC CTATAACGAA GCAGTCGGTT GTCTGTTTGG ACCGCAATCT
TCTTGGGGCA CTACCGCAAG CTGGCAACAG ACTTGCCAAG AAGACCTCTT CGCTGAACAT
ATCCGCAACG AAACGGCAGA TCACCGCTGT GTCGGATTAG TAGTAGAAAC ACGTCCCGAT
GCCATCAGCG CCGAAAGTTT ACACGCTTTA CGGGAAATCG GCTGCACAAA AGTACAGATC
GGCGTCCAAA GTACCCGCGA CGAAATTCTC GCCGCAAACG ATCGCGGCAT TACCACAGCC
ACCATCAAGC AGGCTTTTCG CCTTTTGAGA GTGTTTGGAT TCAAAATCCA TGCTCACTTT
ATGGTAAATC TCTTGGGTTC AACACCCGCC GACGACAAGC GCGACTATGC CTGCTTCGCC
GATACACCCG CTTTTCGCCC TGACGAAGTA AAACTCTACC CCTGCGCCCT CATTGAAGGG
ACTGGTCTTA TGGCGCATTG GGAAAAAGGG ACATGGCGCC CCTATTCTGA AGATGAGCTT
GTCGACGTAC TCACTGCAGA TGTGCTGGCT ACCCCTGCTT ACACGCGCCT TTCCCGTATG
ATCAGAGATA TATCAACCCA GGATATTGTT GCGGGCAATA AAAAACCCAA CTTGCGTCAA
ATGGTAGAAG AACGCCTACG CACCGACGGT TATCACGAAG CTATCCGCGA GATCCGTTTC
CGCGAAATCA ATCGCGAAAC CGTTGACATA AATAAACTTA CGCTAAAAGA AATCGTCTAC
CAAACCGAGG TATCCGAAGA ACATTTCTTA CAGTGGGTTG ACCCCACTGA CCGTATCGCT
GGCTTTTTAC GCCTGTCACT TCCCTATAAA AGCTATATCA CTGCACACGC TGGTGAAATC
CCTATATACG CCGGTGAAGC CATGATTCGC GAAGTGCATG TATATGGCGT GGCAGCGCAC
TTACATCACA CCGATGCGGG CGTGCAGCAT CTGGGACTCG GCCGCCAACT GGTCGAACAA
GCATGTATTA TTGCCGCAAA AGCAGGTTTT ACTAAGTTGA ATGTTATCAG TGCTATTGGC
ACGCGCGAAT ACTATCGACG CCTTAGTTTT TATGACAATG GCCTCTATCA GCAGCGCAAT
TTACTTGAAC CAACAAAATA A
 
Protein sequence
MEKPLTSPGT DTSPVTETSP AVDMLSSTDT PCVTDTSPGV DASTVINASP TNEKPTIEGL 
ITDIIQALKS SRELDARKLE SLIARYNRAW HSPERHFAKR QLLPAYQRMK REHEDLWRTW
HISEQDEAYL IGILQMKPRR SASGVATITV ITKPHACASA CLYCPNDVRM PKSYLFKEPA
CQRAERNFFD PYLQVASRLE ALHEMGHATD KIELIVLGGT WSDYPRSYRT WYARELFRAL
NDSPTERQEC CNTRRTAYKK SAIGCEAAEL AQQCEPVQAL VNAGQITYNE AVGCLFGPQS
SWGTTASWQQ TCQEDLFAEH IRNETADHRC VGLVVETRPD AISAESLHAL REIGCTKVQI
GVQSTRDEIL AANDRGITTA TIKQAFRLLR VFGFKIHAHF MVNLLGSTPA DDKRDYACFA
DTPAFRPDEV KLYPCALIEG TGLMAHWEKG TWRPYSEDEL VDVLTADVLA TPAYTRLSRM
IRDISTQDIV AGNKKPNLRQ MVEERLRTDG YHEAIREIRF REINRETVDI NKLTLKEIVY
QTEVSEEHFL QWVDPTDRIA GFLRLSLPYK SYITAHAGEI PIYAGEAMIR EVHVYGVAAH
LHHTDAGVQH LGLGRQLVEQ ACIIAAKAGF TKLNVISAIG TREYYRRLSF YDNGLYQQRN
LLEPTK