Gene CNK00950 details


Gene Information

Locus tag: CNK00950
Symbol:
ID: 3254413
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 297631
End bp: 299743
Gene Length: 2113 bp
Protein Length: 557 aa
Translation table:
GC content: 50%
IMG OID: 638253585
Product: Pol II transcription elongation factor, putative
Protein accession: XP_567783
Protein GI: 58260746
COG category: [B] Chromatin structure and dynamics; [K] Transcription
COG ID: [COG1243] Histone acetyltransferase
TIGRFAM ID: [TIGR01211] histone acetyltransferase, ELP3 family
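
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed gene length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_bp(protein_length_aa: int) -> int:
    """Coding bases needed for a protein: three per residue plus one stop codon."""
    return protein_length_aa * 3 + 3


span = gene_length(297631, 299743)   # values from Start bp / End bp
coding = min_coding_bp(557)          # value from Protein Length
print(span, coding, span - coding)
```

The coding requirement is smaller than the genomic span, which fits the record marking the gene as spliced: the difference would be non-coding sequence within the span.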


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.110026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGCAC CCACAACCTC ACAGTCGGAA CTGCACCTTC TATGTGTATC CGCCGTGGTC 
AGAGACCTCA TCTCAACCTA TAATTCCTCA AGCAGCTCCG CCACCCAGCC TCCAAACGTA
AACAGTCTTC GAGCAAAATA CGCAAAAAAA TACGGACTCA AGGCAGTCCC TCGTTTGACA
GATGTCCTCG CTGCTGTCCC TGAAGAATGG AAGGACAGGC TGAGGGGATG GCTGAGAGCA
AAGCCAGTCA GGACGGCGAG CGGTGTGGCA GTTGTTGCTG TCATGTGCAA ACCCCATCGA
TGTCCTCACG TAGCCATGAC CGGAAACATC TGCGTGTAAG TGTTTCTAGT GGCACTGATC
TGTGATCGCT ACTTATAGAA TACGATAGCT ACTGCCCCGG CGGTCCCGAC TCTGACTTTG
AATACTCAAC CCAGTCATAC ACTGGATATG AGGTACGTAC ATTCTGCTAC CCGCTTTGTG
TGCCTCGTCT GACATTCTGA AGCCTACTTC TATGCGAGCC ATCCGGGCCA GATATGACCC
ATATGAGCAA GCGCGTGGAC GTGTGAACCA GCTCCGTGAC CTTGGACACA GCGTCGACAA
GGTTGAGATC ATGTACGTAG CTTTTCTTTT TCTTTTTTGG TTTTCAAAAG TTTTTCCGCT
CACTGGATCA CAGTATCATG GGAGGAACGT TCATGTCCAT GCCGGAAGAC TACCGCCATA
AATTCATTGC TGGACTTCAC AATGCCTTGA GTGGTCACAC TGGAGAGGAC GTTGACGAGG
CTGTCAAGTA AGTCTATTTG CCAACTCAGC AAAAAGTTCG AGTAAACTAA TATACCACAG
ATTCTCTGAG CAAAGCAAGG TCAAATGCGT TGGTATCACT ATTGAAACTC GTCCCGATTA
CTGCTTGAAG CCTCATCTGA GTCAGATGTT GAGGTATGGA TGCACCCGTC TGGAAATCGG
TGTTCAATCA GTCTATGAAG TGAGCCAACT CCATTTTTAC TCACCCCCTC CCCCCTCTCC
CCACCAAAAA AAAAATCCAG CTGACACCCA GTCAAGGACG TGGCACGAGA CACCAACCGA
GGACACACTG TCCGAGCTGT CAGTGAATCC TTCCACATGT CCAAAGACGC CGGCTACAAG
ATTGTCGCCC ACATGATGCC TGACCTCCCC AACTGCGGTA CCGAGCGAGA CATTTGGCAA
TTCCAAGAAT TCTTTGAAAA CCCCGCTTTC CGCTCAGACG GTCTCAAACT GTACCCAACC
TTGGTCATCC GTGGTACCGG TCTTTACGAA CTGTGGAGGA CTGGCAAGTA CAAGAATTAC
CCTCCCAACG CCCTTGTCGA TATCGTAGCG AGGATCATGG CGCTCGTACC CCCCTGGACG
CGAGTCTACC GCGTCCAACG AGATATCCCG ATGCCGCTCG TCTCTTCCGG CGTGGAGAAT
GGTAATTTAC GTGAACTCGC ACTTGCGCGT ATGAAGGATT TCGGTGCCGA GTGTCGAGAT
GTGCGATACC GTGAGGTCGG TCTGCACGAG ATTCACCACC GTGTGCGACC GCGTGATATC
GAGCTTATCC GAAGAGATTA CGCGGCGAAT GGCGGATGGG AGACGTTCTT GTCGTATGAG
GATCCTCAGT CTGATATCTT GGTCGGTCTT TTGAGGTTGA GAAAGTGTTC AGAGGAAGGG
ACGTTTAGGA AGGAGTTGGT TGGTATGCAA GGTGGATGCA GCCTTGTGCG AGAGCTGGTA
GGTTTTTTTT ATTTTTTTAA TGGGGTTGAA GTATGGATAC TGATCCAAGG TTGATTAGCA
CGTGTATGGT ACTGCTGCAC CCGTTCACTC TCGTGACCCC AAGAAATTCC AGCATCAAGG
TATCGGTACA TTGTTGATGG AAGAGGCGGA GCGTATCGCC CGTGAGGAGC ACGGTAGCGG
TCGGATCGCT GTAATCTCTG GTACGTATAC GAGCCAAACA GCTTTGCCGA ACAAGGATTT
TATTGATAAT CATGTACACA CAGGTGTTGG AACGCGTGAT TACTATCGAC GGCTTGGTTA
CTTTCTCGAT GGGCCTTATA TGGTCAAGGA TCTTTTGTAC GATGACGAGT AGATTTGAGC
TTGTGATTTT TGA
 
Protein sequence
MIAPTTSQSE LHLLCVSAVV RDLISTYNSS SSSATQPPNV NSLRAKYAKK YGLKAVPRLT 
DVLAAVPEEW KDRLRGWLRA KPVRTASGVA VVAVMCKPHR CPHVAMTGNI CVYCPGGPDS
DFEYSTQSYT GYEPTSMRAI RARYDPYEQA RGRVNQLRDL GHSVDKVEII IMGGTFMSMP
EDYRHKFIAG LHNALSGHTG EDVDEAVKFS EQSKVKCVGI TIETRPDYCL KPHLSQMLRY
GCTRLEIGVQ SVYEDVARDT NRGHTVRAVS ESFHMSKDAG YKIVAHMMPD LPNCGTERDI
WQFQEFFENP AFRSDGLKLY PTLVIRGTGL YELWRTGKYK NYPPNALVDI VARIMALVPP
WTRVYRVQRD IPMPLVSSGV ENGNLRELAL ARMKDFGAEC RDVRYREVGL HEIHHRVRPR
DIELIRRDYA ANGGWETFLS YEDPQSDILV GLLRLRKCSE EGTFRKELVG MQGGCSLVRE
LHVYGTAAPV HSRDPKKFQH QGIGTLLMEE AERIAREEHG SGRIAVISGV GTRDYYRRLG
YFLDGPYMVK DLLYDDE
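
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation; the helper names `unblock` and `gc_fraction` are hypothetical, and the example runs only on the first line of the gene sequence rather than the full record.

```python
def unblock(seq_text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGATCGCAC CCACAACCTC ACAGTCGGAA CTGCACCTTC TATGTGTATC CGCCGTGGTC"
dna = unblock(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Applied to the full gene and protein listings, the cleaned lengths can be compared against the Gene Length and Protein Length fields as a sanity check on the copy.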