Gene CNA04230 details


Gene Information

Locus tag: CNA04230
Symbol:
ID: 3253378
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 1135541
End bp: 1137029
Gene Length: 1489 bp
Protein Length: 439 aa
Translation table:
GC content: 48%
IMG OID: 638252743
Product: general RNA polymerase II transcription factor, putative
Protein accession: XP_566766
Protein GI: 58258707
COG category: [D] Cell cycle control, cell division, chromosome partitioning
              [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG5333] Cdk activating kinase (CAK)/RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH/TFIIK, cyclin H subunit
TIGRFAM ID:
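
The gene length in the record agrees with its start and end coordinates if those are read as 1-based and inclusive. The page does not state the convention, so that reading is an assumption, and the function name below is hypothetical. A minimal sketch of the check:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start and end coordinates taken from the record above.
print(feature_length(1135541, 1137029))  # 1489
```

A 439-residue protein needs at most 439 × 3 + 3 = 1320 bp including a stop codon, which is shorter than the 1489 bp gene and consistent with the record marking the gene as spliced.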


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.333957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACCAACAGCA GCTGCAGTTA GATGTCTTCC AACTTCTATA CCTCCTCTCA TAACCGCTAT 
TGGCTCCTGA CTCGGCCGTC TCTTCTGGAA TCTCGGCAAA CAGACCTCAA ATACTGCACC
TCTCGCCAGC TATATTGCCT CTTCATCTTC TTCTCTCAAC TCATCCAAAA ACTCGGTAAA
CGACTGCTGC TGAGGCAAAT ACCGATAGCC ACCGCATGTG TGTTTTTCAA GCGGTTCTAC
TTCAAGAACA GTTTGTGCGA AACGAATCCA TATCTGGTGC TCGCGGCTTG CATTTATGTG
GCAGCCAAAG TAGAGGAGAC TCCGGTACAT ATCAAGAGTG TCGTAAGTGA GGCCAAGTTG
GTTTTCCATG GTGAGATTTC CCTTTAGGCT CTCAATTGTT ACATTTCCTG AAGTCGGGCG
ATCAATGAAA CAGAACACAA CATCAAAATG TTCCCTGCTG AGACCAATAA GCTTGGAGAA
ATGGAGTTTT ATCTACTGGA GGATCTCGAT TTCCACTTAG TGGTCTTCCA CCCATATCGG
GCGCTACTCC ATCTCACCGG GAGGGAGTCT GCAGACATGG GAAAATTTGA GAAGTCCAGA
GTTCAAGAAG ATATGGAAAT ACGAAAAAAA GAAGGAGATG CCAAAAAGAT GCGAGAAGAA
GAGGCGAAGA AGGCGAGCAG TAAGGGACAG CAACCAACAG TTGGACAGGC ACTTGAAAAA
GAGGGGGAGC GCCTCGAAGA GGCTGAGGAA ACCAGGATAA GGCGTCTAAT GAGTAGAGGG
ACAGGCGAAG GTATGATGGA AGTGGACGAG GGTGTTTTGC AAATATCATG GTGAGTGTGA
TTATCGACCC CAATTTGGCA TCTAGCTGAT AGGTCCCAAT AGGTTCATCC TCAACGACTC
CTATCGCACC GATGCCCCTC TGCTATATCC TCCTTATATA ATCGCTCTCT CGGCAATATA
TATCGCCTTC TGCCTAACAT CCATGTCGAA TTCCTCTGCC CGCACCCGTG CGTCTTCCAC
TCAGCGACCG GAACTCTTGC AGTCGGCTTC GATTAATGAA GGATTGAATT TGCTTCCACC
GCCTAAAAAT GCCGCAGAAT TTCTGGCTGG GTTTCAAGTC AGTTTACCAA TGCTGTTTGG
TTGCGTGCAA GAGATTATTG GACTGTATCC CGTATGGGAG GCATTTGAGC CAACGGTGAT
GAGGAATTCC CAAGCACAAG CCAAAACGGG GAATGCAGCA GCACCTGTCC CGGCTGCAAC
TGGGACAAAA ACCGGGCAGA ACAACGATTT AGTCCAAGAC AAAAAGGACA AGTTCGGTTT
GGAGGAGGCT GAATCTCTGG TACGGAAAAT GATCGAGGAA AGGATGATAG ATTTAGGTCA
TCCAGATAAT GCGGGTGTTG AAAAGGCTTC AGGTACCGGC CCCTCCAATG TAGCGGGTAA
AAAGCGCGCA AGATAGCATA GATCATGTCT TGCTCATTTT CGAGTGCAC
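
The GC content field can be recomputed from a sequence printed in 10-base blocks like the one above. The sketch below assumes GC content means the fraction of G and C among all bases after whitespace is removed. How the source database treats ambiguous bases is not stated, and the function name is hypothetical.

```python
def gc_content(blocked_seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    # Join the 10-base blocks and lines into one uninterrupted string.
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, kept in its printed block layout.
first_line = "ACCAACAGCA GCTGCAGTTA GATGTCTTCC AACTTCTATA CCTCCTCTCA TAACCGCTAT"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 1489 bp gene sequence instead of a single line gives the value to compare against the GC content field.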
 
Protein sequence
MSSNFYTSSH NRYWLLTRPS LLESRQTDLK YCTSRQLYCL FIFFSQLIQK LGKRLLLRQI 
PIATACVFFK RFYFKNSLCE TNPYLVLAAC IYVAAKVEET PVHIKSVVSE AKLVFHEHNI
KMFPAETNKL GEMEFYLLED LDFHLVVFHP YRALLHLTGR ESADMGKFEK SRVQEDMEIR
KKEGDAKKMR EEEAKKASSK GQQPTVGQAL EKEGERLEEA EETRIRRLMS RGTGEGMMEV
DEGVLQISWF ILNDSYRTDA PLLYPPYIIA LSAIYIAFCL TSMSNSSART RASSTQRPEL
LQSASINEGL NLLPPPKNAA EFLAGFQVSL PMLFGCVQEI IGLYPVWEAF EPTVMRNSQA
QAKTGNAAAP VPAATGTKTG QNNDLVQDKK DKFGLEEAES LVRKMIEERM IDLGHPDNAG
VEKASGTGPS NVAGKKRAR
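
The protein length field can be cross-checked against the printed sequence, which is laid out as seven full lines of six 10-residue blocks followed by one shorter line. A minimal sketch, with a hypothetical helper name:

```python
def residue_count(blocked_seq: str) -> int:
    """Count residues in a sequence printed in whitespace-separated blocks."""
    return len("".join(blocked_seq.split()))


# Seven full lines of 60 residues each, plus the final partial line.
full_lines = 7
last_line = "VEKASGTGPS NVAGKKRAR"
print(full_lines * 60 + residue_count(last_line))  # 439
```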