Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA04230 |
Symbol | |
ID | 3253378 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1135541 |
End bp | 1137029 |
Gene Length | 1489 bp |
Protein Length | 439 aa |
Translation table | |
GC content | 48% |
IMG OID | 638252743 |
Product | general RNA polymerase II transcription factor, putative |
Protein accession | XP_566766 |
Protein GI | 58258707 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG5333] Cdk activating kinase (CAK)/RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH/TFIIK, cyclin H subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.333957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACCAACAGCA GCTGCAGTTA GATGTCTTCC AACTTCTATA CCTCCTCTCA TAACCGCTAT TGGCTCCTGA CTCGGCCGTC TCTTCTGGAA TCTCGGCAAA CAGACCTCAA ATACTGCACC TCTCGCCAGC TATATTGCCT CTTCATCTTC TTCTCTCAAC TCATCCAAAA ACTCGGTAAA CGACTGCTGC TGAGGCAAAT ACCGATAGCC ACCGCATGTG TGTTTTTCAA GCGGTTCTAC TTCAAGAACA GTTTGTGCGA AACGAATCCA TATCTGGTGC TCGCGGCTTG CATTTATGTG GCAGCCAAAG TAGAGGAGAC TCCGGTACAT ATCAAGAGTG TCGTAAGTGA GGCCAAGTTG GTTTTCCATG GTGAGATTTC CCTTTAGGCT CTCAATTGTT ACATTTCCTG AAGTCGGGCG ATCAATGAAA CAGAACACAA CATCAAAATG TTCCCTGCTG AGACCAATAA GCTTGGAGAA ATGGAGTTTT ATCTACTGGA GGATCTCGAT TTCCACTTAG TGGTCTTCCA CCCATATCGG GCGCTACTCC ATCTCACCGG GAGGGAGTCT GCAGACATGG GAAAATTTGA GAAGTCCAGA GTTCAAGAAG ATATGGAAAT ACGAAAAAAA GAAGGAGATG CCAAAAAGAT GCGAGAAGAA GAGGCGAAGA AGGCGAGCAG TAAGGGACAG CAACCAACAG TTGGACAGGC ACTTGAAAAA GAGGGGGAGC GCCTCGAAGA GGCTGAGGAA ACCAGGATAA GGCGTCTAAT GAGTAGAGGG ACAGGCGAAG GTATGATGGA AGTGGACGAG GGTGTTTTGC AAATATCATG GTGAGTGTGA TTATCGACCC CAATTTGGCA TCTAGCTGAT AGGTCCCAAT AGGTTCATCC TCAACGACTC CTATCGCACC GATGCCCCTC TGCTATATCC TCCTTATATA ATCGCTCTCT CGGCAATATA TATCGCCTTC TGCCTAACAT CCATGTCGAA TTCCTCTGCC CGCACCCGTG CGTCTTCCAC TCAGCGACCG GAACTCTTGC AGTCGGCTTC GATTAATGAA GGATTGAATT TGCTTCCACC GCCTAAAAAT GCCGCAGAAT TTCTGGCTGG GTTTCAAGTC AGTTTACCAA TGCTGTTTGG TTGCGTGCAA GAGATTATTG GACTGTATCC CGTATGGGAG GCATTTGAGC CAACGGTGAT GAGGAATTCC CAAGCACAAG CCAAAACGGG GAATGCAGCA GCACCTGTCC CGGCTGCAAC TGGGACAAAA ACCGGGCAGA ACAACGATTT AGTCCAAGAC AAAAAGGACA AGTTCGGTTT GGAGGAGGCT GAATCTCTGG TACGGAAAAT GATCGAGGAA AGGATGATAG ATTTAGGTCA TCCAGATAAT GCGGGTGTTG AAAAGGCTTC AGGTACCGGC CCCTCCAATG TAGCGGGTAA AAAGCGCGCA AGATAGCATA GATCATGTCT TGCTCATTTT CGAGTGCAC
|
Protein sequence | MSSNFYTSSH NRYWLLTRPS LLESRQTDLK YCTSRQLYCL FIFFSQLIQK LGKRLLLRQI PIATACVFFK RFYFKNSLCE TNPYLVLAAC IYVAAKVEET PVHIKSVVSE AKLVFHEHNI KMFPAETNKL GEMEFYLLED LDFHLVVFHP YRALLHLTGR ESADMGKFEK SRVQEDMEIR KKEGDAKKMR EEEAKKASSK GQQPTVGQAL EKEGERLEEA EETRIRRLMS RGTGEGMMEV DEGVLQISWF ILNDSYRTDA PLLYPPYIIA LSAIYIAFCL TSMSNSSART RASSTQRPEL LQSASINEGL NLLPPPKNAA EFLAGFQVSL PMLFGCVQEI IGLYPVWEAF EPTVMRNSQA QAKTGNAAAP VPAATGTKTG QNNDLVQDKK DKFGLEEAES LVRKMIEERM IDLGHPDNAG VEKASGTGPS NVAGKKRAR
|
| |