Gene Information |
Locus tag | CNA05940 |
Symbol | |
ID | 3253925 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1590270 |
End bp | 1593440 |
Gene Length | 3171 bp |
Protein Length | 866 aa |
Translation table | |
GC content | 47% |
IMG OID | 638252914 |
Product | general RNA polymerase II transcription factor, putative |
Protein accession | XP_566947 |
Protein GI | 58259069 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | [TIGR00603] DNA repair helicase rad25 |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.687804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTTTCTTT CCTCTCTTCT TTCTTTGTTC TTCATTTATC GCCTTTTCTC TTCTCCGAAA TGAGCGAACC ATCTTCTCCA GCTTCATCCC TCGATTTCTT TGAGTCGGAC GCTTCAGCAG ACTCCGACTA CGATGAAGCA CCACGCCGCA CTCGGAAGCA GCCTTCCAAG AAATATGGAG CCCGGCACGG TACATCCACT CCCACAGCTT CAGGCTCAGG CTCTGGTACA AAAATTAAGA TAAACTTGTC CTCTCTCCAA CGACGGGCAG TGGAGGGCAA TACCGCTGTT GAGCAGGAAG AAGAAGGAGA CGAGGATGAA GAAGGGTATT TTGATGGTTT AATTGGTAAA CGAGGAGTGG ATCTTTCAGG CCAAACGCTG AAAGGTGATC ATTCTTTGAG ACCGCTTTGG GTAGATGACC GCGGTAATAT GTGGGTACAT GAGTCCTTGG ATCCGACACA GCAAGTCATT GCTGACCATG TCTGGACAGT ATTGTTGAGG CCTTTGCTCC CTTTGCAAAG CAAGCGCAAG ATTTCCTGGT TGCCATTTCC GAGCCAGTAT CTCGGTGAGT CATCCGTATG ACTTAAGTTT ACCTGTTCAT CGTGCTGACA AAGGATATAG GCCCGCCCTT ATACATGAAT ACCGCATAAC CAAGCCTTCT TTACATTCTG CCATGTCAAT TGGTCTTGAG ACCAAGGTCA TCATTGAGGT CCTCTCTCGT CTGAGCAAGA CACCCCTTTC ACCGCGACTT GTCGCGCGAA TAGAAGAATG GACCGCATCG TTTGGTAAAG TCCGGCTTGT ATTGAAGGAC AACCGATACT TCCTTGAAAC GAGTGTCCCC GAATTCTTGC AAAAACTGAT GAACGATGAA GTCATCAAGG AATGCATGGT GCATCGTGAA GAGGAAACAG GTCCTACTGT ATTTGGAGCG GAGGAAGGTG CTCGTCCACG ACGAGACTTT GCCATTCCTG GGACAGAAGA AGCTCGAAGA CGAGAGAGGG GTGAAGATGC CGAACAGACT CGTGAGAATG ATGCTGTCTT GGGTGCAGTG ATTGGGATTA GCGAGGCGGA TGAGATGGAT GATGAAGATG ACAAAGTTCA TTCGTTTGAG GTGTCTGGCG AGCGGATGGA GGATGTTCGA AGACGGTGCA AGGATATCGA TCTTCCTGCA TTGGAAGAGT ACGATTTCAG AAATGACACG ATCAATCCCA ATCTCGATAT ACAGTTGAAG CCCATGACAG TCATCAGGCC GTATCAGGAG ATGAGTCTAG CCAAAATGTT TGGTAACGGT AGAGCCAGGT CAGGTATCAT TGTCTTACCT TGTGGAGCGG GAAAAACGCT GGTGGGCATA ACTGCGGCAT GTACGATCAA GAAGAGCGCG TTGGTGCTCT GTACTTCTGC GTTGGTTTGT TTTTCTTTTC TCGTTGCGCA AACCTCTGAC GTTATACGGG TAGTGTATCG GTAGCCCAAT GGAAGCAACA ATTCCTTCAC TTCTCCAACA TATCGGAACG ACAGATCTGT GCCTTCACCC AGGGCGAAAA AGAAATGTTC AGTACGTCGG CGGGCATCGT CATCTCAACC TACTCCATGA TTGCCAAAAC TGGCAAGCGA GCGCATGATG CGGAAAAGAT GATGCAGTTC CTTCGGTCCA GGGAATGGGG ATTTTTACTG TTGGATGAAG TGCATGTGAC TCCGGCGGAT ATGTTCAGAA AATGTATCAA TAATTTCAAA GTGCATGCCA AGTTGGGTCT CACTGGTGAG TCTTTGCGGT TCAGTGGATT GACATTGATA TGGAGACTAA 
CGTGAAAAAA TAGCAACGCT GGTAAGGGAG GATGATAGGA TTGGGGATTT GGGATACTTG ATTGGTCCAA AGTTGTACGA AGCCAATTGG ATGGATCTCG CTAAAAATGG CCATATTGCC ACTGTCCAGG TATGTTTCGT TGCCCAATTT TTTTTTGGTC TATACACTAC TGAGCAGGAC GTTTTTTTTT AGTGTGCCGA AGTTTGGTGC CCCATGACTC CAGAATTTTA TCGCGAATAT TTACGGAATC CTTCTCGCAA ACGCATCCTT TTGCACGCCA TGAACCCGAA CAAGATTCAA GCATGTCAGT TCTTGATCAA CTATCATGAG AGCCGAGGCG ACAAGGTGAT CGTATTTTCC GACAATGTGT TTGCACTCGA GGTGAGTTGT TTTGGGCTCG AGAAAATCTT GGTTTGTACT GACGGGACGA ATAGGCGTAC GCCAAAAAGT TGGGCAAGTC TTTTATTCAC GGCGGGACGC CTGAAGGCGA ACGGTTGCGG ATTCTTTCGC GATTCCAACA CGACCCCCAG CTGAACACCA TCTTCCTCTC CAAGGTCGGT GATACTTCTA TCGACTTGCC TGAAGCTACT TGCTTGATCC AAATATCTTC CCATTTTGGT TCTCGACGAC AAGAAGCTCA GCGATTGGGT AGGATTCTGA GGGCAAAGCG AAGAAATGAC GAGGGTTTCA ACGCCTTTTT TTATTCGCTT GTTTCCAAAG ATACTCAGGA GATGTTCTAT TCCTCGAAGC GGCAAGGATT CTTGATTGAC CAAGGTTACG CGTTCAAAGT GATCACCGAA CTTCACGGTC TTCATAGCAT GCCCAACCTC GTTTTCGCTT CCAAGGACGA ACAGCTGTCA TTGCTAGAGT CGGTACTGAA CCAGGGTGAT GCCGCGGCAG AGACGGCGGA CCATTATATG AGGTTGAATG GGGGTAAGCA TCTCAAGAGG ATTGCGGGCG CTCAGCCGAG TACGAGTGGG ACGACGGTGC AGAGGTTCAT GGCACCGTTG GAGCATTTGA GTGGAGGGCA GAATATCAGT TATAGAGAAC AGAACAAGAG TGTCAAGTGG GTTTTTTTTT CTTTTTCTTT TTTTTCCCCA AAATCTATAT GATTACCGGA AAAGTGAGGA GCATATACTA ATCAAGGATT TCATCTTTCC AGCAAGGAGT TATCGAGAGA AGTACGGCAG AATAAGAGGG CTGGGGGATC GAGTAGTGGG AAAGATAGCC ATTCGATTTT CAAAAAGAGA AAAACAGAAT TGGCAGCGGC CAAGAAGCAG CGTGAGACGG GATTCTAAGC AGTAGGCATA CTTATATTAA AAAAAGCGGT ATCCTTGTAG CAAGTAAAAT ACATGCAAAC TCAATCTGTT G
|
Protein sequence | MSEPSSPASS LDFFESDASA DSDYDEAPRR TRKQPSKKYG ARHGTSTPTA SGSGSGTKIK INLSSLQRRA VEGNTAVEQE EEGDEDEEGY FDGLIGKRGV DLSGQTLKGD HSLRPLWVDD RGNIIVEAFA PFAKQAQDFL VAISEPVSRP ALIHEYRITK PSLHSAMSIG LETKVIIEVL SRLSKTPLSP RLVARIEEWT ASFGKVRLVL KDNRYFLETS VPEFLQKLMN DEVIKECMVH REEETGPTVF GAEEGARPRR DFAIPGTEEA RRRERGEDAE QTRENDAVLG AVIGISEADE MDDEDDKVHS FEVSGERMED VRRRCKDIDL PALEEYDFRN DTINPNLDIQ LKPMTVIRPY QEMSLAKMFG NGRARSGIIV LPCGAGKTLV GITAACTIKK SALVLCTSAV SVAQWKQQFL HFSNISERQI CAFTQGEKEM FSTSAGIVIS TYSMIAKTGK RAHDAEKMMQ FLRSREWGFL LLDEVHVTPA DMFRKCINNF KVHAKLGLTA TLVREDDRIG DLGYLIGPKL YEANWMDLAK NGHIATVQCA EVWCPMTPEF YREYLRNPSR KRILLHAMNP NKIQACQFLI NYHESRGDKV IVFSDNVFAL EAYAKKLGKS FIHGGTPEGE RLRILSRFQH DPQLNTIFLS KVGDTSIDLP EATCLIQISS HFGSRRQEAQ RLGRILRAKR RNDEGFNAFF YSLVSKDTQE MFYSSKRQGF LIDQGYAFKV ITELHGLHSM PNLVFASKDE QLSLLESVLN QGDAAAETAD HYMRLNGGKH LKRIAGAQPS TSGTTVQRFM APLEHLSGGQ NISYREQNKS VNKELSREVR QNKRAGGSSS GKDSHSIFKK RKTELAAAKK QRETGF
|