Gene CNA05940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA05940 
Symbol 
ID3253925 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1590270 
End bp1593440 
Gene Length3171 bp 
Protein Length866 aa 
Translation table 
GC content47% 
IMG OID638252914 
Productgeneral RNA polymerase II transcription factor, putative 
Protein accessionXP_566947 
Protein GI58259069 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID[TIGR00603] DNA repair helicase rad25 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.687804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCTTTCTTT CCTCTCTTCT TTCTTTGTTC TTCATTTATC GCCTTTTCTC TTCTCCGAAA 
TGAGCGAACC ATCTTCTCCA GCTTCATCCC TCGATTTCTT TGAGTCGGAC GCTTCAGCAG
ACTCCGACTA CGATGAAGCA CCACGCCGCA CTCGGAAGCA GCCTTCCAAG AAATATGGAG
CCCGGCACGG TACATCCACT CCCACAGCTT CAGGCTCAGG CTCTGGTACA AAAATTAAGA
TAAACTTGTC CTCTCTCCAA CGACGGGCAG TGGAGGGCAA TACCGCTGTT GAGCAGGAAG
AAGAAGGAGA CGAGGATGAA GAAGGGTATT TTGATGGTTT AATTGGTAAA CGAGGAGTGG
ATCTTTCAGG CCAAACGCTG AAAGGTGATC ATTCTTTGAG ACCGCTTTGG GTAGATGACC
GCGGTAATAT GTGGGTACAT GAGTCCTTGG ATCCGACACA GCAAGTCATT GCTGACCATG
TCTGGACAGT ATTGTTGAGG CCTTTGCTCC CTTTGCAAAG CAAGCGCAAG ATTTCCTGGT
TGCCATTTCC GAGCCAGTAT CTCGGTGAGT CATCCGTATG ACTTAAGTTT ACCTGTTCAT
CGTGCTGACA AAGGATATAG GCCCGCCCTT ATACATGAAT ACCGCATAAC CAAGCCTTCT
TTACATTCTG CCATGTCAAT TGGTCTTGAG ACCAAGGTCA TCATTGAGGT CCTCTCTCGT
CTGAGCAAGA CACCCCTTTC ACCGCGACTT GTCGCGCGAA TAGAAGAATG GACCGCATCG
TTTGGTAAAG TCCGGCTTGT ATTGAAGGAC AACCGATACT TCCTTGAAAC GAGTGTCCCC
GAATTCTTGC AAAAACTGAT GAACGATGAA GTCATCAAGG AATGCATGGT GCATCGTGAA
GAGGAAACAG GTCCTACTGT ATTTGGAGCG GAGGAAGGTG CTCGTCCACG ACGAGACTTT
GCCATTCCTG GGACAGAAGA AGCTCGAAGA CGAGAGAGGG GTGAAGATGC CGAACAGACT
CGTGAGAATG ATGCTGTCTT GGGTGCAGTG ATTGGGATTA GCGAGGCGGA TGAGATGGAT
GATGAAGATG ACAAAGTTCA TTCGTTTGAG GTGTCTGGCG AGCGGATGGA GGATGTTCGA
AGACGGTGCA AGGATATCGA TCTTCCTGCA TTGGAAGAGT ACGATTTCAG AAATGACACG
ATCAATCCCA ATCTCGATAT ACAGTTGAAG CCCATGACAG TCATCAGGCC GTATCAGGAG
ATGAGTCTAG CCAAAATGTT TGGTAACGGT AGAGCCAGGT CAGGTATCAT TGTCTTACCT
TGTGGAGCGG GAAAAACGCT GGTGGGCATA ACTGCGGCAT GTACGATCAA GAAGAGCGCG
TTGGTGCTCT GTACTTCTGC GTTGGTTTGT TTTTCTTTTC TCGTTGCGCA AACCTCTGAC
GTTATACGGG TAGTGTATCG GTAGCCCAAT GGAAGCAACA ATTCCTTCAC TTCTCCAACA
TATCGGAACG ACAGATCTGT GCCTTCACCC AGGGCGAAAA AGAAATGTTC AGTACGTCGG
CGGGCATCGT CATCTCAACC TACTCCATGA TTGCCAAAAC TGGCAAGCGA GCGCATGATG
CGGAAAAGAT GATGCAGTTC CTTCGGTCCA GGGAATGGGG ATTTTTACTG TTGGATGAAG
TGCATGTGAC TCCGGCGGAT ATGTTCAGAA AATGTATCAA TAATTTCAAA GTGCATGCCA
AGTTGGGTCT CACTGGTGAG TCTTTGCGGT TCAGTGGATT GACATTGATA TGGAGACTAA
CGTGAAAAAA TAGCAACGCT GGTAAGGGAG GATGATAGGA TTGGGGATTT GGGATACTTG
ATTGGTCCAA AGTTGTACGA AGCCAATTGG ATGGATCTCG CTAAAAATGG CCATATTGCC
ACTGTCCAGG TATGTTTCGT TGCCCAATTT TTTTTTGGTC TATACACTAC TGAGCAGGAC
GTTTTTTTTT AGTGTGCCGA AGTTTGGTGC CCCATGACTC CAGAATTTTA TCGCGAATAT
TTACGGAATC CTTCTCGCAA ACGCATCCTT TTGCACGCCA TGAACCCGAA CAAGATTCAA
GCATGTCAGT TCTTGATCAA CTATCATGAG AGCCGAGGCG ACAAGGTGAT CGTATTTTCC
GACAATGTGT TTGCACTCGA GGTGAGTTGT TTTGGGCTCG AGAAAATCTT GGTTTGTACT
GACGGGACGA ATAGGCGTAC GCCAAAAAGT TGGGCAAGTC TTTTATTCAC GGCGGGACGC
CTGAAGGCGA ACGGTTGCGG ATTCTTTCGC GATTCCAACA CGACCCCCAG CTGAACACCA
TCTTCCTCTC CAAGGTCGGT GATACTTCTA TCGACTTGCC TGAAGCTACT TGCTTGATCC
AAATATCTTC CCATTTTGGT TCTCGACGAC AAGAAGCTCA GCGATTGGGT AGGATTCTGA
GGGCAAAGCG AAGAAATGAC GAGGGTTTCA ACGCCTTTTT TTATTCGCTT GTTTCCAAAG
ATACTCAGGA GATGTTCTAT TCCTCGAAGC GGCAAGGATT CTTGATTGAC CAAGGTTACG
CGTTCAAAGT GATCACCGAA CTTCACGGTC TTCATAGCAT GCCCAACCTC GTTTTCGCTT
CCAAGGACGA ACAGCTGTCA TTGCTAGAGT CGGTACTGAA CCAGGGTGAT GCCGCGGCAG
AGACGGCGGA CCATTATATG AGGTTGAATG GGGGTAAGCA TCTCAAGAGG ATTGCGGGCG
CTCAGCCGAG TACGAGTGGG ACGACGGTGC AGAGGTTCAT GGCACCGTTG GAGCATTTGA
GTGGAGGGCA GAATATCAGT TATAGAGAAC AGAACAAGAG TGTCAAGTGG GTTTTTTTTT
CTTTTTCTTT TTTTTCCCCA AAATCTATAT GATTACCGGA AAAGTGAGGA GCATATACTA
ATCAAGGATT TCATCTTTCC AGCAAGGAGT TATCGAGAGA AGTACGGCAG AATAAGAGGG
CTGGGGGATC GAGTAGTGGG AAAGATAGCC ATTCGATTTT CAAAAAGAGA AAAACAGAAT
TGGCAGCGGC CAAGAAGCAG CGTGAGACGG GATTCTAAGC AGTAGGCATA CTTATATTAA
AAAAAGCGGT ATCCTTGTAG CAAGTAAAAT ACATGCAAAC TCAATCTGTT G
 
Protein sequence
MSEPSSPASS LDFFESDASA DSDYDEAPRR TRKQPSKKYG ARHGTSTPTA SGSGSGTKIK 
INLSSLQRRA VEGNTAVEQE EEGDEDEEGY FDGLIGKRGV DLSGQTLKGD HSLRPLWVDD
RGNIIVEAFA PFAKQAQDFL VAISEPVSRP ALIHEYRITK PSLHSAMSIG LETKVIIEVL
SRLSKTPLSP RLVARIEEWT ASFGKVRLVL KDNRYFLETS VPEFLQKLMN DEVIKECMVH
REEETGPTVF GAEEGARPRR DFAIPGTEEA RRRERGEDAE QTRENDAVLG AVIGISEADE
MDDEDDKVHS FEVSGERMED VRRRCKDIDL PALEEYDFRN DTINPNLDIQ LKPMTVIRPY
QEMSLAKMFG NGRARSGIIV LPCGAGKTLV GITAACTIKK SALVLCTSAV SVAQWKQQFL
HFSNISERQI CAFTQGEKEM FSTSAGIVIS TYSMIAKTGK RAHDAEKMMQ FLRSREWGFL
LLDEVHVTPA DMFRKCINNF KVHAKLGLTA TLVREDDRIG DLGYLIGPKL YEANWMDLAK
NGHIATVQCA EVWCPMTPEF YREYLRNPSR KRILLHAMNP NKIQACQFLI NYHESRGDKV
IVFSDNVFAL EAYAKKLGKS FIHGGTPEGE RLRILSRFQH DPQLNTIFLS KVGDTSIDLP
EATCLIQISS HFGSRRQEAQ RLGRILRAKR RNDEGFNAFF YSLVSKDTQE MFYSSKRQGF
LIDQGYAFKV ITELHGLHSM PNLVFASKDE QLSLLESVLN QGDAAAETAD HYMRLNGGKH
LKRIAGAQPS TSGTTVQRFM APLEHLSGGQ NISYREQNKS VNKELSREVR QNKRAGGSSS
GKDSHSIFKK RKTELAAAKK QRETGF