Gene CNH00870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00870 
Symbol 
ID3259048 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp940024 
End bp943554 
Gene Length3531 bp 
Protein Length650 aa 
Translation table 
GC content52% 
IMG OID638258395 
ProductRNA polymerase II transcription factor, putative 
Protein accessionXP_572280 
Protein GI58270248 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCCCCCGTT TCACCCACAC GTGGCCTTCC TAGCGCCCTA CTCCCGTGAA CAAATAAAAC 
TTCCACAAGG TGAGCCACGC ATCCTGCCTC TCTGTAGTCC GCCATGCTGC TCGCAGTGGT
CTACCTAAAA AGATACCACT TTTCGTCATC TTTGACGCGT CGCGAGGCAT CTTAATGACC
TTGTTCGCCA CCCTTCTTCT CAGTGTGGAC TTGAGCGAAA ATACTTGCCC TTGACTTGTC
GAAAGTAATC CATTCGGGCA ATTCATTCAT GGGCCTTGCT TTGTATGTCG CCCCTGAGGT
ATTCCCTCGT CTGCCATCTT ACTTTTATCT CCAGGGCAAT CAGATAGCCC TGGGACCTGC
AAATATGACA AACGTGTGGG ATGCTGATTC TGCTTGTTTT TGCAGGCCGT CCACGACAGC
CACTACTCGG GGAAGAGTCA AGTATATTGG AATTTGTCGG CCCAACCAAG TGAGTACATA
CGTGTCTATG ATCGTGCATC GGACTTCTTT GCGACTCGGG TGACGTTCTC CGTCGGGGAT
GGAAGAGGGT CTCTGCTTTA CATATGTTCG CGAAGACACC TTGCTGTGCT CGAGAAGGAT
CTCGGTATTT CCGCGCCCAG TTTGCACTTC TTTGCCCCAT CCTCTTCCCT TCTTCCATCT
GGCGCTCGGG TTATAGGTGT CTTACGAACA CTTATCTTTC GCCGGAGTTA CGAAGACAGC
TGTGGTGAGC ATTGCAAAGG TACCATGAAA GGATGGCTGG CGTGAGCCCG AGCATCTGGC
ACCGCAGCCG GCGCAGAATA TTCTACCATG GGGTCGATGT CGGGAGTTGT GGGTGGATTA
GTAAATATGA TCTCGGCATT TTCTTTCCGC TTCATCTGCC AGACCACAAT GTAACGGTTG
CTGACGCCAT TCAACAGGCT TCGTCAGCGC GACTCACAGT ACAGGTCACG AAGCGTAGAT
CAGGTCCGTC GTTATCGCTT TGCGGATAGA TGAAAGACGA CGAAACATCC TTCTTTGTCT
TCCTTTGTCC TTCAGTGATT TCCCACACCA CACCGTAGTT CCTGGAACGC CCGCGCCACC
TCCTTACATT CACATCTCAC GTACAACATG AGCTTCGCAG CTCCCGACGA CCGAGCATAT
TATAACTACT CCCGTGCGCC AGGCTCCGCT CACGGTCCCG GTCACGAATC TACTTCGCCT
GATCAGAGAC TTCCTTCATC TCATGGTTAC AACAAGGGTG CATCCGCACC TAACGAATCA
CCTAATCAAG TATACCATCC CGAAATGGCA GGTCCTCCTG GTTCTTCCCA TAGTTATCCT
CCTCAACCTA TGACGCCACT CACTGGCTCT AGCGCTTATC CACCCCAACA TCCTCCTTCT
GGTAGTTACT ATCTGCCCCC GCAGCAACAG CAACAACAAC AATCGGAACA ACCTATTCCT
TCCCCTCCTA GCAGCTCTAA CCGACCTCCT TCAGCTACTG GTTATACCCC CGATGGGCAG
CCTATCATCC CCGTAGGTGT GTCTGGTGGC AAAATGTTTA GGTGCAGAGG TTACGGTGAT
TGTGACAAGG TCTTTACAAG AAGTGAGCAT TTGGCAAGGC ATGTGAGGTG AGTGGCTTTT
AGTGTTTGCC GATAACGATT GTACCTTGAC TGATAAGTGG TTTCGCGTCC TTTGTTACCC
CTGGTACCTT AATTACAAAT CCAGAAAACA TACCGGTGAA CGTCCTTTCC CTTGTCACTG
TGGCAAAGCA TTCTCCAGGC TCGACAACCT ACGTCAGCAC GCTGCTACCG TCCACGCTGA
ACAGGCTCAG CTGAATGAAA CCATGCTTGC TTCTCTCGCG CCTATCCACG CTGCCCTTTC
TCAGCGTGCT AGCAGAGAAC AACGACGAAG AGGTGAAGTT GTCGAGGTGC CTAAGGGTGC
AGTCGAACGA CGTCGCGAGA CCCGCAAAGC CCAAGCCGCT CAAGCCGCCG CCGCTGCCCA
AGCTGCTGCT ACAAATGGCC ACAGCCAGCA AAACTCTCCT TACGCTCAGT ACCACGAATC
TCAGTGGAAT GCCCCCCCTC ATCCTCGACC CAGGACTAAT GGCGGGTATG ATTACCCTTA
TGTGCCTGAG CATAGTCTTA ATGACGATGC TGGACCATCA AGGAGACCCA GTTCGTCTGC
GGGCTATGGC TACCAACAGG GCTACTATGA CTCTGCTCGA CCTCCTACTG CCCCTGGGTC
TGGTTCTTCT GGCGAGTCGA TGTCTGGGTT GCCTTACCCC TACCGACCCA TGTCTGCTTC
AGGCCGTGAA CTCCCCGTTC CTGCGCATTA TTCCGAGTCT GAGCCTCCTG CTCCTGCCCA
CGGCCCTCCT CCCCAGTCAC CAATGTACGG CAACGTCCCT CCTGCTCAAC AACAGCCTCC
CAACTGGTCT TCTCCTCCGG GACACGGTAC CTACCCTCCT CACGACACCG CTGCATACCC
TCCTCCTCCC GAAGGCTATT ACCCTCCTGC ACATGCTGCT CACGGCTCAT ATCCTCCTAG
GGAAGACGTG TACGAGTACC ATCCTCCAGG TTGGCAGGGC CAGTACCCTC CTGCGGCGGC
TAATGGCGGA CCCTACGCTC AGGGATACGG CACCGGTGCA CCCCCAACTG CTCCTCCACC
TCACGAGTCT CCCTTCCAAT ACAACGTTCC CAATCCAGGC GCTGAGGGTT ACCCATATCA
AAACTACGAC TCCCGCAAGC GTCGCGCGGA AGATGACTTT GGCAAGGATG ACCGTAAACA
CCCCCGGCCA TCATCATCTT CTAACTCTCA ACTTCCCGAT AGTTCCACCC CTGCTCACGC
GAACGGCGCG CACGGTGCTC CTCATCCTCA CCCTCATCCC GGTGCGGCAC CAGGTAATGG
AGCCATGGAC CCTCCACGCC CGCATGATCC CAACTGGTTG CCTGCCACTA CGGAGAGAAG
GGGTAGTTTG GCGATCTCCG CGTTGTTAGG TAGTCCTCCC AAGGCGGTGA GGAGTAGACC
CTCGACAGCT GATGGCGGTA CGGCGGGTGC GCACGGGTAC GAAAGTTATC ATTATGACCC
CCATGGTCAG GTATCCGATG ATCAACAGAC AGATGGGGAC AAGGAGAGGG AAGAGAAGGG
GGAAGTGAAG AAACATGGTG ATAAGAAGAA GCTGTAAGGC TGAAAGGATG TGAGATTCAG
GCAGTGTTGT GGATGTTTTT CTTTTTTTTT CCATATATTT CTCTTCGAGT GATACCCACT
TGCGGCTTTT CGAAATGAGT GATTGATGAC CCCTAATTTT ACGAAGGAAA GGATTTATTG
GCTAATTGTG CGAAGTAATT TAGTTATCTA GCTATGCTAT ACGATTTACG GGCTTGGAGG
CAGATTAGAC TAAGAGCTCT CTCTCTTTTA TTTTACCTTC CTAACCCCTT CATTAATCTA
TTTTTCGTAC TTCAATTTAT AGCTTTCTGG TGTTTAGATA CACGATGTGA TATGAATTCT
CTGATTTTTA TTACTGTTTT TGATACGATT GATGCATTTT GCCTCTTATT T
 
Protein sequence
MSFAAPDDRA YYNYSRAPGS AHGPGHESTS PDQRLPSSHG YNKGASAPNE SPNQVYHPEM 
AGPPGSSHSY PPQPMTPLTG SSAYPPQHPP SGSYYLPPQQ QQQQQSEQPI PSPPSSSNRP
PSATGYTPDG QPIIPVGVSG GKMFRCRGYG DCDKVFTRSE HLARHVRKHT GERPFPCHCG
KAFSRLDNLR QHAATVHAEQ AQLNETMLAS LAPIHAALSQ RASREQRRRG EVVEVPKGAV
ERRRETRKAQ AAQAAAAAQA AATNGHSQQN SPYAQYHESQ WNAPPHPRPR TNGGYDYPYV
PEHSLNDDAG PSRRPSSSAG YGYQQGYYDS ARPPTAPGSG SSGESMSGLP YPYRPMSASG
RELPVPAHYS ESEPPAPAHG PPPQSPMYGN VPPAQQQPPN WSSPPGHGTY PPHDTAAYPP
PPEGYYPPAH AAHGSYPPRE DVYEYHPPGW QGQYPPAAAN GGPYAQGYGT GAPPTAPPPH
ESPFQYNVPN PGAEGYPYQN YDSRKRRAED DFGKDDRKHP RPSSSSNSQL PDSSTPAHAN
GAHGAPHPHP HPGAAPGNGA MDPPRPHDPN WLPATTERRG SLAISALLGS PPKAVRSRPS
TADGGTAGAH GYESYHYDPH GQVSDDQQTD GDKEREEKGE VKKHGDKKKL