Gene CNA01820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA01820 
Symbol 
ID3253806 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp486751 
End bp491945 
Gene Length5195 bp 
Protein Length1290 aa 
Translation table 
GC content52% 
IMG OID638252515 
Producttranscriptional activator, putative 
Protein accessionXP_566547 
Protein GI58258269 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.203708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCACTCCTCC CCCATCCACG TGTCCCACAA TCTGCTCCCG CTGCCACTAA CCCACCTACA 
CGCCGCCGTC CCGCCAACCG ACCTTCCGAC GTCGAGTCAA AACAATACTC GAGGCACTCC
ATCGGTCAAC AATTTTCAGC CACCCGCAAC CAGCCCGCAA CTGCAGAGAC TTCAGTCCGT
TACCGCGCCC ATAGACTGCA GTAGAGGAGA CCTTGCAAGC TCCCCTCCTT CCTATCAGGT
ATGTGCTATT CCCAGCAGAT CCATTTTACA TTTTATACGC GGAAACACGA CAGGAGCGGG
AGGATATTCG TCAGCTCATC GTGAAGGCGG CGCTGGACGG GTGGGCTTCT CGTCAATTCC
ACAAGAAAGG CCAATAAAGG CTGTTGCCGA TTGCCAATGA AAAGTTTGCC TTCTATTGAA
ACGGTGTCTG ACCTCCCTCT TTTGGCTTTC GTGCCGAGTC TTCTTGCTTT CTGTTCGCAT
CTCTTGTCAT CCAAGCTTTG TTGTGGCAGG CATCATTCGG GGCCTCCAGC CTTTCTTCTC
GTCCTTTGAT CTGTGGTGAA TGCAGGCTTA CGATGGACCG CGCGGTCTTC CCGCTACTTC
CAATCACCAC AGCCTCTAAC AACACAAAAC CGACACCATT TCCACCCCGT CTCCACCCGT
TATAGGTCCC TCTCCTAGGT CTACCACATC TTCAAAATCA TCTCCTGTAT CCACTACTTC
AACTAGCACT GGTGGGACTT CTTCTAGGCT TCGTCCACTT CTACCGAAAC CAACAGCGCC
TCGCGAGCCG TCTACTCGAC CCGTTAGCAC TGCAACCAAA CAGCAGCTAG CGGCGGCTAG
AATGGACAAG CTTCCATGGC GCACTACATC GGGGGGGAAA AAGGGACCAT TCACTGTGGA
TGCTGGAGGA GGGCAAGGTG TCTACAATGG GAAAGTAAAT AGTGCTCGGG TCCCGGAACG
ACCCCCTTCT GCCGCGTCTT CCCGCCTCTC CAATCCTCCA ACACCTGCCA TGTCGCCTTC
CATTCCAATC AATCGACACA ACTTTCCTTA CCCTCCACAG GGATACCAAT CATTCTCTCA
ACCATCAAGT ATAGGACACA GCGGCAGTGG TTCTTTCCAT GGTAATGACC ATTCGTCGAT
TGCCACCACC CCTACGCCAA TGACTCCTAT TGATATTGTT GGAGTGAGTT CTAGCGTCCC
ACGAAACAGC TTTTCCCAGG GTGTTCAGCC CTCTTCACAC TCTCAACTAT CCTCTTCTAG
ATCAAAGCGT TCTGTTCCCA TGATATCCAC TACTCCCGTA GAACCTCCTC CTTCCTCTTG
GGGTGCATCT TTTCCTGATC AACAGGGTGA AAATGATGCG AATAGCAAGC CATTCGACTG
GACGTCTCTT CCGAACCCCG CGAATTGGTT TTCTCCGTTG GGCACGGTTC AAAAGACCAC
CTGTAATGAT GATCCCATCG ATCCAGCTGT ATTTGCCAGC CTTGCTGATC TGGTCGAGCG
AAATCAGGGT AAAATGAACG ACAGTGCTTC CATCGATTTT ATGGGCGCTT TGAACGCAAA
TACGTCTCCG AAAGCTGCTA GGAGCACTGC CCATTCAACT TCACATCCTG GAGGTACAAG
CCTTTTGAGT CGTCGCTTGC AGCACCAACA GCAGACTAGC TCTGATGGTC AGAACATACA
TTCAAGCAGT CAATCTTCGG CTAACTCTCC TGGCATCTTG CACAATTCCG GGGGGCTATC
TGGATTCCCT GGTCCTCAGC AGCAGTACAA TCATGTCACC TTTGCACAGC CTGGTAAGGT
GAGTAAAGGT GCATCTAACC AACCCTTGAC GCCTTGGCCT TTATCCGAGA GGGCAATGGG
TTCGAGCGAG ACCCCCGTGA CAACACCCGG CGGTAGTGAT TTCGGAGTCA ACAGCCCCTT
TGAGATGGGT ATTACCGGCT CGTGGCAGCA GGAATCGATG GCCAAACATG AGTTACAGGG
CTCAAACAGT AGTTCACGAT ATCCGCCCAT TGCTCCACGT CGGCGTGAAG CACCTCAGCA
TCCTGCTCCT GTGTTCCAGA CACACCGGGG AAGTGTACCA TCCTCCGAGC ATGCCTCGAG
AGCCGGCTCT GAAGCTCCTC ATGACGGGCC CATGCCCGGC GGCGTTTCAC TCAATGGTTT
ACCTCCTTTG CCGAACGGAT TGTCTCTTGA GCATCTTGCC CAGTATGGAG CTGCAGGTTT
AGAAATGGCT TTGAGAGTGG GCATGGGTAT TGGTATGGGC TTGAGTCAGC AAACGCAGCA
AACGAAGCAA ACTGTTGATG TTGCATCCCC GCCTTGGCCA CTTACCAAAA GTGTTCCTAC
GCCATCCTTT TCTCAAAATC CGTCGTCCCC TGAAGCATCG TCTCGTAAGG GACGAAAAGA
CTCAAACATC GTCAGCGACA TCCTTCAAGA CGACTTCCTC ACTGCTCGCG TTCCTAGCAC
TCCTCTTATG ACCCCACCAC TCAATGGTTT TGGGTCTTTC CCAGTTACTC GTCGACCATC
TCAGAGCGAT GCCACAAGCC CGTTACCCGA GGTTGGTCCT CCGGAGCAGG TGGCTGAGAA
GGATCCTCTT GCTGCGCAAG TCTGGAAAGC ATACGCAAGA GCCAGAGATA CCTTGCCCAA
CGGGCAGAGG ATGGAGAATC TGACCTGGAG AATGATGCAT CTTACACTGA AGAAGAAGGA
AGAGGAGCAA GCGGCAAAGG AGAAGGAGGA AAGAGAGAAA GAAGATAAGG AGGCTGCTGA
GAAGGAAGCA GCAGAAGCAG CAGCAGCAGC AACAGCAGCG GCAGCAACAG CAGCAGCAGC
CGCTGCAGCC GCAGAATTGC CTCCTGTGGA GGAAAGACGA GGAAGAACAA AGGGGAAGTC
AAGAATTGTT GGTTTTGCTG GAGCAACAAG CTCAAATTCC CAATCGCCAA AGTGAGTTAC
CGTGTCAACA TCATGATGTC ATTTACTGAT TATTTTTCCT AGCGGCATGG ATATCGATTG
GCGAGCAGCA AGCCGATCCC GTTCTCGCAT ACCCATGGAT ATTGATTGGC GTGCTTCTTC
CAGGTCACGC TCCCGTTCTG CGGCTCCATT CCGCAACCCC TTCAGTGAAG CTCACGCTCA
TCACCTTCTT GCTGCAGGCG GAACACCTAT CGCAGAGATG GGCCAATACA TGGCCGGACA
TGGTAGTCTG AACATTCACG CCGCTAACGC CCACTCGACA AGCCATCATG GCAACCAGAA
TCAATATCAC CATGCTTCAT CTCTCCCTGG TCCTTCTAGC ATGATGCTGC AAAGCTTGAG
TCAGTTACCC GAGAAAGAAG GAGAGCAGGA TGAAGTGCAA CAAGCTGTCG AGCACATGAA
TGGGGCTGAC TTGTACCCCG CCTCTGCACC TCAGAACAGG AGCGCGCTGG AGCATCTTCA
GATGTCTCTC GCGAGTGGGC TTAATCCTTC CGAAGAAGCT ACGTCTAACT TACCTGGAAT
CAATGGTCCA GGTCTCTATA CACATAGCCA AGAAAACTTC CATCCTCATT ATGGGTTTCT
TCCTCGTCGT GTGCGAAAGA CATCTTTCGA CCACACTGTG AAATTGTTGG AAGAGGGTGA
ACCATCCAGC TCTCCTCAAT TCATGTCCAA TCCTCGCAAG CGCCATGCCG AAGCCTCTCC
TCAAGGGGGC GCGAATAGTC CTCTTCCAGA AGGTGACAGC GGCTTCCCCA CATCAAACTT
CACCTTCAGC TTCCCTCAAT CCTACGAGAA TTTTTTTGAC CTAGCTGCCG CAAGTGCAAC
TCCTTCAGGG ACCCAAGAAA ACAATGATGT GAACCATGAA GGTGATGACG ACCTTGCAGA
CTTGACAGAC TGGGCAAGTC ATCCTGTTAC AGCGGATACG TCAGCGTTTG GTTCTCCTTC
GGCGTTTGGT CACATTGAAC CCGGAATGTC CCTTCCTTCG ATGCCTCAAG CTACAGGCGA
CAATCCTTTT GATTTTCAGC AGCTCATGCA CCTCTACCTC AATGCAAACT CATCTGCGAG
CCCGTTTACC CACATCAATC CCTCTCAAGT TCTTGGGGCC GTGCCCGGTC AGGCCGCCAA
CGAATTCTCT CCCAGTGCCG TCTCTCCCCA AAGCGGTGCT CCTACGCCCA GTAACAACAG
CTCAGGTAAC AACATCCGGC CATTACCAAA AGCTGTAGGC GGTAAGGCTG TAGACAACAG
ACAGATGCCG CCACCGAACA GATCTAGCAG TACTCCTAAT CTCGCTGCTC TGAGGATGTC
TTCTCAAAGC GAATCATCAA AGCACGGTCG TACTGCCTCA ACTAATGCAT CAGGAAACGC
TGGGTCTGGT TCCAGTAAAG GGAACAAGGG ATCAAGTGGA AATAATAAGT CCCGACCAGG
TACGCCGACG AGTGAAAATG AAGGCGGGCC AGGCTCTATC ATGCCTAGTG GTGAGAATCC
CACAATGTGT ACAAATTGTC AGACGACAAA CACTCCTTTA TGGAGACGAG ATCCAGATGG
GCAGCCGTTG TGCAATGCGT GTGGACTTTT CTACGTGAGT ACTGTCTTTC AGCTATATAA
TTAGAGACGT TCTAATGACC TATCAGAAAC TGCACGGTGT CGTTCGACCC TTGTCGCTCA
AGACGGATGT TATTAAGAAA AGGTGCGTTT GCAGTCCGGC GAGAATTGCA ATATCTAATC
AGAGTGTGTA GAAACCGAGC GGGTCCTGGA CCGAAAGAAA GCAACCCTTC CCGCAAGAAC
AGTGTCGCTT CGTCTAAAAA TGTGTCTGTG CGGTCAAAGC CCAGCTCGCC AACAGTTGCG
TCGTCTGCAA ATTCCGGTGG CGGAAGCAAG AAGGCAAGGC ATGCTTCTGA TGCTCCTGTT
GAATGAGCGA ATGAATAGGA GCAGTGGGAG CGGAGATTCG GGGAGAGGCG GTTTGTCGCT
CGCGTAGCGG GCTCTGATTA CGCCTGACGT TCCACTTTTA TGTTTTTCAC TTCCATATTA
TTCGGTTACC ATTTATCCTA CCGTTTTCAA ACTTATACCG TTGAGCTACT CTCGGGCCCG
TGCATCCAAC GGTACCATGT ATGAATTCTT AAAACCTCTG GGTCTCTAAA AAGAGGCTCG
GTTCTTAAAT GTTAGATACG ATTACAACTG TTTATTGGTT TCTTTCGGTT GAGTTTTGGG
GAGATTATTT AGGACGCTCG TTGATCATCA TCGAA
 
Protein sequence
MDKLPWRTTS GGKKGPFTVD AGGGQGVYNG KVNSARVPER PPSAASSRLS NPPTPAMSPS 
IPINRHNFPY PPQGYQSFSQ PSSIGHSGSG SFHGNDHSSI ATTPTPMTPI DIVGVSSSVP
RNSFSQGVQP SSHSQLSSSR SKRSVPMIST TPVEPPPSSW GASFPDQQGE NDANSKPFDW
TSLPNPANWF SPLGTVQKTT CNDDPIDPAV FASLADLVER NQGKMNDSAS IDFMGALNAN
TSPKAARSTA HSTSHPGGTS LLSRRLQHQQ QTSSDGQNIH SSSQSSANSP GILHNSGGLS
GFPGPQQQYN HVTFAQPGKV SKGASNQPLT PWPLSERAMG SSETPVTTPG GSDFGVNSPF
EMGITGSWQQ ESMAKHELQG SNSSSRYPPI APRRREAPQH PAPVFQTHRG SVPSSEHASR
AGSEAPHDGP MPGGVSLNGL PPLPNGLSLE HLAQYGAAGL EMALRVGMGI GMGLSQQTQQ
TKQTVDVASP PWPLTKSVPT PSFSQNPSSP EASSRKGRKD SNIVSDILQD DFLTARVPST
PLMTPPLNGF GSFPVTRRPS QSDATSPLPE VGPPEQVAEK DPLAAQVWKA YARARDTLPN
GQRMENLTWR MMHLTLKKKE EEQAAKEKEE REKEDKEAAE KEAAEAAAAA TAAAATAAAA
AAAAELPPVE ERRGRTKGKS RIVGFAGATS SNSQSPNGMD IDWRAASRSR SRIPMDIDWR
ASSRSRSRSA APFRNPFSEA HAHHLLAAGG TPIAEMGQYM AGHGSLNIHA ANAHSTSHHG
NQNQYHHASS LPGPSSMMLQ SLSQLPEKEG EQDEVQQAVE HMNGADLYPA SAPQNRSALE
HLQMSLASGL NPSEEATSNL PGINGPGLYT HSQENFHPHY GFLPRRVRKT SFDHTVKLLE
EGEPSSSPQF MSNPRKRHAE ASPQGGANSP LPEGDSGFPT SNFTFSFPQS YENFFDLAAA
SATPSGTQEN NDVNHEGDDD LADLTDWASH PVTADTSAFG SPSAFGHIEP GMSLPSMPQA
TGDNPFDFQQ LMHLYLNANS SASPFTHINP SQVLGAVPGQ AANEFSPSAV SPQSGAPTPS
NNSSGNNIRP LPKAVGGKAV DNRQMPPPNR SSSTPNLAAL RMSSQSESSK HGRTASTNAS
GNAGSGSSKG NKGSSGNNKS RPGTPTSENE GGPGSIMPSG ENPTMCTNCQ TTNTPLWRRD
PDGQPLCNAC GLFYKLHGVV RPLSLKTDVI KKRNRAGPGP KESNPSRKNS VASSKNVSVR
SKPSSPTVAS SANSGGGSKK ARHASDAPVE