Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA01820 |
Symbol | |
ID | 3253806 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 486751 |
End bp | 491945 |
Gene Length | 5195 bp |
Protein Length | 1290 aa |
Translation table | |
GC content | 52% |
IMG OID | 638252515 |
Product | transcriptional activator, putative |
Protein accession | XP_566547 |
Protein GI | 58258269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.203708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCACTCCTCC CCCATCCACG TGTCCCACAA TCTGCTCCCG CTGCCACTAA CCCACCTACA CGCCGCCGTC CCGCCAACCG ACCTTCCGAC GTCGAGTCAA AACAATACTC GAGGCACTCC ATCGGTCAAC AATTTTCAGC CACCCGCAAC CAGCCCGCAA CTGCAGAGAC TTCAGTCCGT TACCGCGCCC ATAGACTGCA GTAGAGGAGA CCTTGCAAGC TCCCCTCCTT CCTATCAGGT ATGTGCTATT CCCAGCAGAT CCATTTTACA TTTTATACGC GGAAACACGA CAGGAGCGGG AGGATATTCG TCAGCTCATC GTGAAGGCGG CGCTGGACGG GTGGGCTTCT CGTCAATTCC ACAAGAAAGG CCAATAAAGG CTGTTGCCGA TTGCCAATGA AAAGTTTGCC TTCTATTGAA ACGGTGTCTG ACCTCCCTCT TTTGGCTTTC GTGCCGAGTC TTCTTGCTTT CTGTTCGCAT CTCTTGTCAT CCAAGCTTTG TTGTGGCAGG CATCATTCGG GGCCTCCAGC CTTTCTTCTC GTCCTTTGAT CTGTGGTGAA TGCAGGCTTA CGATGGACCG CGCGGTCTTC CCGCTACTTC CAATCACCAC AGCCTCTAAC AACACAAAAC CGACACCATT TCCACCCCGT CTCCACCCGT TATAGGTCCC TCTCCTAGGT CTACCACATC TTCAAAATCA TCTCCTGTAT CCACTACTTC AACTAGCACT GGTGGGACTT CTTCTAGGCT TCGTCCACTT CTACCGAAAC CAACAGCGCC TCGCGAGCCG TCTACTCGAC CCGTTAGCAC TGCAACCAAA CAGCAGCTAG CGGCGGCTAG AATGGACAAG CTTCCATGGC GCACTACATC GGGGGGGAAA AAGGGACCAT TCACTGTGGA TGCTGGAGGA GGGCAAGGTG TCTACAATGG GAAAGTAAAT AGTGCTCGGG TCCCGGAACG ACCCCCTTCT GCCGCGTCTT CCCGCCTCTC CAATCCTCCA ACACCTGCCA TGTCGCCTTC CATTCCAATC AATCGACACA ACTTTCCTTA CCCTCCACAG GGATACCAAT CATTCTCTCA ACCATCAAGT ATAGGACACA GCGGCAGTGG TTCTTTCCAT GGTAATGACC ATTCGTCGAT TGCCACCACC CCTACGCCAA TGACTCCTAT TGATATTGTT GGAGTGAGTT CTAGCGTCCC ACGAAACAGC TTTTCCCAGG GTGTTCAGCC CTCTTCACAC TCTCAACTAT CCTCTTCTAG ATCAAAGCGT TCTGTTCCCA TGATATCCAC TACTCCCGTA GAACCTCCTC CTTCCTCTTG GGGTGCATCT TTTCCTGATC AACAGGGTGA AAATGATGCG AATAGCAAGC CATTCGACTG GACGTCTCTT CCGAACCCCG CGAATTGGTT TTCTCCGTTG GGCACGGTTC AAAAGACCAC CTGTAATGAT GATCCCATCG ATCCAGCTGT ATTTGCCAGC CTTGCTGATC TGGTCGAGCG AAATCAGGGT AAAATGAACG ACAGTGCTTC CATCGATTTT ATGGGCGCTT TGAACGCAAA TACGTCTCCG AAAGCTGCTA GGAGCACTGC CCATTCAACT TCACATCCTG GAGGTACAAG CCTTTTGAGT CGTCGCTTGC AGCACCAACA GCAGACTAGC TCTGATGGTC AGAACATACA TTCAAGCAGT CAATCTTCGG CTAACTCTCC TGGCATCTTG CACAATTCCG GGGGGCTATC TGGATTCCCT GGTCCTCAGC AGCAGTACAA TCATGTCACC TTTGCACAGC CTGGTAAGGT GAGTAAAGGT GCATCTAACC AACCCTTGAC GCCTTGGCCT TTATCCGAGA GGGCAATGGG TTCGAGCGAG ACCCCCGTGA CAACACCCGG CGGTAGTGAT TTCGGAGTCA ACAGCCCCTT TGAGATGGGT ATTACCGGCT CGTGGCAGCA GGAATCGATG GCCAAACATG AGTTACAGGG CTCAAACAGT AGTTCACGAT ATCCGCCCAT TGCTCCACGT CGGCGTGAAG CACCTCAGCA TCCTGCTCCT GTGTTCCAGA CACACCGGGG AAGTGTACCA TCCTCCGAGC ATGCCTCGAG AGCCGGCTCT GAAGCTCCTC ATGACGGGCC CATGCCCGGC GGCGTTTCAC TCAATGGTTT ACCTCCTTTG CCGAACGGAT TGTCTCTTGA GCATCTTGCC CAGTATGGAG CTGCAGGTTT AGAAATGGCT TTGAGAGTGG GCATGGGTAT TGGTATGGGC TTGAGTCAGC AAACGCAGCA AACGAAGCAA ACTGTTGATG TTGCATCCCC GCCTTGGCCA CTTACCAAAA GTGTTCCTAC GCCATCCTTT TCTCAAAATC CGTCGTCCCC TGAAGCATCG TCTCGTAAGG GACGAAAAGA CTCAAACATC GTCAGCGACA TCCTTCAAGA CGACTTCCTC ACTGCTCGCG TTCCTAGCAC TCCTCTTATG ACCCCACCAC TCAATGGTTT TGGGTCTTTC CCAGTTACTC GTCGACCATC TCAGAGCGAT GCCACAAGCC CGTTACCCGA GGTTGGTCCT CCGGAGCAGG TGGCTGAGAA GGATCCTCTT GCTGCGCAAG TCTGGAAAGC ATACGCAAGA GCCAGAGATA CCTTGCCCAA CGGGCAGAGG ATGGAGAATC TGACCTGGAG AATGATGCAT CTTACACTGA AGAAGAAGGA AGAGGAGCAA GCGGCAAAGG AGAAGGAGGA AAGAGAGAAA GAAGATAAGG AGGCTGCTGA GAAGGAAGCA GCAGAAGCAG CAGCAGCAGC AACAGCAGCG GCAGCAACAG CAGCAGCAGC CGCTGCAGCC GCAGAATTGC CTCCTGTGGA GGAAAGACGA GGAAGAACAA AGGGGAAGTC AAGAATTGTT GGTTTTGCTG GAGCAACAAG CTCAAATTCC CAATCGCCAA AGTGAGTTAC CGTGTCAACA TCATGATGTC ATTTACTGAT TATTTTTCCT AGCGGCATGG ATATCGATTG GCGAGCAGCA AGCCGATCCC GTTCTCGCAT ACCCATGGAT ATTGATTGGC GTGCTTCTTC CAGGTCACGC TCCCGTTCTG CGGCTCCATT CCGCAACCCC TTCAGTGAAG CTCACGCTCA TCACCTTCTT GCTGCAGGCG GAACACCTAT CGCAGAGATG GGCCAATACA TGGCCGGACA TGGTAGTCTG AACATTCACG CCGCTAACGC CCACTCGACA AGCCATCATG GCAACCAGAA TCAATATCAC CATGCTTCAT CTCTCCCTGG TCCTTCTAGC ATGATGCTGC AAAGCTTGAG TCAGTTACCC GAGAAAGAAG GAGAGCAGGA TGAAGTGCAA CAAGCTGTCG AGCACATGAA TGGGGCTGAC TTGTACCCCG CCTCTGCACC TCAGAACAGG AGCGCGCTGG AGCATCTTCA GATGTCTCTC GCGAGTGGGC TTAATCCTTC CGAAGAAGCT ACGTCTAACT TACCTGGAAT CAATGGTCCA GGTCTCTATA CACATAGCCA AGAAAACTTC CATCCTCATT ATGGGTTTCT TCCTCGTCGT GTGCGAAAGA CATCTTTCGA CCACACTGTG AAATTGTTGG AAGAGGGTGA ACCATCCAGC TCTCCTCAAT TCATGTCCAA TCCTCGCAAG CGCCATGCCG AAGCCTCTCC TCAAGGGGGC GCGAATAGTC CTCTTCCAGA AGGTGACAGC GGCTTCCCCA CATCAAACTT CACCTTCAGC TTCCCTCAAT CCTACGAGAA TTTTTTTGAC CTAGCTGCCG CAAGTGCAAC TCCTTCAGGG ACCCAAGAAA ACAATGATGT GAACCATGAA GGTGATGACG ACCTTGCAGA CTTGACAGAC TGGGCAAGTC ATCCTGTTAC AGCGGATACG TCAGCGTTTG GTTCTCCTTC GGCGTTTGGT CACATTGAAC CCGGAATGTC CCTTCCTTCG ATGCCTCAAG CTACAGGCGA CAATCCTTTT GATTTTCAGC AGCTCATGCA CCTCTACCTC AATGCAAACT CATCTGCGAG CCCGTTTACC CACATCAATC CCTCTCAAGT TCTTGGGGCC GTGCCCGGTC AGGCCGCCAA CGAATTCTCT CCCAGTGCCG TCTCTCCCCA AAGCGGTGCT CCTACGCCCA GTAACAACAG CTCAGGTAAC AACATCCGGC CATTACCAAA AGCTGTAGGC GGTAAGGCTG TAGACAACAG ACAGATGCCG CCACCGAACA GATCTAGCAG TACTCCTAAT CTCGCTGCTC TGAGGATGTC TTCTCAAAGC GAATCATCAA AGCACGGTCG TACTGCCTCA ACTAATGCAT CAGGAAACGC TGGGTCTGGT TCCAGTAAAG GGAACAAGGG ATCAAGTGGA AATAATAAGT CCCGACCAGG TACGCCGACG AGTGAAAATG AAGGCGGGCC AGGCTCTATC ATGCCTAGTG GTGAGAATCC CACAATGTGT ACAAATTGTC AGACGACAAA CACTCCTTTA TGGAGACGAG ATCCAGATGG GCAGCCGTTG TGCAATGCGT GTGGACTTTT CTACGTGAGT ACTGTCTTTC AGCTATATAA TTAGAGACGT TCTAATGACC TATCAGAAAC TGCACGGTGT CGTTCGACCC TTGTCGCTCA AGACGGATGT TATTAAGAAA AGGTGCGTTT GCAGTCCGGC GAGAATTGCA ATATCTAATC AGAGTGTGTA GAAACCGAGC GGGTCCTGGA CCGAAAGAAA GCAACCCTTC CCGCAAGAAC AGTGTCGCTT CGTCTAAAAA TGTGTCTGTG CGGTCAAAGC CCAGCTCGCC AACAGTTGCG TCGTCTGCAA ATTCCGGTGG CGGAAGCAAG AAGGCAAGGC ATGCTTCTGA TGCTCCTGTT GAATGAGCGA ATGAATAGGA GCAGTGGGAG CGGAGATTCG GGGAGAGGCG GTTTGTCGCT CGCGTAGCGG GCTCTGATTA CGCCTGACGT TCCACTTTTA TGTTTTTCAC TTCCATATTA TTCGGTTACC ATTTATCCTA CCGTTTTCAA ACTTATACCG TTGAGCTACT CTCGGGCCCG TGCATCCAAC GGTACCATGT ATGAATTCTT AAAACCTCTG GGTCTCTAAA AAGAGGCTCG GTTCTTAAAT GTTAGATACG ATTACAACTG TTTATTGGTT TCTTTCGGTT GAGTTTTGGG GAGATTATTT AGGACGCTCG TTGATCATCA TCGAA
|
Protein sequence | MDKLPWRTTS GGKKGPFTVD AGGGQGVYNG KVNSARVPER PPSAASSRLS NPPTPAMSPS IPINRHNFPY PPQGYQSFSQ PSSIGHSGSG SFHGNDHSSI ATTPTPMTPI DIVGVSSSVP RNSFSQGVQP SSHSQLSSSR SKRSVPMIST TPVEPPPSSW GASFPDQQGE NDANSKPFDW TSLPNPANWF SPLGTVQKTT CNDDPIDPAV FASLADLVER NQGKMNDSAS IDFMGALNAN TSPKAARSTA HSTSHPGGTS LLSRRLQHQQ QTSSDGQNIH SSSQSSANSP GILHNSGGLS GFPGPQQQYN HVTFAQPGKV SKGASNQPLT PWPLSERAMG SSETPVTTPG GSDFGVNSPF EMGITGSWQQ ESMAKHELQG SNSSSRYPPI APRRREAPQH PAPVFQTHRG SVPSSEHASR AGSEAPHDGP MPGGVSLNGL PPLPNGLSLE HLAQYGAAGL EMALRVGMGI GMGLSQQTQQ TKQTVDVASP PWPLTKSVPT PSFSQNPSSP EASSRKGRKD SNIVSDILQD DFLTARVPST PLMTPPLNGF GSFPVTRRPS QSDATSPLPE VGPPEQVAEK DPLAAQVWKA YARARDTLPN GQRMENLTWR MMHLTLKKKE EEQAAKEKEE REKEDKEAAE KEAAEAAAAA TAAAATAAAA AAAAELPPVE ERRGRTKGKS RIVGFAGATS SNSQSPNGMD IDWRAASRSR SRIPMDIDWR ASSRSRSRSA APFRNPFSEA HAHHLLAAGG TPIAEMGQYM AGHGSLNIHA ANAHSTSHHG NQNQYHHASS LPGPSSMMLQ SLSQLPEKEG EQDEVQQAVE HMNGADLYPA SAPQNRSALE HLQMSLASGL NPSEEATSNL PGINGPGLYT HSQENFHPHY GFLPRRVRKT SFDHTVKLLE EGEPSSSPQF MSNPRKRHAE ASPQGGANSP LPEGDSGFPT SNFTFSFPQS YENFFDLAAA SATPSGTQEN NDVNHEGDDD LADLTDWASH PVTADTSAFG SPSAFGHIEP GMSLPSMPQA TGDNPFDFQQ LMHLYLNANS SASPFTHINP SQVLGAVPGQ AANEFSPSAV SPQSGAPTPS NNSSGNNIRP LPKAVGGKAV DNRQMPPPNR SSSTPNLAAL RMSSQSESSK HGRTASTNAS GNAGSGSSKG NKGSSGNNKS RPGTPTSENE GGPGSIMPSG ENPTMCTNCQ TTNTPLWRRD PDGQPLCNAC GLFYKLHGVV RPLSLKTDVI KKRNRAGPGP KESNPSRKNS VASSKNVSVR SKPSSPTVAS SANSGGGSKK ARHASDAPVE
|
| |