Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG01620 |
Symbol | |
ID | 3258874 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 461198 |
End bp | 465328 |
Gene Length | 4131 bp |
Protein Length | 1242 aa |
Translation table | |
GC content | 55% |
IMG OID | 638257779 |
Product | transcriptional activator, putative |
Protein accession | XP_571878 |
Protein GI | 58269444 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.722836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCACTCGCC ATGTCGACAG ACCCACCGAG AACGTCGTCA CGTCCACGGC AAAAGTCCCA GCGGGCACTC GAGCATGAAG ATACCAAGCG CTACCTCGAG CAGCAGCTCG ATTCCCCCGC CAAGATCCCA AGGCCACGGC CCAAGAGCAA GCCCAAACAC CGGTCGTACT GTGTATGCAA GCAGGATACG TCAGGGCCGA TGATAGAGTG CGATGTCTGC AGCGACTGGT GCGTCGCTCT CCTTGTCCCG CTGTTCCCGG CTGACTCGTC GGCAGGTTCC ATTTTAAATG CATCAATCTC GCAGAGGACG ATGCAGAGAA AATCCGTACG TCTCCTGGTC CACACGTAAA AGCACTAACG CTTGTGAAAA GACAAGTATG TCTGTCCGTC ATGTACTCTC TCAAATCCCG ACCAGTACAC CACGTGTAAG TTTATCAAAG ACACAAGGCC AATGGGACTC ATGCTTGGTC GGCGGGTAGA TTCCTACGAT ATAGCGTCTT TCCCGTCACC GTCTCCGCCG CCTGGGCTTG AACTTGGTCC TGCTGGTGCT GGTGTGCAGG AGATAATCAA GCCACGACCA AGCGATGGAA AAGGCAAGAG CAAGCACGAG GATCAGAAAA AGAAGAGTAT ACTGGCAAAA CGAAAATCCA AGGCGAGTGT GACATATCAA AAGCTGCGAT CAAAGAACCG CGGAGCTGAT ATATGTCGTA TTGTAGACCG AGACGAGTCG AGGCGGAGAG GATGCCGATG GAACTGGAGA CGGAGAAGAC GACGACGACG ATGGGGATAA AGTCATGGCA ACCTCTTCGG CTCATGACTC TGACTCGGCT TCCGTCTCAG CCTCGAGCTC AGACTCGAAT CCACGACACC CAGACTCGGA CTCGGACGCA GCTTCGAACG CTTCAGGACC GTCCCGTAAA CGCCCTCGTC CCCGACCTCG TGTGCCGTCC AAATCAACAA ACGTACAAAG AGGCGACGAA AAACCCTCTC TGCCCTCCAG CTCTTCCACT CAACCCAAAC CAAAACCCAA ACCAAAACCC AAGCCCAGAT CCATCTCCCA ATCCCATCCA CAGCCCGAAA CCAAACACGT CCAGGCCAAG ACCCTCCCCC CAATCCGACA GTACGTTCTA TCCAAACTAA CACTTCTCTT CCACACCCTC TTCTCCTCCC CACCTTCCTC CCATCCGTTC ACTGAAGAAG AATCCGCCCG TTATGCCGCC GATGTCGAGC GCGCCATGTT CCAAGCGTTT AAAGATATCC ACCCATCTAC CGCTTCCACT ACCACCACCA CTTTTTCCGC CACCGCCACC ACCGGCGTCG GCGCTGCCGC CAAAGAATCC GCTGGAACGC GGTACAAGAC CCAGTTCAAC CTCCTCACCA GCTCTCTCAC CAAAAACAAA CACCTCCTCC GCCCCTCCCT CCTTCAATCC ATCCTCTCCC GCTCCCTCCC ACCCACCGCC CTCGCCCTCC TCTCCGCATC CGATTTAGCA TCGGCAGCTC AGCTGGAAGA GATGGAGCGG GCAAAGCAAG CGGTGTTAGA GTCGACGGTG AAGACGAGGG AAGAGCGAGA GAGGGAGAGG GAGATGATGA GTGGAGGAGG CGGCGGGGTG AGAGCGGGCA GGGATGGGTT GGAGCAGTTG GAGGATACGA GGGAGAAGGA GATGTGGGAG GTTAGGAAAG AGGAGGAGCG GGAACGTAGC GAACAGCGTG AGCGCGAGCA TGAGCAAAGA GAGCGAAGAG AGAGTGAGGT GGAGCGTGAT AGTGTACATC CAGACGACGG TTCATTGAGC AAAGATGTCA AAAGTATTAA AGATACAATC AAACACGAGC CAAGTCGTTC ACCCACCATC CCATCTACCG CCATCCCACA CCACCGCCGC ACATCTACAT CCACATCCAC ACTTACACCT ACATCCACAC CTTTCAAGAC CGAGCCCGCC AAACCGGCGT TTGAGCTAAC CTCTGCATGG GGGGCGTCCA ACCTTCAGGG TGGCGGGAAT GATCAAGGTG GTGGTGGTGG TGGTGGTCGT TTGGAACGAA AGCCGAAAAA GTTGGATTTG AGTGATCAAG CTATTGCTGG TAGTGTCGGC GATGTTGGCG GTGTCGGGGG AGGTGCGGGG GAAGGGGATT TGGATTTGGA TTTGGATCTG GGTTATGGAT CGGGAGAGAT GGATTTGGAC GAGTTGGATT ATACCGGTGA AGTCGACGCC GCTGCTGCTG CCGATGGTGC AGGTGGTCAC GGAGATGGTG TTGATCAAGA GGCAGAGATC GAAAATCGAG AAGAGGGGGA AGGACTAGGT GAGGAAGGAA AAGGAGAAAA CCAGCAGAAG AAGAAATGGG AAGAGTTGAT GGAACGGCCT ATTGAATGGT CTGGTTGGGT AAAGTTATTT CTTTTGTGCA ACTTGATATT CCAAAACTAA AAACAGTCGA CAGATCACAA ACCCCGCCGT GCCATCTTCT CTCCGGCCTC CCATCGCCCT CCGTCTCCTC ACCCCCCACC CCTCTTCTTC TTCTAATTTC ACACAATCAC ACACCCAAAT ACATCCCCCA TGGACAGCCC TCTTGCCAAA CTCGACGATA GAGATAACAG GGCGTGTACC CACCAAGGCA TCGGTACAGT ATCTGAGCGA TATGAGATTG AACCCCGGGA AAGAGTTGAT CTGCGTCTTG TTTACTTTTG GGAAACCTTA TGCTCATGGC GAGGGTGACG GACAAGCAGA GGGAGAAAAA AGGGAATGGG AACGATTGGT GGGGTATCAT GTTGAACGAG AGTAAGTCTT TTTTTTTTTT TTGCTGGCCA TCTTCATAAG CTAATACCAA ACCCACCTCC CAGCCGACAC GCAATCTACC TCCCATACGG CCCCAAAGGC CCCCCACCCC CACCACCAGC ATCATTATCC CCATCTACAA CAGAAGGAGT GCAGAATACA TTTAAAGAAC TGTATCTCAT CCCCCTCCGA CCGGGGGATG CGTTACCGGA ATTCTGTGAA TTGGTTGACG GGCTGGATGT GCTCAACGGT CGATCAGCGG GAAACGAAGA TGGCACAGGC GGAGATCTTG CCCGGGGAAA AGAAAATGGG AAAGTGGGGA GTGTGTGGTT GGGCGTTTTT GTTGGGAGTA AGAGACGGAA CGGGAGTGTC AGCACGAGTG CAAGTGCGAA TGGTAGTGCC AGTGCGCAAC GGCAGATACA AACACAGACA CAGGTGCAAA AGGAGGGCCG ACCGCAAGCT AGAAGCGAGA GCCAGAGTCA GACGCCCCCG CATGACCCTC GTGGTCCTCG TATCCCCCGT CCGGCCTTTA TCCCTCCTGC CCACGGTGCT TCGCCGTCCA TCTCTTCCTC CCGTCCATCC GTCCACCCGT TATCACCTGC ATCTGCACAA TCACAATCAT CAGGACCGGG ACAAGGGGTC ATTCAGAATG AAAAGCTCCA GCAACTCATG GCTAGTCTTA ACCCGTCTTC TCTCGGCCTA CCTGGACTAC CTGGATTGTC CGGGTTGCCC GGGTTGCCAA ACTTGCCAGG AGTCCATGGA GGGGCGCAAG TTGGTACACC CCCTTTCCCT CCTGGAGGAA CTACCCCACT CGGAACTGGA ACTGGTGTAT CCAATTCTCC TCCTACTCTC CCCCGACCCC CGCAGCATCT ACAAACTAAT CACCTCAATC AGCGTCTTCC ACCTCCTCCA CCTACACTTC CACCACCCCC AGGGGGCTAC CGTCCCCCAC CTTATGGATA CATACCGACG CCACCTCAAG GGCCGGCAAT GACGCATGGG TATGGGTCGC CGCCGTACCG GAGAGGAAGT GGAGGGGCGT ATGCTTATCC ACTAGGATCA TCGGGTCCGC CGGGGCCGCG GGGACCACCG GGGCCGGGGC CCTTGCCTAG ACCTCCACTC CCGCCTGGCG CTCAAGCAGC TCAGGGAGGT AGACCGCCGC AGGAAGAGAG ACACCATCCA AATCAGAATC GTCAGCAAGA GCAAGGACAC AGGCAAGGTC AACGCCAAGG GCAAAGATGG AGGAATGAGA GAGGGAATGG AAATAACAAT GGAGGATGGA AACGTCGGGG TCATTGAGTA GAACTAATAA TATGACGATA AGTGATGTAT AATATATTTC ATTATGTGCA TAGGCATATA C
|
Protein sequence | MSTDPPRTSS RPRQKSQRAL EHEDTKRYLE QQLDSPAKIP RPRPKSKPKH RSYCVCKQDT SGPMIECDVC SDWFHFKCIN LAEDDAEKIH KYVCPSCTLS NPDQYTTYSY DIASFPSPSP PPGLELGPAG AGVQEIIKPR PSDGKGKSKH EDQKKKSILA KRKSKTETSR GGEDADGTGD GEDDDDDGDK VMATSSAHDS DSASVSASSS DSNPRHPDSD SDAASNASGP SRKRPRPRPR VPSKSTNVQR GDEKPSLPSS SSTQPKPKPK PKPKPRSISQ SHPQPETKHV QAKTLPPIRQ YVLSKLTLLF HTLFSSPPSS HPFTEEESAR YAADVERAMF QAFKDIHPST ASTTTTTFSA TATTGVGAAA KESAGTRYKT QFNLLTSSLT KNKHLLRPSL LQSILSRSLP PTALALLSAS DLASAAQLEE MERAKQAVLE STVKTREERE REREMMSGGG GGVRAGRDGL EQLEDTREKE MWEVRKEEER ERSEQREREH EQRERRESEV ERDSVHPDDG SLSKDVKSIK DTIKHEPSRS PTIPSTAIPH HRRTSTSTST LTPTSTPFKT EPAKPAFELT SAWGASNLQG GGNDQGGGGG GGRLERKPKK LDLSDQAIAG SVGDVGGVGG GAGEGDLDLD LDLGYGSGEM DLDELDYTGE VDAAAAADGA GGHGDGVDQE AEIENREEGE GLGEEGKGEN QQKKKWEELM ERPIEWSGWI TNPAVPSSLR PPIALRLLTP HPSSSSNFTQ SHTQIHPPWT ALLPNSTIEI TGRVPTKASV QYLSDMRLNP GKELICVLFT FGKPYAHGEG DGQAEGEKRE WERLVGYHVE RDRHAIYLPY GPKGPPPPPP ASLSPSTTEG VQNTFKELYL IPLRPGDALP EFCELVDGLD VLNGRSAGNE DGTGGDLARG KENGKVGSVW LGVFVGSKRR NGSVSTSASA NGSASAQRQI QTQTQVQKEG RPQARSESQS QTPPHDPRGP RIPRPAFIPP AHGASPSISS SRPSVHPLSP ASAQSQSSGP GQGVIQNEKL QQLMASLNPS SLGLPGLPGL SGLPGLPNLP GVHGGAQVGT PPFPPGGTTP LGTGTGVSNS PPTLPRPPQH LQTNHLNQRL PPPPPTLPPP PGGYRPPPYG YIPTPPQGPA MTHGYGSPPY RRGSGGAYAY PLGSSGPPGP RGPPGPGPLP RPPLPPGAQA AQGGRPPQEE RHHPNQNRQQ EQGHRQGQRQ GQRWRNERGN GNNNGGWKRR GH
|
| |