Gene Information |
Locus tag | CNJ02610 |
Symbol | |
ID | 3254338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | - |
Start bp | 752882 |
End bp | 756362 |
Gene Length | 3481 bp |
Protein Length | 897 aa |
Translation table | |
GC content | 53% |
IMG OID | 638253418 |
Product | nucleus protein, putative |
Protein accession | XP_567551 |
Protein GI | 58260282 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
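The Start bp, End bp, Gene Length and GC content fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive (which is consistent with the listed length of 3481 bp). The function names `gene_length` and `gc_fraction` are hypothetical helpers introduced for illustration and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces used to group bases."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Coordinates taken from the record above.
print(gene_length(752882, 756362))  # 3481, matching the Gene Length field

# Illustrative input only: the first two 10-base groups of the gene sequence.
print(gc_fraction("GCAAGGGGGC AATGGAGAAT"))
```

Passing the full gene sequence from the Sequence section to `gc_fraction` should give a value comparable to the GC content field, although the database may round the figure or compute it over a slightly different span.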
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.333547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAAGGGGGC AATGGAGAAT CTCCTCACCA TCCGTGACCT TTTATCGAGC GTGCAGATAG AGAACTTTCT TCTTCACTGT AGTGTATGAC AGCTCTGTCC CTCATTGATC AACATCTTCC CATCGACACT CACAACTCCT CCAGACCTCT CTCGTCCTTT TCGTGTAGCG TCCGCTCCTA AAGGGCGATA CCGTCGATCT TTGACCAGCT AATCACCTTT TGGGCATACT ACCTGACATG TACGTCTTTA TTCCCCTGAT CTAATCTCGG ACCCTTGACG AGTCCCCATG ACCACACATC CGTCCCCTCA CTCAAAGCTA TCAGCTTTTC CATACGTCCA ATTCTCATTC TGCCTCCGCC TGTGGAATAC CAACGCGCTG ATTGCGCACT AGAGACACAG TTGAGGTATC CCCAAGAGCC AGTTCTACAT CATTAAAAAT ATCCGGCCCG GACCCGCCAT GCAACCTCCC CAAAAGAAGC TCAAGCGACG CGTACCCGAC ACCCAACGTC AGAGAGTATC GGTATCATGC GATCGGTGCA AGGTCCGCAA AATCCGTTGC ATCCGCATCT CAGGGGGGAA CGACCCTTGC GCAGCATGCG CACAGCTCAG TCTCAATTGC GAATCAACCT TACCTCGGAA ACAAAGGGTC TACGCTAGCT ACGACCAGCT ACAATTGAGA TACCGTGCAC TCGACACCTT GATCAAGCGA TTGTATCCCG GCGAGAATGT CGAAAGCGTC GACGATATAT GCGAACTGGC GCGGAAACAG GGGCTCGATT TGTCGGGATT TGAAGACGAA GGGGAGGATC TGGATCCACT ACCGAAACTG AGCGACAGGG AAAGCTCGGG CACAACTTCT GCTTCCAAAG AAGGCAGCAG CTTATTCGAC TCGGTACAAA ACCTACGTAT ACCAGAAGGA GGGCTCATCC CAGCACCTCG AGGCGGATAC CACTACGTGG GTCCGGCAAG CTCGTACCAG TTTGCAAATA CGATCCGGCA CCTAGTCAAA AAGTCGAGTG CATACACGCT TGCATTCGAT CGTGTGGGGT ACAGACGACA ACAGCGGGCA AATGAATTCA CCTCGTCAGA TCGGACTACA GCTCTCGAAG CGCGTATACC AGGGCATCCT GTGATGGTTG GAGAAAACGA GGCCTCCCCC ATGAGCGAAA GTATAGGATC GTGCCCTTCT GATGTCGGTC CTGTACCGTC TCCTCAGGAC CGAACAACCC CCAGAAGTAT ACCACACTCC ATCACCCGTC GGACGATAGA CATCATGCCT CCCCGCCAGC TGGCAGACAA ACTTGTTCTC GCTTTCTTTG ACCGGGTGCA TCTGAACTTT AATCTCTTTC ATCGAGGAAG TTTTCAGGTT CGCTACGAAT CAATATGGTC ATCCAGGAAT GAAGCCGGCT TAGAAGATCT TGAGCCCGGG TGGCTATGTG TCCTGTGTAT GGTGTTTGTA CTGGGCGCTC AAGCTCTAGA ACGAGACGGT CTGCGGGAAG CTACAGTTAT CCAAAGCCGG TATCTTGCCA TTGTCATCCG CGAGGGGATG CAGCGACTCG TCCTCACGGC AACACAATCA AACGTGCAGG CCCTAGCTCT GCTCAGTCTG TATCAGCATA ATGCCGGCGA ACGCAATACC GCCTGGATGC TGGTGGGACA CGCTGCTCAT ATGGCTGTCG CCCTAGGCAT GCAGCGGGAT GGCGAAAATG CAAACTTTGA CTTTATCGAG CGCAATACTC GGCGGATCAT ATGGTGGACA CTGTATCTCT TTGAGCAGAA TCTGAGCTTT ATACTCGGTC 
GGCCGAGCGC GACGTCGACT CCGGACGTCA GTGCGAGTTT GCCAGACGAG GCGGTCAAGG ATGGCGCAGA TTCGCCACCG GGCTACTTGG AACAGGCGGT CAAGCTAGGC GACATTTCGA CCAAGATCAA ACGTTTCACG GCCGCCATTT CTTCCGACTT TGACAAGCCT AACCGGTTGA CTGCGACTAC CGACATTGCC AACCAGCTAG ACGAACTGCT GCTTCAATGG GACCGGTCAC TCCCGCCACA TCTGAGATAT ACAGCGCAGT TTGCAACGGC AAAACATCGC CGGACAGTTC GTCTTCTTCA CGCGACATAC AATCACCTGC GGTCTGTGCT AGGCCGACCG TATTTACTTT GCAAAATCAA TCATGATCTC GATAATTCCC AATCGCCTCT TCACGTTAAC TCATCATCAG GTCTAGCAAG TGCAATCACC GCATTGTCGC AAACGTCCCT GTCCGCCGCA AGAAGCTGCA TGGAGGCTTT ACTCTCCTTG GCGAGCGCCG CCTCGCTCGA AGGAGAGGTC TGGTACGACT ACTACTACGT CCATCATGCT TCTCTCATCC TGTCCCTGCC CTTTTTAGTC GATTTCAACG ACCAACACGT CGCCTCGGAT CGAGCCATTA TATCGGCGAC ATTGAACCTT GCTCAAAAGA GCCGCTTGGC CCCCACGTAC CGTATCTTGA TCAACGTGTC GATCCAATTT GCCAAGATTG TTGGTATAGG ACCAGATGAC GACCCAAGCA GACCAGCTTC CCCTCGACTC GGTGCAAGCG CAATCCGCCC AGATCTGTCT TCGTCCGGGA AGTCGATGGT ATTTCCCGAA GGATTCACCG ATGAGAACAA CCATACCGAT TGGAACCTAG GGCCGAGTTC GGTCACGCCA GAGAACTCTG GTGGCAATCG CTCGTTCGAC TCTCGAGCGA ATTATCACAT GTCGGGCATG TCGCACGAGG CCGGTCCTGG TACAGGTTCG TCGACGGTAC AGTGGCCGAC TCAGCTCCCC CACCACACGA ATCCGACAGC ATCAGACCCG TGGTCACTGC TTCCCATGCT CAACTCGACT CCTTCCTCTC AGCCGCTCTC GCTCGAACAG CTCTTGGGCA TGCAGCCATC GACGCTCTTC AACGACAGTG CAACTCAGCA ACCTGTCGCC GACCTTGGCT TCTCGGACAT GTACAACTTT GGTTTCGGGC TGCCGGCACA GAATGATGGG ATAGGCCCTT ACTGGCCAGG TGGCGTGGGT GAAGAGGGTG GAAGCGGAGG CTTGGGTGAT ATGCCTTGGG ATTTCTTTGC CGGGGGAGAT TGGGCCGGGG GTGGACATGA AACTGGACGA GAAGGGAGAT GATTGTTGAG GTGATGTTGA GTGGAGCTGA GCCGAGTCAA GTTTATTTTG GTTCGCTGGT GCATGGAAGG AGGAAGAACT GCGATGACAT CGACAATGGG CACGATGCGC TGGTCACAGG AGCATAATCA CGCATTCACA TAGGCATACC TTTGGCACTG GCATAGACAT ATTAGGGCAG CAGTTTGTAT AACTGAGACA ACCAACGCAT TCATATACAT TTATCGCATT CTGGGCATTT CAATGGATAG ATTTAGTCGT CTGCAGCATT ACTACTGATG CTCAAAACTC GCAAGTCGTT CTAGATGTTG TGAAATACGG GCTTAGAGGG G
|
Protein sequence | MQPPQKKLKR RVPDTQRQRV SVSCDRCKVR KIRCIRISGG NDPCAACAQL SLNCESTLPR KQRVYASYDQ LQLRYRALDT LIKRLYPGEN VESVDDICEL ARKQGLDLSG FEDEGEDLDP LPKLSDRESS GTTSASKEGS SLFDSVQNLR IPEGGLIPAP RGGYHYVGPA SSYQFANTIR HLVKKSSAYT LAFDRVGYRR QQRANEFTSS DRTTALEARI PGHPVMVGEN EASPMSESIG SCPSDVGPVP SPQDRTTPRS IPHSITRRTI DIMPPRQLAD KLVLAFFDRV HLNFNLFHRG SFQVRYESIW SSRNEAGLED LEPGWLCVLC MVFVLGAQAL ERDGLREATV IQSRYLAIVI REGMQRLVLT ATQSNVQALA LLSLYQHNAG ERNTAWMLVG HAAHMAVALG MQRDGENANF DFIERNTRRI IWWTLYLFEQ NLSFILGRPS ATSTPDVSAS LPDEAVKDGA DSPPGYLEQA VKLGDISTKI KRFTAAISSD FDKPNRLTAT TDIANQLDEL LLQWDRSLPP HLRYTAQFAT AKHRRTVRLL HATYNHLRSV LGRPYLLCKI NHDLDNSQSP LHVNSSSGLA SAITALSQTS LSAARSCMEA LLSLASAASL EGEVWYDYYY VHHASLILSL PFLVDFNDQH VASDRAIISA TLNLAQKSRL APTYRILINV SIQFAKIVGI GPDDDPSRPA SPRLGASAIR PDLSSSGKSM VFPEGFTDEN NHTDWNLGPS SVTPENSGGN RSFDSRANYH MSGMSHEAGP GTGSSTVQWP TQLPHHTNPT ASDPWSLLPM LNSTPSSQPL SLEQLLGMQP STLFNDSATQ QPVADLGFSD MYNFGFGLPA QNDGIGPYWP GGVGEEGGSG GLGDMPWDFF AGGDWAGGGH ETGREGR
|
| |