Gene Information |
Locus tag | CNK02870 |
Symbol | |
ID | 3254623 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 844663 |
End bp | 848067 |
Gene Length | 3405 bp |
Protein Length | 837 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253778 |
Product | nucleus protein, putative |
Protein accession | XP_567882 |
Protein GI | 58260944 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGTCTCACA AGCTGCTTTC TCCAATAGCC CGCAGGCCGC ATATCATTCT ACATCCTTGT CGTGCCATTC TCCGTGGAGC TCGCTACTGA GCCACAGTGC ATAATGTCAT CTTCGGGGGA GCCATCAGAG ACGAGACAAC GACTGGATCG CAATAGGCGC CGCTCAACTT CATCCGACCG CTCCGCCCCC GCTCAGAATG GCAAAAAAGC TAAAACTACG GAAGAGAGTA ACCAAAGCAA CGAGAAGAAT AATGAGGATG GTTCCCGTCC GAATGAGGAT CCGGTTGCCT CGAGTAGATC GGTAAGTTTG GTGTTAGAAC GACGGATATC AGGAAGTACT GACTGGCCTT TGGCTCCTGA AGTGCAAGGA ATGTCGCAGG TTGAAGCGAA AGGTATGTGG TGAAGAGCTT ATAAGATATC GCCCTCACAA ATGTGCAGTG TAACCGCCAA CTTCCTTGCA GCGTATGCAC ACGATTCGGA AAAAAGTGCG CATATGCTAC CGAACCTCTC CGTAAAAATC AGGAGCAAGT TATCATGCTA TATCCATATC GTGCTTTACT AACCCTTGTA CGTTCCGCGT CGGCCATCTT GTTCATTTCC CCATCTCCTC CCACAGATAT ATCATGGGGC TGGAACGAAA AGTCGATTAT CTTCAACACT TGCTGGAGAC AAGCCATTCC AGTCACGGGG GTGGAGTTCC GCCAGCCAGA TCGGATCAAG GGCTTAATGT TGGCTCCCCA GCCTCACAAT CCCAATCGCT TCATTCGGAG GAACATGTGA GGGCTTGTTA CGCGTTCAAG TTTCGATTTT TAATGCCCTT TTCCAGGTTG TTCCGATGCG GCAACCTCCC TTGCTTCCAA TTCAATATCA GCCATCCCCT GCCCCTACAA GAATCCCAAT GCGGCCTCCT TTGCTTGCCG GTCAAACACA CTTATCTACA GCTAGGACAC CAGAGACGCG CAGCTTTACT CCCTCACACA TACCTTCTTA TGAACTTTCC ACCGCTCCTG GCAGCGCCAG CTCAAGCAGT GCTATTATGA GACCAGAGTC AATGCCCACT ACAGACAATG CAGCGCGCCC ACTTATCAAC GGAGCCGAGA ATCCAAACAT CTCTTCTTCC AGCACAATGG GCCCATCACA TTCGATGGCG ACAATCCCCG TGGACTTCAA ATACGACCTT CCGCCGGCTA GTGAGGTCTT AGAAAGGCAA CCCAGCAGCG TGTCTGGCTA TGAATGGAAT GAGAGATGGG CGACTAAAGG CCTCGGAGGG AATGATGGTT ATGCCTCTTT GTCGATAGAA CCTGATGGGC AAGGGTATCT AGGTACGTGA CTCCGAACAA CTTATTCTTG TATTCTTCGC TGGCCACCAT ATCAACGGTT GTGTTTTCGC TCATACTACA AAATTTTCAG GATTCGCTTC AGGGTCAACA CTGTTACGCA TCCTGCAATT GTGTGCGGGG TCGGTTTCTC TTGCATCACT GGAATCAGAT CTTTCAGCGG CTCCGCCGCC ACCTCGACCC CAAGGGTGGA CTCCTACAAT GTCGGAAACC GTGTCTTGCG TAGATGCGTA TTTTGAGCAC TATCACCCTC AGTATCCGCT GTTACATGAG CCTACGTTTA GAGCGCAATG GAATGACCTG GTGCCCCAGC CATTAGCGAG CGAGTAAGTG CTCAAGCAGA TCAATTTCAA AGTGATATTG ACATCTTCAA AGGTGGGATT TTCTTTTAAA CATCGTTCTC GCGATCGGTG CCTTTTGTGT CTCTCGTCCA ACATACGTCA TCGACTATTT CCTCGAAGCC 
GCCATATCAC AACTCTCTGT CGAATATCTG GAAAAAGGCA GCTTGACATT GGTTCAAGCC TTCTGTTTAT TAAGTAACAT TTCACAAAAA CGTAACAAAC CGAATTCAGG TTCGGTATAT ATGGGTAAGT ACACGTATTA CATGAGCTGA TTTGCGAAGT TGACTTGTTA AAAGGTATCG CATTACGCTT GGCGATCGGA CTAGGTCTTC ATCGCGAGCT TCCCTTTTGG AATATTAGTC CATTTGATAG AGAAGTCAGG TAAGCTCCGA AATTATCCCT TTTTGATATG GATTGTTCAA TTCCCCGCAG GCGGCGAACT TGGTGGGTTG TAGTATCATT TGACTCTGGT TCAACGATTA CATTTGGCCG TCCAATTGTA AATATGACTA CTGTTTAGTA TAAGTAATTG AGACTTACAA TACCGCAGCT AGAACCTCAT TATCCAGCGG AATCCGATGT GTACATGGTG CATAATGTTC ATGACAGAGT CAGTGCATAT GTGTCTATCC AATTGCACAA TCAACTAATC ACATAAATAG GCATTTACCC CAGCGGCTCG ATTACCACCT GTTGAGGTTA ATGAGCCAAC CATCTATACC GCCCTTATTA ATCAAGCATC CTTCCATGCT TCAACCAATC GTATCTATAC CCGCGTCATG TCATCACCGG CACCGTCTGC TACAGAGACT TTGGCGCTTG ATTCAGATCT GCTTAGCTGG TACGCTACTG TACCCGAACA TTTACATCCT CAGAACGAAC CCGTATCACC TCATTGGCTT GAGTTTGCAC AATACAAGAT GTTTTGGAGG TACTGCAACC TGCGAATCAT TCTTCATCGC CGTGCTTTTC TTGAGAGAGC TTTGAAAGCC CTCCCATTAT GGGTAACCGA TGAAGACATC GAAGACGATG ATAACACGGT GGCGGAAATA AAATGTACCC GCCTATGTCA GTATAATGCT TCCGATACGA TTCACAGCAT GAGTCGGTTC TTTTCGATGC ATCAACGGAT GAGCCGCCTG GAGAGTTGGT ATGGTCTGTG AGTTTACCTG TCATTTAATC ATGCATCATA AACTCAATTG AGATCACTTA GCCATTTCCT TTTCCATGCA AGTTTCATCC CTCTTATCGC ATTGCATGTT GATCCAAATT CCCTTCGACG CCCAGTCTGG GAGGAAGAAG TGTCTCTGGC ACGCGAAATC CTTGTCTCTT TGAAGGACGA CCCATTGGTC GGGAGATGCC TTTCAATTAT TGATGCTCTT GTTCCCCATA CTTCTTCGAC ATCGATCGGG CCAGGACAAA TGGGATTCCA AGATACAAGC ACCATGCTTT ATGAGATGCT TCAAAGCAAT CCCAGCTGGC AAAATTCGCT CGATGTGCCT GATGATGACC TGGCTCCAAA TTTGTGAGTC GTCGATGACG TTTCCGTGGA CTTTTTGTTG ACGCAATACA TATAGATTAC CGGTGGCAGA TTTGGCGACT CTGAGCTCTC TGTGGCCCTA TCGAAGTTCT TAAAAGTAAA AGGAAGTTCT AGTCATGATT CACTTTGTGT GATAGCTGTA AAAGAGGTAT ATGGGGAACG TAGCATAATG CAATGTAGAT AGATA
|
Protein sequence | MSSSGEPSET RQRLDRNRRR STSSDRSAPA QNGKKAKTTE ESNQSNEKNN EDGSRPNEDP VASSRSCKEC RRLKRKCNRQ LPCSVCTRFG KKCAYATEPL RKNQEYIMGL ERKVDYLQHL LETSHSSHGG GVPPARSDQG LNVGSPASQS QSLHSEEHVV PMRQPPLLPI QYQPSPAPTR IPMRPPLLAG QTHLSTARTP ETRSFTPSHI PSYELSTAPG SASSSSAIMR PESMPTTDNA ARPLINGAEN PNISSSSTMG PSHSMATIPV DFKYDLPPAS EVLERQPSSV SGYEWNERWA TKGLGGNDGY ASLSIEPDGQ GYLGFASGST LLRILQLCAG SVSLASLESD LSAAPPPPRP QGWTPTMSET VSCVDAYFEH YHPQYPLLHE PTFRAQWNDL VPQPLASEWD FLLNIVLAIG AFCVSRPTYV IDYFLEAAIS QLSVEYLEKG SLTLVQAFCL LSNISQKRNK PNSGSVYMGI ALRLAIGLGL HRELPFWNIS PFDREVRRRT WWVVVSFDSG STITFGRPIL EPHYPAESDV YMVHNVHDRA FTPAARLPPV EVNEPTIYTA LINQASFHAS TNRIYTRVMS SPAPSATETL ALDSDLLSWY ATVPEHLHPQ NEPVSPHWLE FAQYKMFWRY CNLRIILHRR AFLERALKAL PLWVTDEDIE DDDNTVAEIK CTRLCQYNAS DTIHSMSRFF SMHQRMSRLE SWYGLHFLFH ASFIPLIALH VDPNSLRRPV WEEEVSLARE ILVSLKDDPL VGRCLSIIDA LVPHTSSTSI GPGQMGFQDT STMLYEMLQS NPSWQNSLDV PDDDLAPNLL PVADLATLSS LWPYRSS
|