Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB03390 |
Symbol | |
ID | 3255864 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1019609 |
End bp | 1023157 |
Gene Length | 3549 bp |
Protein Length | 938 aa |
Translation table | |
GC content | 48% |
IMG OID | 638254984 |
Product | nucleus protein, putative |
Protein accession | XP_568913 |
Protein GI | 58263006 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.175447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAAAAAAAA CTCATTCCCC CTCTTGCTTG CTTTCAACAG ACCTCACAGT CGTCCCAGCC CTTTCATTGA GACAGGTTTC TCCCACGCAG CGATAAAATA CAAGAAAGGG CCGCAACGCA CAGCTTTTTG CCTCTCTTTA GACCTCGACC GAGGCTGACT GAGCGCGTCG GCATCGCGCA TCCGGCCAAA ACATCAACTA GGGGAACGTC TCCTCACAAT GTTCGTCGTC ACTTTTGTCC CATACCTTGC TATACTCTTT TAGCTAACTC AATCTCTTTG ACTACCATAA ATTGTCTGTA CAGTAAGACG CCATATACTG CTTGCGCTAA AAAGGACGTC TACTGCAGTC TCAAAGGAGG ATAACCCACG GCAAACAGTA ATAAAAAGCG TGGAATATTG TGCCACACCA AGGTAAAATG TCCGATGTCG ACGGAAAAAG GCGTAAGATA CAGGTACATG GTCGTCTTCT ATCGTGAGAA GGTCGGATAA AGTAATGACA AGTCTTCTAG CGGGCCTGTG ATGTATGCAG GCGAAAAAAA ATTAAGTGCG AAGGGCCCAT GAATAGCCTG AGTGATGCCA GTCAGTGTAA TATCGTGCTA TATCATTGAA AATGCCTGAC TGAAGACGGT TTCCTGTAGA ATGTGCTCAC TGTGAAGAAT ACGGCATGGA TTGCACGTAC GTCGAGGCGG CAAAGAGAAG GGGGCCTCCA AAAGGGTATG TCGTGTATTA ACTCTACTGA GCATACCATC TCATGTGTAT TATTAGTTAC GTTGAGACAC TGGAGCAAAG AGCTGGACGA TTAGAGAGGA TGCTGCAACA GGTAATTGCC GTTCACCGTT TGGTGTTTTA ATATCTGCTG ATATTACAAG ATTTATCCTG GTGTCGATTT AAACGAATAT GTGGGGCCAA AGCCGGACAG GGAAGACTTT GATATCAGTG CGTATCATGG CACTCTTCGT TCCCTCAATA TCCCACCATA TCCAGCCTTA AAGCCTCTAC ATTCCGAACA TCTTGTCACT CCACATACAT CTCGTTCTAC TTCGGTTGGC ACATCTCCGG CGGCCCCAGC ACCTTCACCC TCAATGCAAG CGCTGGGTCC CTCTCCATGG CGAATGTATG AAAGAGATCC TGCGAAACCT GCTGAAAATG AGTCCGACGT GGAAGAAGAG GCGGCTGCGC AGCTGTCTAT AGCTACATCA ATGAGTCAGT TAGACATTCG CGACAGCCAT TGGCGTTGGC ATGGTCGGGC GTCAGGGGCT TTCCTTATGC GACAGTTTGA AGATCTCAAG TCAGCGACAG GTAATACCTC AAGTATCATA CAAGACATCA ATAACCACAA ACGACAACAA TTCTGGCATG TCCCAGAATG GGAACTTGTC ATTGCGAACG AAGGCTTACG CCCTCTTGAC TACTCTATCT GGCCAGAAAA AGGCCTAGAT CAACGACTTA TCGACGCATA TTTTGATAAC GTCAATCTTC ACCTGCCCTT ACTTAACCGT AAATTCTTTC AACGACAATA CGATTCTGGT ATGTGGCGGA ACAATCCCGG TTTCTCGAGA GTCTGTTTGC TGGTTTTCGC CAATGGATCG CGGTTCGTGG ATGATCCACG GGTCTACTGG CCTGCAAATT TGTCGATGAC AGAGGAAGGG AGTGAACGCC TTGCAACGGA CAAAGACGGT ACGCTCCGTT ACTCGGCTGG CTGGAAATAC CTCCGCAGCC TTCTTCGCAT GGGAAGAAGT ATCATGCAGG GACCAAATCT GTATGAATTT CAAACCCAGG TCCTCATTTG TCAATTCTTG CAGGGGAGTG CTGTCCCACA TCTTATGTGG ATTTTGTCAG GCTTCGGTCT CCGCTCGGCC CAAGAACTAG GCATTCATGT TCGGGCCACT TTACTCCATG CCGATCCTAC CGAGCGAGCT CTTTACAATC GCGCGTTTTG GTGCTTGTAC CACATTGACC GGTATAACTG CGCTGCGATT GGCCGATCAG TCGCTATACA GGATTCTGAC TTCAATGCGG ATTATCCAAT TGAGGTCGAT GATGAGTACT GGGACACTGG AGATACTGAG CGCGACTTCA AGCAGCCAGA AGGGAAAATC TCATTAATAA CGTCCTTTGT CCAAACACTC AAACTCGATC ACATCATGGG CGCAATATTG CAGAACGTGT ACGCAATCAA CAAGCTTCCA GAGCAGCGAG CGGACATTGC TGCTCAGCGT GCCATCGTTG TTGAGTTAGA CTCTGCCCTC AATTCTTGGG CTGACAACGT TCCACACGAG CTTCGCTGGG ACCCTAGTTG CTCCGACTAT CAATTGTTCC GCCAGTCAGC TGTGCTATAT ATCTATTATT ACTACTGCCA AATCCTAATC CACCGTCCCT TCATTCCTGG CCCGCGAAAT CAACATGCCG CCGATCTACC GTCCCTTGCA GTTTGTGTCA ATGCCGCTCG GTCAATCTGC AACATCACCT ATGCGGCACT CAAAAGAGGT AGACAGGAGG GGTGCTTACC CGGACGAGCC CTAAACGTCT CGTTCATGCT GCCAACATGG ATCGCTGCCA TTATCCTTGT GATCAACATC TACTCTGGGA GACAAACAGC GGCCGAACGA GAAAAGGCTT TGATCGACAT TGGGCGATGT GTATCGGCAA GTAAGGAGCT GGAATTGATA TGGAGGCAAA GCGGTAAATA CACCGACTTT TTGTTGCAAT TGGCAAGAGA GGGCGGAATG CCCAACGCCG ACAAGGTGCC TATGGTCGAG AAAAGATTGC GTGAAAACAA TCCGCAACTG TCAGAGCGCT CACGGCGTCC AGAGTCAGTG CAAGGGTCTA CCGCCGGAAC ACCTGATCAT AACTCACCTT CAACCAGTTA CCCATACAGT CATGGTCGAT CAAATGGGCA GAATTCCAGG TCGTCCGGCG AGCATTCCCG CCAGCCATCT GCGGCACCTG TAACAGGGTT TGATCTGCGC AATTTCACTT GTCCAGAAAC ATATGGTGAT ACTTCAGCCA CACCTCAATT TCCCTATTCT CGTGATGATC TCCCGTTTAC GCATATGCCC TCTCCCTCGT CATCCCAAAC TGGCTTTCAA AACATGTTTC AACCATCCTC GCAATCTTCA CAATATCCTT CCAATACTAG GAATGTGCCT CCACACTTTA CACCTTCCTC CCAAGCGCAT ACGCAGTACA ATCTCCAAAC TCCTAATAAT GAACATCCTT TACCGTCGCA GAATGATGGC GAGCTAGCGT CTTCCAATAT CTATGATTCG TTGATTGACA TGACAAGCTT TGAGTCACAA CTGCTTGACA TGAGTACCAC AGCTTTCGGG GGACCCGAAA ACACGTCAAA TGGCGATTGG TGGTCTCGAT TATTTAACGA CTACATGTGG GTAATAGCAT ATTATGGCAT TCGAAAGACA CTTGTAAATC CCAAGCTGAC TGCTTTATTA ATCCCAGGGG TCCTGATCTT CACACCAACA TGCCTCCCGC ATGTGGTCGT TCGTCGAATT CCGGAGTTTG ACAGTTATCG TAAAAGGTTG GCCGAAATA
|
Protein sequence | MSDVDGKRRK IQRACDVCRR KKIKCEGPMN SLSDAKCAHC EEYGMDCTYV EAAKRRGPPK GYVETLEQRA GRLERMLQQI YPGVDLNEYV GPKPDREDFD ISAYHGTLRS LNIPPYPALK PLHSEHLVTP HTSRSTSVGT SPAAPAPSPS MQALGPSPWR MYERDPAKPA ENESDVEEEA AAQLSIATSM SQLDIRDSHW RWHGRASGAF LMRQFEDLKS ATGNTSSIIQ DINNHKRQQF WHVPEWELVI ANEGLRPLDY SIWPEKGLDQ RLIDAYFDNV NLHLPLLNRK FFQRQYDSGM WRNNPGFSRV CLLVFANGSR FVDDPRVYWP ANLSMTEEGS ERLATDKDGT LRYSAGWKYL RSLLRMGRSI MQGPNLYEFQ TQVLICQFLQ GSAVPHLMWI LSGFGLRSAQ ELGIHVRATL LHADPTERAL YNRAFWCLYH IDRYNCAAIG RSVAIQDSDF NADYPIEVDD EYWDTGDTER DFKQPEGKIS LITSFVQTLK LDHIMGAILQ NVYAINKLPE QRADIAAQRA IVVELDSALN SWADNVPHEL RWDPSCSDYQ LFRQSAVLYI YYYYCQILIH RPFIPGPRNQ HAADLPSLAV CVNAARSICN ITYAALKRGR QEGCLPGRAL NVSFMLPTWI AAIILVINIY SGRQTAAERE KALIDIGRCV SASKELELIW RQSGKYTDFL LQLAREGGMP NADKVPMVEK RLRENNPQLS ERSRRPESVQ GSTAGTPDHN SPSTSYPYSH GRSNGQNSRS SGEHSRQPSA APVTGFDLRN FTCPETYGDT SATPQFPYSR DDLPFTHMPS PSSSQTGFQN MFQPSSQSSQ YPSNTRNVPP HFTPSSQAHT QYNLQTPNNE HPLPSQNDGE LASSNIYDSL IDMTSFESQL LDMSTTAFGG PENTSNGDWW SRLFNDYMGP DLHTNMPPAC GRSSNSGV
|
| |