Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA02310 |
Symbol | |
ID | 3253579 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 601422 |
End bp | 604832 |
Gene Length | 3411 bp |
Protein Length | 1033 aa |
Translation table | |
GC content | 52% |
IMG OID | 638252563 |
Product | hypothetical protein |
Protein accession | XP_567149 |
Protein GI | 58259473 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.22008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTATGAACAA CCAAAACCAG CAGCCACCGG TCAACCTCTC GGACCCGCAT CTACAGGCCC GCCTCGCCCT CCTCGCAGCC CAGCAACAGG GCCTTGCAGA CCGGCAGCCA GTCTCCCAAG ACCACATCGC AGCAATGGCT GCTGCCATGG GCTCCATGCA AGGACAGAAC AGGGATGCCA TCATGAAGCA GGCAAGCTAC TTCTCCCACC GTCCACGGCC ACCGCTCACG TCTCACAGTT ACAAGCGCTC CAGAACTCAC AAGCCCACCG CGCAAAGCTC GCGGCTCAAC AGACAGCCAA CCAGCAGCCG CAGCCCGGAC CTAGCGCCCC ATCGCCTGCA AACCAGCCGG GTTTATCTGC ACCATCGCCG ATTACCGCAC CCTCCCCCTC CAACCCCCTC CATCAAGATA GCTTTCAACA TCCGGGCATG CCCAGATCTG GCTCAGGCGA TAATCTTTTA AATCAGCAAC AACAACAGCA GCAGCAGCAG CAACAACAAC AACAACAGCA ACAACAACAA CAACAACAAC CGTTCATGTC GCAACAGCAA CGCGAACTGT TTCTAGCACA ACAACGAGCT CATCAGCAGC AAAGCATGGC TGGACCATCT GATGGCTCTG TTCGTCCACC ATCACAGGCT CAGCCAAATT TACCCGGTCA ACCTCAAGGC CAGTCACCAC GACAACCGCT CAGCCACGTG CAACAGCGAC AAAACTTTTT AAAGACATTT ATGGGCTACT TTCAAAATAT CGGACAACAG CCGCCCCCAG CCATATTCGA TAACGGCGAA AGAGAAGGAG CGTTCAAAGT TGGAGACGGA TGGATGGACG TCATTGATCT GCTGATGGCC GTCATGAAGG CCGGTGGTAT TATGAATGTA GGTGCCTTCA TCGAGTCTTT TAGCATTGCC TTGATCATCC TTTCTACAGG CGATGCAACA GCCAGCAGAC AGTCCCACAT GGCGCAATCT TTTAGCAGTC AAGAACATCC CCACGACTCT CCCTTACCCC ATCCCTGCTC CCAAGCCCCC CAATTCAGAT CCTAATGCCC CTCCTACAAT GACGACCGAT CCTGTCCAGT ACCTCACCGC TGCCTACTTT GCCTACATCC ACGGGTTTGA GACACATATG CAAAAAACAA GGCAGGCTTC GTATGCTAGG CAACAGGCAA TGGCCATCGC CCAAGGTAGA CCGCCTCCAC CGCCACCCGT CATGCCTAGC CTGGCCGGTT TCAGACTTCC AGGACAAGCC CCTAGCCCTG CAGAAAGTTG GTCATCAGCC CAGGCACAGA CACCAGCTCC GGCCGTCCCT TCTTTACCGC CACCCCTCAA TACATCAGCA CAACCACCAC CTGCCTCCAC CGCTCCCACG CCTACTCATC CCGCGCCTTC ACCAAGTGAT TCTTCTACAA AATCTTCGAC TACATCTGGC CCACCTCGAG CGAGAAAGGC ATCCAGCAAA AAGGAAAAGA AGGATCTGTC TGTCAACACC AATGTGCCAA GCCCAATTGA CGGAGAAGCA GAGACGCCTA CGGGCAGTTC TGGCGGGAAA AAGAGGAAGA GGAAGAATGC GAATGCACCT CAACCAGTAT GTTCAAGGCA TGCTTCAATG CGACAGTTTC TAAATTTTTA ATTTTTTATA ATAGGAAACC GCCTCTACAC CCGCTCCGCC CCCAATTGTT TCTACTCCTG CCCCCGAACC TCCTGCGGAA CCAACCAAGC GAGCACGCTA CCGTGTCGAA TACCGTCCCA TCAACTTTCC TGTGCAGACA TTTGCCGGCT GGGAACCTTC CATGGTCTCC TCCACTTTCC CTAAACACTC CCTTCGACAA GGTACCCGCC CTATCCATGA TCTCGCAGTG GTCGACATGG AGGCCATATT GATGGGCCTG AGAAGCCGTA TGCCAAAGGA ACTGGGTTAT GCGGTCACAG TTTTAAATAT GCTGTCAATG TCGCACCCCG AAGAGAATAT CAACGGCCTG CCGCTGCATC ACCTGAGAGA GATTTTCATC GAGCTCTTGG ATTTGACGGA AGAGGCCGCA TTTGGGGATG GAGGGCGGAG TGGATGGCTG AAAAGTTGGC ATGATCTGAA TGACGTCAAG GAAGAATCGG CTGCCGACGA TAAGAACAGC TGTATGGACA ATCTGAACAA GATGCCGTTT TTCGAACTAG AACGATTGGG AAGGGATTTT GATTTCTCGG TATACAAGGA TGAGGAAGAG TATCAATGGA GAAAAGAGGA AACGAGCGGG AGCACTGGAA TCGTTCTCGC GTGTATCAAT ATGCTCCGCA ACTTTTCGAT GCTTCCAGAC AATCAAGAAC TCATGGCATC CTATCCTCAA ATGATCAACC TCTTAGCTTC CATCTCAGAT GCCCGATTGT GTCGTTTGCC CGGAGAGAAT TGCACCAAAA TCAAACGACC ATTTTCCATC ATAGAACTCG CCCGAATCCG CCGTGATTGC GTCAGCATCC TTGTCAACAT TGGCGAGTAC GTTGAGCTCC CTCGGGTACC CTCGTCTTCA AGCCTTGCCA TCTTCCGACT CTTATCCACC TTTATCGCAT CAGGCTGGGA GTCCAATGCC CTTAGTGAGC CTGTTTATGG TCCCACGCTT TCATCATCCA TCCGCGATGT CGGTCCGCCC ACCGTCGTCC CATCCATTGA CCGTGCCCTC GCCGCTTTTT CCCTTCTTGC CCAACCAGAT GCAAATCGCG AGGCACTGGG GTCGTCTGTT CCCCCTTCAG AGCTGATAGA CATGTACGAG TCCCTTCTCA AGCTCTTACC CGTCACCAAA CGCCAATTCG AAGCGATGCA TAGCATCGAA GAAACCTTGG GCTACAACGA AACACTAGCC CTATGCCTTT ACTCCCTGGC CTTTCTCTCT CCCTTGCATG TCAGAGCAAG TATGCGCAAC GTACCGGGAA GCGTACCGCT TCTCACCCGC ATCATCTTTG ACACCGCTCT TCAAAAATCC GACTATCGCT CAAACCCTTT CGGAATCCTC TGCCGTCGCT TATGCGAAAC CCTAGGCGTC CTCAACGGGA CCGTCTCGCC AGCAGGCACT GTTGAAGGGC CATCTGGGAT GGGATTCGGC GCAGGAGGGA TCGAAGGTAG CGGCTGGAAG TTTGCTAGTG GAAGGGTGGA AAACGGATGG TTGGCAGGTA AAGAAGAAGG TGTGTTGGGT GCTATTTTAG GCGTAAAAGG AATGAGTTGG GCCGCGTTGG GAGAGCTGGA CGGTATGGTC TGGGGTGGTG ATTCGATATA AATAAAATAA TGATAATAAT AATAATAGAG TGGATAATAA TAGAGTGGGC TATAATTCAT TCATGTCGTG TAGTATAATT ATATGTAGTA GTCAGCCACG CACGGATGTG GTGTAATATA GCTTAGCTGG GGTATAAAAT AGTGGAAGTC A
|
Protein sequence | MNNQNQQPPV NLSDPHLQAR LALLAAQQQG LADRQPVSQD HIAAMAAAMG SMQGQNRDAI MKQLQALQNS QAHRAKLAAQ QTANQQPQPG PSAPSPANQP GLSAPSPITA PSPSNPLHQD SFQHPGMPRS GSGDNLLNQQ QQQQQQQQQQ QQQQQQQQQQ PFMSQQQREL FLAQQRAHQQ QSMAGPSDGS VRPPSQAQPN LPGQPQGQSP RQPLSHVQQR QNFLKTFMGY FQNIGQQPPP AIFDNGEREG AFKVGDGWMD VIDLLMAVMK AGGIMNAMQQ PADSPTWRNL LAVKNIPTTL PYPIPAPKPP NSDPNAPPTM TTDPVQYLTA AYFAYIHGFE THMQKTRQAS YARQQAMAIA QGRPPPPPPV MPSLAGFRLP GQAPSPAESW SSAQAQTPAP AVPSLPPPLN TSAQPPPAST APTPTHPAPS PSDSSTKSST TSGPPRARKA SSKKEKKDLS VNTNVPSPID GEAETPTGSS GGKKRKRKNA NAPQPETAST PAPPPIVSTP APEPPAEPTK RARYRVEYRP INFPVQTFAG WEPSMVSSTF PKHSLRQGTR PIHDLAVVDM EAILMGLRSR MPKELGYAVT VLNMLSMSHP EENINGLPLH HLREIFIELL DLTEEAAFGD GGRSGWLKSW HDLNDVKEES AADDKNSCMD NLNKMPFFEL ERLGRDFDFS VYKDEEEYQW RKEETSGSTG IVLACINMLR NFSMLPDNQE LMASYPQMIN LLASISDARL CRLPGENCTK IKRPFSIIEL ARIRRDCVSI LVNIGEYVEL PRVPSSSSLA IFRLLSTFIA SGWESNALSE PVYGPTLSSS IRDVGPPTVV PSIDRALAAF SLLAQPDANR EALGSSVPPS ELIDMYESLL KLLPVTKRQF EAMHSIEETL GYNETLALCL YSLAFLSPLH VRASMRNVPG SVPLLTRIIF DTALQKSDYR SNPFGILCRR LCETLGVLNG TVSPAGTVEG PSGMGFGAGG IEGSGWKFAS GRVENGWLAG KEEGVLGAIL GVKGMSWAAL GELDGMVWGG DSI
|
| |