Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNN00610 |
Symbol | |
ID | 3255344 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | - |
Start bp | 198987 |
End bp | 202553 |
Gene Length | 3567 bp |
Protein Length | 1073 aa |
Translation table | |
GC content | 57% |
IMG OID | 638254477 |
Product | expressed protein |
Protein accession | XP_568593 |
Protein GI | 58262366 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.430818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTCGACCTG CCCTTTCCCA GCCGCCGCAG CAGACACCCG CCATGGCAGT CCCCCCTCGC CGCGTGTCCG TGGCTGCGCC TGCCGCGCCC GCCACCCGGC CAATGGCCAC ATCCAGCCCG CCGCCCGCCC CCCCCGCCCC GCCTGAGGGC CGCCTGCACT GGACGCCACA CGCCGGAGAC ACCAGGGCGA AGGGCGGAAA GGACAAGGCG GGCTGGGGCG AGCTGCCGGT GGCTGTGCTC CAGTAAGTCG CCGCCGCCGC CGCCCCGAAC AGTGACCTGA CGCGCCGTGC AGCCGCATCC TGTCGTACGC GCAGCTCGCC GTGCCGCTCG ACCTGACGCT CCAGACGTAC TACGGCCACG ACCAGCGCGC ACGCGAGGTT GCGCTCGCAC TGGTCCGGAG GATATGGTTC TGCCGCATGC GGATGGTCTG CACTGGGTGG CGCAACGCGG GTGAGTGGCC GCCTGGTCGG TGGATCGCCG CGGCTGATTT CACCCATAGT CGACTCGCAT TCCTTCTGGC CAGAATTCAC TCTGCTGCTA GACCCCTCTA GACACCACTC GTCGGCCGTA TCCGACATAC AGTCTGCTCG CCTGGCACCT TCCACCCCGT CGGTCCCCAC TCTCTTCCAC CGCGCAAGGT CCAGCACGCT CTCCGCCTGC CTCGCATGCC GTCTCAACCA CCCTTCACGG CTGGGGTACT ACCCTGCCGT CCGAAAACGT CTGACCTTTA CGTCGAAATT CGCTCTGGCG CCGACATGCG AGAAGCACGC CAACAACTTC TGCTCAGGTT GCATGCGCGA GAATGAGCCT CAAGCTGGTC GTGTCGGGGC CCTCGTAGAA GGCGGGACAC CATCGCCGGG CGAACTCGCT GTCCCCTCGC TTATGTTATC GCAATGCAAC CACGGAGACG TGGACGAAAA CGGCGTGGAG CGTTTCCCCC GATCGTTAGT CTGCCCCGAC TGCCGCAAAG CGGCGATATG GAATGAAATC CGACTAATAC TCCATGAGTG CGCTCGTGGA GGCCGGATGA GGGGTGAGCG GAGCCTGTGG CTGTACAATG AAAAAGTCAA GGATTATATC GATTTCAACG TCGGTACAGC GTTCGAGATG GGCTACGCAG CGGTGGAAGA GCAGTGGTTG ATTGATCATA CTCGTTGGGT AGAGCTGTCC GAGACAGCTT TGCAATTACA AAACCACGAA AAGGCGCTCA AGCTCCAGTT TTTGAGGACG GCAGCGGAAG AGACACCGGC GCAAAAGCGA ATGAGATTGG CCAGGGAAGC AGAATTGAGG GGAGAAGATC AAGTGGGCAA AGAGACGGAA GAAGAAGCGA TGGAGATGGA AATGCTATAC AGAAGCTGGT GGAAAGAGCT AGAGGATGAT GACTTGAGCA GTGATGATGA GGAAGAAGAT GAATTACTAA ACGACAAAGC AAGTCTTTTT CTTTTTTCCC CGTTCATGAC CCAGAACCAA AACTAAATTC CACTTGGCAG TTCCGAGGAA AACTCAAAGC AGGGTGTATC AACGATTTCA TCAGCGACCG CATCCGTTAC GCCTTTTGGG TATCCCCCTC TGACGAAGTT TCCAAAATCG TCACAGATGA TCGTGACCGC CGCATCGACA GATCAAGCGT GATCCATACC ATGTTCCCCG ATATTGCACT GAACAGCGCC CACCCCTTTT CAAGGTACAT TGAATTCACA TTCGAGCCTG CCCAAGCTTG TGCAGAAGCA GCGGGTCTGA TATCCCTCGT GCCCAACGAT CCATCATCAT CCCCGATTTC AGATGGCCGA TTTGACCCAT TCTTACCCCC CGACCGCCTC TTACGCGCAC TAGATAAAAC GTTTGCAGAA ATGCTCTCCG CGAGGACGAG CACCGCAATG GAGAATATCG TCAGTATGGT ACGAGAGTAT TGTGATGACG ATGACGATAA AGCGGAGGAA GTGTGTGAGA ATATGAGGGT GGAGGATATT TTGGGGAGGT TGACGGCGTG GCAGATGTGG GTGCCGAGAT CGCTGGCGGA TCAGATGAAG CTCGCGGAGA TGCAAAAGGA GATGGAGGCC GAAGGCGAGG TTGACTATGA GGGTGAGGAA TCAGAGGAGG ATCTGGAGGG TACCATTGAA GAAGTACCGA CGTCAAGGTC AGGATCCCCG AGGGTAGAAC TGGTCGAGGG AGGAGAAGAG GAAGTGCACT TTACACCTGT GGAGGTCCCA CCGTCTGATG ACTATCAGCC CTCTCCTCCT TCTTCGTCTT CCCCGCCTCT TACGCTTACC CTTGGGAAAC GCAAATCGAC TGATGTTTTC CCGGAACCGA CAGATAAACG TGCTCGCCCA TCTACCTCTC CTAATGAACG CGAGCGGAAA CTCGAGCAAC TTCCTCAGAC CCCCCGGAAA CAGATATTAT CGCAGGACGA TACTTCCGGT GGGTTGGCCG TGGATACCAG CGCATCTGCT GCCGTTGTGG CCTCTGACGG GGGAGGAGCC GGAGGCGGAG GAGGATTGAA AAGAAAAGAA CCGCCTTCTC CAGCGGTCAA CCATATTGAT AAAGGACGAA AAGAAGTGAC TCCCCCTCCA GGTCCCAAAT TCGATTTGGC AGACGGTCAT GGTTATTATG GGGATAAACG GACGAGAGTG ACGGGAGATT TGGTACAAGA AGCAAGTTCT GCGCCTGTAT CATCATTGCC TGGGAGCCCG ACGCCCATGA AGAAAAGAGT GGGAGGCGGA GGTGGAGGAG AAGAGGAGGA GGGAGGGGGA TTTGAAGTGG ATGTCGAAAC GGAAGAAATG TCACGGAGAG GGACGAGCGT AACGGAGACA GTGGGCACTG TACCGGTCAC ACCCGAAGAA GGGATGAGCC TCGGTGGGGA GACTGTCATA GATCCCAAGC TGGTAAGCAT GGACCCTGAA GGGTCAGATG TGGACGACCC ACAAGCGCAA TCCGTTCAAA CGATTGACAA CCCCCCGCTT GTCGAGCTTG TCACCCCTTC TCCTTCCCCT TCCCTCTCTA CCTTCCCAGC CCGATCCCGA TCCCCATCAA CCGTGTCTGC CGCGGATTCC TACTCTTCAG GCTCCTCCTC AATCCCGCTT GACGGCGATG GGGACGGCGA TGGCACCCTC ACCCCTCTCA CGAGCGCCAC CGCCCGTCTA GCCAGGTATG TGCAACGCGC TCACGCCTCC ACGCCATTCA TCCCCTTACC CGTCTGCCGA CTCTTACCTC ATCCCGATCA CCCAGCCCAC CCGAACCAGG GTGGACAAGG CGTACAGCTT CCGGTAAACC TGGGAGAAGG AGCAAATAGA GTGTTGCTAA ATACGTGGTA CGAGGCAAGA GGGGAGTTGA GAGAATGTAA ATGTAGGATC TGTGAGAGGG CGAGGAGAAA GGCTTGGGAG TCGTTGGAGG CGATGAGGCA GTTGGTCGCA AGCGGGGAAG TTACGTGGGA GACATTGTTA TCGTAGCTGG ACGAATTTGA CGAGTTGCTA TCCATCGTTC CCGATGAAAA AGGGATTGGC GTTTAAGATG ATACTGTTTC CGGGGGATTT GTTGACGGTA GACGTCTTAT AAAAGTTGCA TATTAAT
|
Protein sequence | MAVPPRRVSV AAPAAPATRP MATSSPPPAP PAPPEGRLHW TPHAGDTRAK GGKDKAGWGE LPVAVLHRIL SYAQLAVPLD LTLQTYYGHD QRAREVALAL VRRIWFCRMR MVCTGWRNAV DSHSFWPEFT LLLDPSRHHS SAVSDIQSAR LAPSTPSVPT LFHRARSSTL SACLACRLNH PSRLGYYPAV RKRLTFTSKF ALAPTCEKHA NNFCSGCMRE NEPQAGRVGA LVEGGTPSPG ELAVPSLMLS QCNHGDVDEN GVERFPRSLV CPDCRKAAIW NEIRLILHEC ARGGRMRGER SLWLYNEKVK DYIDFNVGTA FEMGYAAVEE QWLIDHTRWV ELSETALQLQ NHEKALKLQF LRTAAEETPA QKRMRLAREA ELRGEDQVGK ETEEEAMEME MLYRSWWKEL EDDDLSSDDE EEDELLNDKA SLFLFSPDRI RYAFWVSPSD EVSKIVTDDR DRRIDRSSVI HTMFPDIALN SAHPFSRYIE FTFEPAQACA EAAGLISLVP NDPSSSPISD GRFDPFLPPD RLLRALDKTF AEMLSARTST AMENIVSMVR EYCDDDDDKA EEVCENMRVE DILGRLTAWQ MWVPRSLADQ MKLAEMQKEM EAEGEVDYEG EESEEDLEGT IEEVPTSRSG SPRVELVEGG EEEVHFTPVE VPPSDDYQPS PPSSSSPPLT LTLGKRKSTD VFPEPTDKRA RPSTSPNERE RKLEQLPQTP RKQILSQDDT SGGLAVDTSA SAAVVASDGG GAGGGGGLKR KEPPSPAVNH IDKGRKEVTP PPGPKFDLAD GHGYYGDKRT RVTGDLVQEA SSAPVSSLPG SPTPMKKRVG GGGGGEEEEG GGFEVDVETE EMSRRGTSVT ETVGTVPVTP EEGMSLGGET VIDPKLVSMD PEGSDVDDPQ AQSVQTIDNP PLVELVTPSP SPSLSTFPAR SRSPSTVSAA DSYSSGSSSI PLDGDGDGDG TLTPLTSATA RLARYVQRAH ASTPFIPLPV CRLLPHPDHP AHPNQGGQGV QLPVNLGEGA NRVLLNTWYE ARGELRECKC RICERARRKA WESLEAMRQL VASGEVTWET LLS
|
| |