Gene Information |
Locus tag | CNN00290 |
Symbol | |
ID | 3255442 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | + |
Start bp | 107224 |
End bp | 109297 |
Gene Length | 2074 bp |
Protein Length | 543 aa |
Translation table | |
GC content | 53% |
IMG OID | 638254444 |
Product | expressed protein |
Protein accession | XP_568543 |
Protein GI | 58262266 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACTATGTC CACCCATTCC CATGTCGATC TCGTAAAGGC TCTTGAGAGC ATCGATCAGA GTGTCCTTCA TGACGATACT CTCTCCTCCC ATGAACACAC CCATCACGAC CATCACCACC ACATCGCCGC AATTGGGCAT CACCATGAAG CATCTGGGCG GCAAGAAGAT CTTTCAGAAT GGACGCGAGA GGAATTGCAA GCTGAGATTG TCAAGCTACG ACAAATGGCA GGGATTGCCA ACGACAACAA TATGGCGGCC GAAATGCTGC CTACATCTGC TGAAGGGTCG GTTGACCCTT CTTTGCGAGA GAACGCTCTC GTAACGTTGC CAGCTGAGGG AGGTGGCTCC AAAGGCAAGA GAAAGCGCAA ATCCCCTACA AATGATGTCA CAACGAGAAA GGTACAGAAA AGGCATACAG AGACGGGGAA GAGGCTCGAG AAGGAAAGGA GAACGGAGCT GGCAAAAGTT GTCCGCAACA AGGTAGGTGT ATCGACTCGT GACTTTGGCT AACTAGTTTA GATGCGTTCT TTGGTAGGCA TGGAGTTCAA CAACAGTCCG GTGCCCCAGC CAACGGTCAG TTTCTCAGAG GAGGAAAGAG GATTTACCCC TTTCGTACCC GAATGGAGAA ATATGCTCGA TGAAGATAAT CTTGCGGTGA GTTCTGTAGG CGAGAAACAT GGCGGAAAGT TTAGCTGATA TCGTGCAGTG GGTGGACAAA ATCTCCAAAA GCGTGCAAGA AGAAGCTACC AATGGCCTTC ATCCCAAGAT CCCAAATGCT GATCTCGTTT CGGAGATCGT ACATGTGAGT ATAATCTTTT TAGCATTTTT TTATCCTGCA GTTGACTGAT CCATATTACA GGGGGTCGCA TGTACAGCCT TCACCAATCT CTGTAAACGT TTTATCAACG AGAATTCATC CGATGGGGCG GATAAGAGGG AACGGTACAT CAAGAAGCGT CGTCGATGGG CACGTAAAGA CCTCAAACAA AAACGCCGTT CCCGATCAGC CGCTTCCCCA TCCATCTCTC TTTCCCTTCC GGCCCCTCTC CCTGCATCAG CTCTACACAT TGACTACATG TCCTCAGAGT ATTCGTCCTC TGGAGATGAT GAGTCTGACG TCCATCCGCA TATCAGGGTG ATGCAAAAGG ACAAATGGAG GGAGGCGACT GAAGAAGCAC AAAGGGATGC GAGTGTGGCA GCGGGGACGG GGAGAGGTGG GTGGAAAGCG GGGCAGGGGC CAAAGGTGCT GGAAGTTCGA AAACCAAAGT GGCGAAGTCA ACAAGTACGT GCTCAGGGGT TTGAGATGCC TGTTCATCCT CCCATTCTTC CTATGTGTCG TGTCAGCTCT CACTGACGTC TCCTTCTTTT TGTTTTCTTC TTTCCCCTAG CTTAACGAGA TATACGCGCG ACTGGATGCT CATGCAGACG CATACGCCGA CACACGCGCA ACGGGCGCCC GCCCCGCCTC GTCTTCCCAT TCCACTTCTG CAGCTCCTCG AGCAGGTCAC GTCGCCCCTT CGCACAAGCG ATTTTGTCTC CCTCCCGAGC TTGCTAGACG GGGAAAAGCT CCTCGAGACT TGGGTGAGGG TTGGATGTGG GTTACGGGCG TTGTGGGTGT TTGGCCTGAG GAAGGGCCCG AGCTGCAGAT GGAGGCGGCG GGGGCTGGGG GCGAGGCTGG AGATGTGGTT GGGGTTGGGG TTGATGATGC GGAAAGGGAA CAGGGCGAGA ATGGAGAAGC AGCCGTGGTA GAGGAGATGG AGGTGAGAAC ACGGGTAGTG AATGAGATGG CCGAGGTGAC 
GCTTGCGCAG ATGGGACAGT GGGGCACGAG CGAAGAAGTG ATGGGCGAGG CGGACAGACT GGGGTTGGTT AATGCTCTTG ATGGGTTATA ATCACGTTCG CTTGTGCTCT GTATCATTTA CTGTCTTATG CAGTTTTTGG TCAGTTTCTT TATTGCCTCT CACTGGCATC CACGTGCAGC TGGATGCGTC ACTGCCGCCT TCAAACTGTC AGGTCTATCC TCCCCGTCTG GCAAATAGAA GGCGGAACCT GGTAATCTAT GTACGTTACT GTCG
|
Protein sequence | MSTHSHVDLV KALESIDQSV LHDDTLSSHE HTHHDHHHHI AAIGHHHEAS GRQEDLSEWT REELQAEIVK LRQMAGIAND NNMAAEMLPT SAEGSVDPSL RENALVTLPA EGGGSKGKRK RKSPTNDVTT RKVQKRHTET GKRLEKERRT ELAKVVRNKM RSLVGMEFNN SPVPQPTVSF SEEERGFTPF VPEWRNMLDE DNLAWVDKIS KSVQEEATNG LHPKIPNADL VSEIVHGVAC TAFTNLCKRF INENSSDGAD KRERYIKKRR RWARKDLKQK RRSRSAASPS ISLSLPAPLP ASALHIDYMS SEYSSSGDDE SDVHPHIRVM QKDKWREATE EAQRDASVAA GTGRGGWKAG QGPKVLEVRK PKWRSQQLNE IYARLDAHAD AYADTRATGA RPASSSHSTS AAPRAGHVAP SHKRFCLPPE LARRGKAPRD LGEGWMWVTG VVGVWPEEGP ELQMEAAGAG GEAGDVVGVG VDDAEREQGE NGEAAVVEEM EVRTRVVNEM AEVTLAQMGQ WGTSEEVMGE ADRLGLVNAL DGL
|
| |
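The length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is only an illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon. The helper names `span_length` and `residue_count` are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def residue_count(grouped_seq: str) -> int:
    """Count residues in a sequence printed as whitespace-separated blocks."""
    return len("".join(grouped_seq.split()))


# Values copied from the record above.
START_BP, END_BP = 107224, 109297
PROTEIN_LENGTH_AA = 543

gene_len = span_length(START_BP, END_BP)

# Assumption: the coding sequence is one codon per residue plus one stop codon.
coding_nt = 3 * (PROTEIN_LENGTH_AA + 1)

# Under that assumption, the rest of the span is not protein-coding
# (introns or other non-coding sequence), which fits "Is gene spliced | Yes".
non_coding_nt = gene_len - coding_nt

print(gene_len, coding_nt, non_coding_nt)
```

Passing the full "Protein sequence" value to `residue_count` is one way to confirm it matches the stated protein length, since the function ignores the spaces between the 10-residue blocks.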