Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNN02230 |
Symbol | |
ID | 3255368 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | + |
Start bp | 690576 |
End bp | 693970 |
Gene Length | 3395 bp |
Protein Length | 834 aa |
Translation table | |
GC content | 48% |
IMG OID | 638254633 |
Product | hypothetical protein |
Protein accession | XP_568709 |
Protein GI | 58262598 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.140535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTCCTGACG TTTTCCTTCT CATCTCCGTT TTGATACTCT TACCTGTAAT CTATAGTTGT TTGTCATTTA ATTCATCCTT ACGAAACTCT TGACCGCTGT CCACCTAGAA GGTGACTGTT TCACACGCCC ATAACCCTGC TCGTTCAGGC AACGACGGAA TTGGTCACGA CTCACGTTCA ATCCCACGAT TTCAATTGCC TTATACGTTC ACCCCTACCT TTCACAGCTA CTTTACTCAG ATATACGCCT ACTTTGACGG CTCAAGGAAA CATTTCATAA CTACGCCAAC TTATCACCGC ATTATATGAC ACCATGACCT CATTTTCCAT ACCACTGCCA CCTCAACACA CCTCATCTGT TGCGCCACTC TCAGATAGTT CATTTGCACC TCGCACTCCC CCCCGGTTCA CCAGGCATCG CTCCGATACA ATGCTTAGCA CCTCGTCCTC CACCTCATCA TCCATCTTAC AGACTCCAGA AACGCCGTCA ATTCTGGAAA CTTTGTCCAC AGGCTTGGGT TTGGGCATTC CCGCTGGGCA ATCTAACCCA TTAGTGCAGT CACCTAACGA ATTGAGGATA AGAAGAAGAG CGAGCAGTTA TGAGTTCGGT GCGTTATGGG AAGGGTCACC AGATGACCAT CCAGTGGGAA AGCCTGGTCC TTCTTCGACG TTTTCAGTCG CAGGTACAGG GTGGAGACTG TGTGACAGTC CTGAGCAGCA TGGTAGAGAT GATTACTTTC CGCCAGCTGC TAGTATAGCT CCAGCAAACG TGATGACCTA TCAACCGAGT CCATTACACC AACAAACACA GTCAGAGGCT CCTCGAGTCG CTGACCCTGG CTTAACAACC ACCAATCCCA CCCGTCCCGT GACGGCAAGA CGAGTGTCTT TAAAACTCAA AGACGCAATT CGGCTCAAAC TGGGAAGATC CAAATCATCC TTCAAGGTCC TCAGTCGAAA GACTCAGCGT GAGGGCGGCG AACTTTCTCA TGACCAAGCA TCCCAGTCGC CATCTGTAAC CAGTAGTGAC CCGTCCACTA GATCTCGCCG ACAACGCTTG AAATCTCTCA TCTCGTTCTC TCAATCGAGC AACTTTTCTT CACAATCGGT ATGTTCTGAC ACTCCATCCT CTGATTTGGC ATTTTCTTCA GGCGGCTCGC AAGGAATCGC CTTGAGTGCA CCTTTGAATG CCGATGAACG ATCGCATTCA TTCATTATAC CGACTACAGT TGATTTCAGT GTAGAAAAGG CGATTGTTGA AGCATGCAAT GAGGAAATTG GTCAAGTTGA AGAAAAAGTG GATAGGAAAG GTAAAGGCCG AGCAGTACAT TCGCCACATC CCTCACGAAT CCAATCCCAA ACAATCGATC CTCATAGTAC GTTTTACGAT CTCTCGACCC CTCTTTTAAC CCAGGAACTG TCTTCTCCTC CCGATATGTG TCCCACTGAG GAGGCTGAGG AAGCGAAGCA CCTTAGTTTT GAAGAAAGTC TTCCGAAAGA GCTCAAGCTT CTGGTGATGA AGAAGCTGAT GGAGAGTTTT GCGGATGAAA GCTTTGGTAG GTTCACGGGT GAGTTGCTGG GAAGGATGGA ATTGATAAAG ATGAGTACTG TAAGTCATAA ACTTACAATA ACCGGTGCCG AGTTGATCGA GTTAGGTTTC CAAATCATGG GAGGCATTAT GTTTTGATGG TCAGCTGTGG CCAGCTATCA ATTTAGCTGC TATTGCCCAT CTCCTTCCCA TCTCCATCCT TCATCGTATA CTTAAGCACT CTGCGTCTTT TATTACCGAC TTTTCTCTCC GAGGGATGGA CGCAGTCGGC GGCAAAATGC TTATCAAAGC TTTAGTAGGC GAGAATGTTG ATATCAAGCT GTATGACCAT ATCGATCTGA CTCCACGACT CAATATCCAA CGACTTGATT TGAGCGGGTG CAAGACTTTG ACCGAAAATG ATTTATGTTG GATCATTTCT TGCTGCCCCA ACTTGCGCTC TCTGAATCTC AGAGGGTTAT CCGCTGTGGG CCCCAAATCG ATAGGCTTCA TCTGGAAAAT CGAAACGCTT GAAGAGATCG ATGTGTCATA TTGCAGAGCG CTTGAACTAC CTTTTCTTTT AAGCTATATC AAACGCATTT CAGAGGTGCA AGCGAAGAAC TTGCGGTCTA TCAGAGCGGC CGGTCTTTTC TTCCGGAGCA ACATGCTGTT ATTGTCTATC ATCAGACGTT GTCATAACTT GGAGAGACTT GATCTCCAAG GGTGCCACGG CTTAACCGAC GACATGTTTG AAAATTTCCA CAATTACTGC ATCGAGGACG ACAAGTGCCT GACAAGTCTC ACCCATCTCA ATGTCTCAAA CACCCCACTT ACCCCTGCCA TCTTCACTTA TCTCAACGGC CATCTCCCTA ACCTGACCCA TCTTGAAATG GCCAACCTCT CGGGTGCAGA CAACCCGGAC GACGATGACG ACGGGTATGA ACTGTCAAAG ATGTTGAAGA GTATGCCCAA GCTACGAAAG GTGGATTTGG AAGACACCGC CGGTTTGTCA GGGGTGAGTG ATATGGTGCT CGAGGCGCTG ACGCCGATGG ATGGGGATGT GGGGACGACG GGGTGTGAGC TGGAGGAGTT GAAGATTGGG TATGCGAGAG TATCATCAGG GGCGATCGTG GACCTTATCA AAGGCTGCAA GAAGCTCAGA GTATTGGAAC TTGATGTAAG CCAAGATGAT CCTCCCTTTT TTGGTTTTCT TTTTGGTCTT GTCCGACTTT TGCTGATTCT TTCGTTGATC AGAATACGGA AGCAAACAAT ACAGTCATGC GCGAATTTCT CCGCCGTTCG CATCCTTGTT CCCGGCTATC CATCATCGAC TGCCATAACG TCACATCGGC TGCTTACACT GAGATTGCAG CTTCCACCAG GGCTCGTCAG GGGTGGGAAG GATGGCCAGC TGTGCCGTTT GGTTATGATA AAGATGTGGA GATGGCTGAG AAGGCGGTGT TGAAGACGTT TTGGGGTTGG AAGAGGGTGG TAGTCCCGAA AGGGTGGGTG GAAATGAGGA ACGAGGCGGA GACGATGGAG AGCCGAAAGA GAGCGCGACG CGAGCAAGCG CAAGACGCTT GTAGCTCAAC AGAAGGGGAA AGTTCAGATG GTTCAAAATG GAAGGGGAAA GGGAAAGCCA AGGATGATGT TGATGGCGAT TCGGGGAGAA CCAGGCCGAG AATGAGGAGT AATGGTTCGA TCAGCCGTGA ACCTGTGGGG TGCATCATTG CATGATGACT CTTAACGTCG GGTATAGGTA TCATTAGTTT CAACCGCTTT AAATGGTTGG AGTTTCAAAT GCCTAGTGAT TTTTGTATGC CCATACGGAT ATGTTTTGTT GTTTACATTT TTGAGTGCAG TTAAC
|
Protein sequence | MTSFSIPLPP QHTSSVAPLS DSSFAPRTPP RFTRHRSDTM LSTSSSTSSS ILQTPETPSI LETLSTGLGL GIPAGQSNPL VQSPNELRIR RRASSYEFGA LWEGSPDDHP VGKPGPSSTF SVAGTGWRLC DSPEQHGRDD YFPPAASIAP ANVMTYQPSP LHQQTQSEAP RVADPGLTTT NPTRPVTARR VSLKLKDAIR LKLGRSKSSF KVLSRKTQRE GGELSHDQAS QSPSVTSSDP STRSRRQRLK SLISFSQSSN FSSQSVCSDT PSSDLAFSSG GSQGIALSAP LNADERSHSF IIPTTVDFSV EKAIVEACNE EIGQVSKSWE ALCFDGQLWP AINLAAIAHL LPISILHRIL KHSASFITDF SLRGMDAVGG KMLIKALVGE NVDIKLYDHI DLTPRLNIQR LDLSGCKTLT ENDLCWIISC CPNLRSLNLR GLSAVGPKSI GFIWKIETLE EIDVSYCRAL ELPFLLSYIK RISEVQAKNL RSIRAAGLFF RSNMLLLSII RRCHNLERLD LQGCHGLTDD MFENFHNYCI EDDKCLTSLT HLNVSNTPLT PAIFTYLNGH LPNLTHLEMA NLSGADNPDD DDDGYELSKM LKSMPKLRKV DLEDTAGLSG VSDMVLEALT PMDGDVGTTG CELEELKIGY ARVSSGAIVD LIKGCKKLRV LELDNTEANN TVMREFLRRS HPCSRLSIID CHNVTSAAYT EIAASTRARQ GWEGWPAVPF GYDKDVEMAE KAVLKTFWGW KRVVVPKGWV EMRNEAETME SRKRARREQA QDACSSTEGE SSDGSKWKGK GKAKDDVDGD SGRTRPRMRS NGSISREPVG CIIA
|
| |