Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF01450 |
Symbol | |
ID | 3258113 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 424167 |
End bp | 427407 |
Gene Length | 3241 bp |
Protein Length | 769 aa |
Translation table | |
GC content | 46% |
IMG OID | 638257270 |
Product | conserved hypothetical protein |
Protein accession | XP_571677 |
Protein GI | 58269042 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0507307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTGTC GGGCAGGAGG ATGGTCAGAA AGGAGCTTGC CACTTGCAAC AGTTTTGTTA ACACCTTGGG CGCTTTTCCA AGGAGCCTCG CAAGAGGTGA TAGACAGGGC GAAGGAGGAA GGCGATGATG AATTATTGAG GCAGGGTGGA ACTGCGTTAG AGGTGAATTC TAATGGTCTT AAACGCGCAT GACTGGGCTG ATACATATAG GAAGATGGCC ATTATTAGCA GTTCCAAGAG ATTTATTAGA AGTCCCTCAT GCCAAAAAGT CATCGGCAAG TTTCTTCTGC TTGGTATGGA CGATTTGATG ACAATCATTG CAGAGGGCAT ATGGGGCGGA CGTATCATCT ATACTGCCCT GAATGCGCAT GCTCTCATAG CTGACGTACG CCCGTTTAAC CGTCAAAGAA AAAGAACAAA CAGAATTGCT GATTCCGGTT GATAGAACTA TAAAAAGAAG CCTATTCAGA TGTATAATCC TCACAAAGCA CCATTTTTGG ATCATTACAG GTAAGGTTTA GGATGTTCAG CTGTACGCAG TTTTGGCTGA TGTGATTGTA AAATGTAGGT TAAAGGTACC ACGGTATCGA TCCATGCTTG AATATGTCAA TTTCCTCGTC TTATTTATAC TGTATGTCAT TGCCATTGAA GGCCTTGTCG AGAGCCGCAT CAATGGCCGA GAGTGGGCGT TCATTATCTA TGCTATGGGT AAGTACGATA ATAATCAGGT AGACGCGGAA GTTCAATGTT TCCTTAGCAT TCTCCCTAGA TAAACTTGCG GCTATCCGAG AGCATGGGCT GAAAGGTGAT GCAGTCCATC TGTTTGTTAC GTGAGGCTAA TGTGCTCTAT AGTATTCAGC AGCAGCCTTG TCAACGGGTT CGATCTAGTT TTCATGATCA TCTACGCCGT GTATCTCGGA GCGAGAACTT ACGGAGCCCG ATATCATAAC GAATACGCGC TGGGGCTAGG AGCAGATTGG TTAGCAATAG GTCAGTCCAG ATATTAACGG AAGCAGTCAA TGATCGGAGG GTATTTATCA TAAAGACTTA ACAGGTGCTG TACTAATTTT TCCTCGTCTG GCGTTCGTCA CGCTTGCCAA TAACCTGATG ATTTTGAGCA TACGGTCGAT GTTGACAGAA TTCTTCTGTA AGTTTGTTAT CTGAAGTCAA CATTAGTCTA TGCCAAAAGT TTACTCGTTT CAGTTTTGAT GGGAGTTGGT ATATTCTGTT TTCTAGGTAC GTCCGGGCTT CTGAACTGTA TGATAGGGAT TTGATCGTTT TATAGGCTTC GTATACGCGC TATTCACTCT TGGTCAAGGA AAATTCGAGT TATCACAAAT AGCGTGGTGG TTGTTGGAGG TGTACTTTGG ACTAGATGCT TCAGGGTTTG AACATGCTTG TGAGTGAGAC GAAAGGAAAT TATCGCATTG AAACTCTAAT CGATCCTCGA AGATCTTTTT CACCCATTTC TAGGGCCTCT GCTCATGGTC TTCTACGCCT TACTTTCAAA CACATTGCTC TTGACTGTAC TTGTCGCCAT CCTTGGCAAT ACCTTTGCCA CTATCAACGC CGATGCCGCT GCAGAGGTAA GCGCATGGAA ATACATCAGA ATTTTCGCTT AAACGACGCC ACAATCAGTC AATGTTTCGA AAAGCTGTAT CTACTCTTGA AGGCGTAAAG GCAGGTGCGC ATCACAAACC CTCTTCCTAT TCGAACAGAC TTAAGCTAAT ATTGAATCTA GATGCTGTAT TTAGTTACCA GTTACCTTTC AATTTGGTTG CTGTGATCAT AATGTGGCCG ATGAGATACG TTCTCAACGC GAGGTGGTAA GTAACTGACT TCTTTTTCGC TACTAGCTAA ACATTTTCCC ATTAGGTTTC ATAAGGTCAA TGGTGAGTGG TAGAAATTTC GATAATGCTT CAGCTGATCC ATCGCCAGTT TTCATGATCA GGGTGACGAG CGTACATATA TTGCTCTTGA TAGCCCTGTA CGAAAGGCAG TCATATCAGG ATCAAGGTCT GATGGAGCAG CTTGGGGACT TCGCGGAAAG ATATGTTGGG AGTCTTCCTA GACGTCTCAA GGCGGCAGGT GATTCTTATC ATTATCCATC TTGTGTACTA GCAATACTTA TATCATTTGT TCTAATCTAG CTGGTTTCGA CAATTTTGCT TCGAGAAGCG ATATTGCGGC AGTGTTTGAG ATCGAAAGGG AGGTAGGGGC CTTCTATGCT GGATGGGACG ATGAAGTGGA TGAGTCAGAG ATCATTTTAC CCCCTGCCTT CGATGGCGAT CCCCCGTCAA TGAACAATGA CGACGAAAGA CCCGGTCAAG ACATAACTGC CTTTGATTCC GCCACTGCTC CCTCGAAAAA ACATTCTTCA CCTCCATCAC CCGCATCTCC TTTGACGGCG TCCCCGTCGC GCCTTGAACA TCAGTTGGAA AACCCTCACG CTCGACGTAA CTCTATGCCA TCATCACATC GGTCATACAT CCAAAATCCC TATCAAGTCC CTATACGTCG ACGAAATAGT TCTATCCACG GCCCTAGTCC TTTGGCCCAG CTTTTTGTTC GAGGCTCGGA ATCTGATGCT TTGAGGGGGA GGAGGGCGTC GATGGCTGGA GCCATAGGCG CAGGTCCCGC GCTTGCTGCG CCCCCTGTGT TCGGTCCATC TTCCAAACCT AGACGGAGTC ATTTTAGGTC GGAATCATTC CCAGAGTTTT CAAGCAAAGA GCAAGATCCG CATCAACATC CAGCAACAAA TTCAGTATCC CGCTCCAATA AATCATATCT CATAAATCCT TCCATCGCGC CTATCACCGA AGGCAAGAGT GTTTCATTCT CAAGCGATCT GAAAGATCCG GAAGACGATC CAGTATGCGA TCCCAGCAGC TCCATTGCTG GTCGCAAAGA AGGGAACCTT GCTGTCGGTC ATGGTGGTCT GCCTATCGGT AATAGGGAAG AAGTTAAAAC TTCACCACTG TCGGTCAAGG CTACGCGCTT TCAGACAGCT TTTCCAGGCG CATACTCCCC TTTAGGTACC GTTAATGATT CGCGACCACA CACTCCAAAC TCACAGGCCA GCAATGTGGC TAAGGCTTCT CAAGTGGAAG TTCTGGCTCT GGCCAGGCCA GAAGAAGAAG CATTAAAACA AAATATGAAG GAGAAGCTTG AAGAAATGGA CCGGCGACAG AAACACATAG AGCAATTATT GGAACGATTG CTTGGGCACT TTGAACGATG A
|
Protein sequence | MNCRAGGWSE RSLPLATVLL TPWALFQGAS QEVIDRAKEE GDDELLRQGG TALEMAIISS SKRFIRSPSC QKVIGKFLLL EGIWGGRIIY TALNAHALIA DNYKKKPIQM YNPHKAPFLD HYRLKVPRYR SMLEYVNFLV LFILYVIAIE GLVESRINGR EWAFIIYAMA FSLDKLAAIR EHGLKVFSSS LVNGFDLVFM IIYAVYLGAR TYGARYHNEY ALGLGADWLA IGFVYALFTL GQGKFELSQI AWWLLEVYFG LDASGFEHAW PLLMVFYALL SNTLLLTVLV AILGNTFATI NADAAAESMF RKAVSTLEGV KADAVFSYQL PFNLVAVIIM WPMRYVLNAR WFHKVNVFMI RVTSVHILLL IALYERQSYQ DQGLMEQLGD FAERYVGSLP RRLKAAAGFD NFASRSDIAA VFEIEREVGA FYAGWDDEVD ESEIILPPAF DGDPPSMNND DERPGQDITA FDSATAPSKK HSSPPSPASP LTASPSRLEH QLENPHARRN SMPSSHRSYI QNPYQVPIRR RNSSIHGPSP LAQLFVRGSE SDALRGRRAS MAGAIGAGPA LAAPPVFGPS SKPRRSHFRS ESFPEFSSKE QDPHQHPATN SVSRSNKSYL INPSIAPITE GKSVSFSSDL KDPEDDPVCD PSSSIAGRKE GNLAVGHGGL PIGNREEVKT SPLSVKATRF QTAFPGAYSP LGTVNDSRPH TPNSQASNVA KASQVEVLAL ARPEEEALKQ NMKEKLEEMD RRQKHIEQLL ERLLGHFER
|
| |