Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC04900 |
Symbol | |
ID | 3256546 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1482768 |
End bp | 1485759 |
Gene Length | 2992 bp |
Protein Length | 845 aa |
Translation table | |
GC content | 52% |
IMG OID | 638255709 |
Product | conserved hypothetical protein |
Protein accession | XP_570047 |
Protein GI | 58265782 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.266265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCGTG AGTATTCTGC TGCAAAGTCA CAGTATGCCT TTCCCTCCCG CTGTTTTGAT ACTGGACACG CGATGATAGC TTCTCGCAAG ATCTCCAGCT GACGTCACTT TGGAACATAG TTATGCTCCG CTACGTTCAC TAGACCTCAA CATGTAGGTC GACACCTGAG ATCGCATACC GGCGATAGGC CCTACGAATG CAAAGAATGT CCCCTGCGGT TTGCCAGAAG GTGGGTTTAG CGAAAGGTAA GAGAAAGAGC CCATGCTGAT AGGATGGCGC AGTGACCTTC TTTCGCGCCA TGTCAATAAA GCACATAGGG CTCCTGATGA AGGGCCCAGC GACAAGAAAC CGACTAAAAA GGGAAGAAGG AAATCAGTCC CAGCCTCAAG CCACTCAGCG CTCACCAAAA TCAGAATTGA AGAACAAGAA GAAGAGCAGC GACGACAGTT GCTTCAGCGA CAACAGTTTC AGCGACAGCA ACAGCAGCAA CAGGAGCAGC AGCAGCAGCT ATTACAGCAG CAAATGCAAC AAGAAGCCGC GGCAGTAAGG GAACGGTCCA AGAGCGAAAG TCTCAATCAG CAACCTCATC TTCAAGCTCT CCGAATGTAT CCTCATCACC CGCTTCTGGC TTCTAGACCT GTTCAACCAT CTATTGCCTC AAATTCCTGG AACACCAATC CAAGCGGCTC TTTCGCCGCA GCGGGCATGA TGGCCATGTC ACCACCTTTC GACCCTACAA TTCAACGCCT CGGAGGAAAC TTACAACCTT TCCAAGGCCA ACCTTTTTTG GTTGGGCAAC TTCCCAATGA CCCTCCTTTT ACCGGCCAAC CTATGAGGCT CTCTGGATCA GATCAAGGCA TGAAGGCTGC AGACGGGCCT TCAGCTTCAG TCTTCTACGA GTTGGGTATG AAGAAGAGGG CATGTGATCA GTGCAATCAC TCGAAGGTGA GATGCGACTT TGCCGACCCT TGCCGTAAGT ATGCGCAATC TCATCAACAC AGTGTCCATT GCTGACTGAC TGCCACATCA GTTCGGTGTA GGCAAAGGAA CCTGTCATGC TCATATCGCA AACCTCCAAA GCTTTCTACT ACCATGGCAC CGCCTCCAGC CCCTTACAAC CTGCCTTTAA ACACAAGCAG CACAACAGTG TCTCCAACGT CGCCGTACTC TTCCCCGAAT TCTTCAACTA GTGTCCTGGC TCCCACCGAG CCTATTTCTT ACCGAAAGCC TTCTGTTGCA TCCCTTCCTC CAAACCTTGG CAATGTCCCC AATCAGTCAA ATGCCCAGTC CGTGCCATGG GTATCTTATC AAAATACCAT GGGTACCACC TGGCCATCTC CGCAAGCCCA GACCACATAC TCTATCCCTG ATGGTACGGC CCCTGGCAGT ATTACTGGTC AGTTTGTTGA GTCTCCTGGG GCTGTACATC CAGCGTTGCC GCCATATTCC TCACAGTCAA AAGCTCGAGC TGAACACAGC TTGTCCCCGC ATCGGTTGAG CATGACTCAG ACACCTTCAC TGGTTGACAG TATTGATACT TCTTCAGAGA TGGACATGGA CGATCCTGCT GAAAGAAGAG GCAGTCATAA CTCGTTTACC AGCTCAGTAC CCAGAACTCA ATGGGGTATC AAGGGTAAAG AGGTTGAAGG GGTTTTGAGG TTGCTTCCTA ACACCGGTGC TTTTGAGCAG AACACATCTC CTACCCAGAC TGCGTCTAAC CAGGTGTCGC CCGTCCCCGT CCGTGCAGAT CAGCTCTCAG ATATAGTTTC CAATCCTCAG TGGCAGTCAC AAGGAACCAT CAACCCAAAA TGGCAAACAC GGGGCAATTT CTCGTCGGAC GACGAGAGTT CTTCAGTGTT GTCCTCTTCT GCAAACTCGA CTTTTCTCGA GCAGTCGGTT AACGCCTCTG CCAGTCATGG AAACATTCAT CATGAACGTC GTAGATCTAG CGAAAGTGCA TGGGACAAGG CCATGGAACA AATGACAATA CAAGATCCAC AAAAGAATAT GGCCGTAGTT GGTGAGATTG CGGACGGAAG TGTGCCTACT GTTCCACAAA TTTCTGAGAC CGCATCTGAA GGGACACAAG AGCTGCCTGG CCTTATCGGA CCGAATGCTG AAGAATCGCC GATGGTGCCT ACCCTTTCCG ATGTGAAAGA TCTGTGGCGA TTTTTCATGA CTGAGCCGAT GACTGGCTTG ACTCCAGCTG GCGAAAAGCT TAATGAACTC GATAATCTAC CCGTTGTCAC TCCCAGACCA GGCATGGGCA AGCGCACCTT TAGCAAGTCA AGCTCGATGC CCGACCTTCA ATCTCCCTTG GTTACTGGAC CCGCGTTCTT CTCCACATTC CTGAGCGGTA TGACTCCCAA GCCAACAGAA GCCCAGCACT CGTATATGCC CTCGCATTGG GCGGAGAGAG ATCACTCCAG TGTCACCGAA GGCCTCGACG AGCCGGACAT GGGGAAGTGG AATAAAGAGA TTGAACAGAG GCAGTCGTCA TTCAGCCTGG GGAAACCGAA TGCCAAGTTG GGCAAAGGTA AACCAGATAC GTTGTCTGGG ATTAATGATG CCCCTATCCA ACCCCCTGCG ACGCAACCTA TTCGGCCTCT TCCCAGTGTC GTCCAGCGTT CTTCTGCCCT TGACCAAACT CTAGCGCCCG AAAGGCTTCC AAGTTTTGGT CTGACACCTG GATTCGAGAT TCAACAGAAC CCGCTCCTTT CCAAGCTTGG TTTGGCTTCT GCTGATGCCA GGACTGGAAG CAAGCGAATG GCCAGTTCTA CATTGGTTAA CGACCATAGA AAGGCGACTT TCACCGTATG GGACGAAGAT GGACCTGCAA CGCAGGGTCA GGGTAATGGG GCAGGTTACG GCGCGGAGAG CAAGGCGCTT GCAGGTGGGG GGGCTTTAGG AGTAAGTACC GCTCAGGGAA AGGGACTTCA ACAATCACCA TTCAAGAGCT GGGCCCTCAA TACTAGTGCC GGAGCAAGTT GA
|
Protein sequence | MSRRHLRSHT GDRPYECKEC PLRFARSDLL SRHVNKAHRA PDEGPSDKKP TKKGRRKSVP ASSHSALTKI RIEEQEEEQR RQLLQRQQFQ RQQQQQQEQQ QQLLQQQMQQ EAAAVRERSK SESLNQQPHL QALRMYPHHP LLASRPVQPS IASNSWNTNP SGSFAAAGMM AMSPPFDPTI QRLGGNLQPF QGQPFLVGQL PNDPPFTGQP MRLSGSDQGM KAADGPSASV FYELGMKKRA CDQCNHSKVR CDFADPCLRC RQRNLSCSYR KPPKLSTTMA PPPAPYNLPL NTSSTTVSPT SPYSSPNSST SVLAPTEPIS YRKPSVASLP PNLGNVPNQS NAQSVPWVSY QNTMGTTWPS PQAQTTYSIP DGTAPGSITE MDMDDPAERR GSHNSFTSSV PRTQWGIKGK EVEGVLRLLP NTGAFEQNTS PTQTASNQVS PVPVRADQLS DIVSNPQWQS QGTINPKWQT RGNFSSDDES SSVLSSSANS TFLEQSVNAS ASHGNIHHER RRSSESAWDK AMEQMTIQDP QKNMAVVGEI ADGSVPTVPQ ISETASEGTQ ELPGLIGPNA EESPMVPTLS DVKDLWRFFM TEPMTGLTPA GEKLNELDNL PVVTPRPGMG KRTFSKSSSM PDLQSPLVTG PAFFSTFLSG MTPKPTEAQH SYMPSHWAER DHSSVTEGLD EPDMGKWNKE IEQRQSSFSL GKPNAKLGKG KPDTLSGIND APIQPPATQP IRPLPSVVQR SSALDQTLAP ERLPSFGLTP GFEIQQNPLL SKLGLASADA RTGSKRMASS TLVNDHRKAT FTVWDEDGPA TQGQGNGAGY GAESKALAGG GALGSWALNT SAGAS
|
| |