Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE03970 |
Symbol | |
ID | 3257627 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 1120958 |
End bp | 1123863 |
Gene Length | 2906 bp |
Protein Length | 512 aa |
Translation table | |
GC content | 50% |
IMG OID | 638256980 |
Product | conserved hypothetical protein |
Protein accession | XP_571195 |
Protein GI | 58268078 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCTAACAT CCATATCCCT CCATACTCTT TCTTCCAACC TCTTTTATCC GCCACCGCTG CTTCTTCCTT CCTCACCGCT GCTTTCATAT ATAGACCGCC GCAACCCCTC GCCCACCCTC CCCGCAGTTA GCCCCGTACC TTTCGCGGCC CCGGCCCTTA CAAATCGCAC TCGCACACTC GCCCCATTTC GACATTGCGC CCGCTCTCCG CCGCCACGTC TCTCTATCCA CTGTCGCCTA ACAGTAAACA TCTGGCCCAG TCTGCCTCGC ACAGGCCGCC AGTTTATATC AGAACCCATC TTCTTGGCGT TCTTTTTCTT GGCACCACTG TAGCTCCGCA AATCGACATT CACGCGAGTA CACCACGGCA AAGATGGGTC TTTTGAGTCA GCGAATGGAT TCTCTCTCCA TCAGAGGGAA GACTTCCAGA CGCAGCTCCG TCGCGTCCCC TGCCGGCACC CCAGCCAGAG ACTCGGGCGA GACTCAGATC GACGATGGCG ACCCCAACGA TACAAGCACT CTCGACCAAG AAGAAGGCAA CATCCTGTTG TCCATTATCT CTCAATGTAA GCATTCGCAG CTACATATAC AGTGGTTTGC TGACAAGCTG TATGTCAGTG CGACCTGGAA TGGACCTGTC CAAGATTGCC CTTCCCACTT TCGTGCTTGA GCCCAGGAGT CTTTTGGAAA GGATCACTGA CTTTTTCTCA CATCCGGAGT TGATCTTTGG GTGATTTCTT CGTATTCCAA CTTTTATACA TTTCGATACT GACGCTGTGG GTTTTTCAGG GCCGGCAATG ACCCCGATGC AAAACAGAGA TACATCCGTG TGATGACATT TTACCTGAGT GGCTGGCACA TTAAACCCAA GGGGTGAGTT GGATTCATTC ACATTTTTAC CCATTTTTTG CTGATGATTG GTGTCTCTGC TAGCGTCAAA AAACCGTTCG TTATGTCCTC CATTGTTACT CTCAGTCTTA CTGATCGTCT TGCAGGTACA ACCCTGTTCT TGGGGAATTT TTCCGTTGCA CCTACGTTTA CCCTGATGGA TCGGAAGGAT TCTACATTGC CGAACAGGTC TCACACCATC CTGTAAGTAC ATTGTCACCT TAGACACAAT TTTGTATCTA ACAAAAAGAA CGTAGCCTGT ATCTGCATTC TTCTACATTA GTCCTAAGAA TGGGTTGCTT GTAACCGGAG AGCTTAAGCC AAAAGTTTGT ATCTGTTAAC CGTCTTTAAT AGGAGATGAT GCTGACTGGG CGGGTGACAG AGCAAGTTCC TTGGTAACAG TGCCGCTACT ATTATGGAGG GTGAAGACCG GATACGGCTG TTGGACCGAC CCGAGGATGG GGAGTATTCC ATCACTGTAA GCTGTGGTCA ATTCAATCAG CTGTAGCTCC ACGAGCTAAC ATTTGTCCTA GATGCCCAAC ACCTACGCAC GAGGTATTCT CTTCGGCAAG ATGCTTTTGG AGCTCTGTGA CGTGTCTAGC ATTGCCAATG CCAAGAATGA CTTCCACTGT GATGTCGACT TCAAGGCAAA GGTAAGGACG TCCTCGCCTG ATCTACCAGT TTGGCTAATC CGCTGGATTC TTAGGGATGG ATATCGGGTG GTTACAACGT AATTTCTGGC AAGGTCGTTG GTCCGGGCAG GTCGGACATT GGGGAGATTA GTGGCCACTG GTCATCTGCT ATTGAGTTCA AAGACAAGGA CACCAAGGAG AAAAAGGTTC TTTTCGACCC TTCTACTTCA CGTGTTGCGC CAAAGAAGGT TTTGCCCGAA TCTGAGCAGG AGGAGTATGA GTCTAGGAGG TGTGTTTTCC TTCCCTTTTT TTTCTGCAAG AGAAGGCCTG CGGCTGATTT CCGTCCTTGC TTAGGTTGTG GACTAAGCTC ACTGATGCTA TCCGAGCTGC CGACATGCAC GGTGCAACCG CCGCCAAGTC CGCTGTAGAA GACCGCCAGC GAGAGCTTGC CAAGAAGCGA GAATCCGCAG GCGAACCCGC TCACGAGCAA AGATTTTTCA AGCACGTTGC CGGGGACAAG TGGATGCCCA AGTTGGACGT TGACAAGTGA GCCGACGCGT TCCTGTCCAT CTTTTCCAAG TCGATAAGCT AAATGTCACC TTGATAGTTT GCCCAAGGAC CGAAATGAGA TGGAGGATGT AGTTCGTAAA TGGATCTTTG GCGACAAGAA TCCCTTGGAC TACGAGAGTG TCAAGACGAC TCCTCGAGGT TCCCAATCCA CGGAGTCTGG AGCTGCCCCT GTCGCCGCGG GTGAAGCTAC TCCCGTCGCT GCGGGTGGAG CTGTTGCTGG CGGAGCTGCT GGAGCTGCCG GTGTCGCTGG CGCTGATGTT AGTTCTTCCA ACACTAGTGT AGCCGAATCT ACAAGAAGTA CCTCTACAGA GCCTCGTAAG TTTTATAGGC AAACCAACCC CTTTAGATGT ATAAAAACTG ACATGAGAAT CAAAGCTGCA GTACCCACCG CACCAGGTCC TCCTGTCGGT CAACCAGTTG ACCATCCCGT ACCTCCTAAA GCCTAAACCC TTCCCTAGAA TCTCATCAAA CTCCTATCTT TTTTCCCGCA CGATACCCAA CACGCGCGCA AGTCTTACCT GGTGATGATG CCTGTAGAGT CGGTCCCCGC GAGTAGAAAG CCGAGAAGTA ATTTTTTCAA AGAGGGTGGG TTAGTAATGA GGACTATATG AATGACGCGG ACGAGCAGAA GGAAGACAGT TATTCAACCC CCAACTGTAG ACTATCAACT TTTTAAACCT GCTTCATATC CAGTCTCCTT TTGCAGGTTT TGCCTTTATC TTTTGCTCTT GTTTCTAGTA TTTTATTGCT TGGATTCATA AGTGTACGGC TGGGCTCTGA CGTCGAGAAC GAGTTAACTG TGCTTTAATG ATACAC
|
Protein sequence | MGLLSQRMDS LSIRGKTSRR SSVASPAGTP ARDSGETQID DGDPNDTSTL DQEEGNILLS IISQLRPGMD LSKIALPTFV LEPRSLLERI TDFFSHPELI FGAGNDPDAK QRYIRVMTFY LSGWHIKPKG VKKPYNPVLG EFFRCTYVYP DGSEGFYIAE QVSHHPPVSA FFYISPKNGL LVTGELKPKS KFLGNSAATI MEGEDRIRLL DRPEDGEYSI TMPNTYARGI LFGKMLLELC DVSSIANAKN DFHCDVDFKA KGWISGGYNV ISGKVVGPGR SDIGEISGHW SSAIEFKDKD TKEKKVLFDP STSRVAPKKV LPESEQEEYE SRRLWTKLTD AIRAADMHGA TAAKSAVEDR QRELAKKRES AGEPAHEQRF FKHVAGDKWM PKLDVDNLPK DRNEMEDVVR KWIFGDKNPL DYESVKTTPR GSQSTESGAA PVAAGEATPV AAGGAVAGGA AGAAGVAGAD VSSSNTSVAE STRSTSTEPP AVPTAPGPPV GQPVDHPVPP KA
|
| |