Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK00800 |
Symbol | |
ID | 3254628 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 253960 |
End bp | 256973 |
Gene Length | 3014 bp |
Protein Length | 699 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253570 |
Product | expressed protein |
Protein accession | XP_567732 |
Protein GI | 58260644 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.860316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTGTCTTA ACATTTATAC TTTCATTGCT CTCAATTCAC CCATTATACA ACCACCGCGT AGTAGCTGCG ACGAATACAG GCGATTCTGG CTGGTCTAGC ATTCCCAGGG CTGGTCTCTG TTCACCGGTC TTATCAAAGC AATATCTGAT TCCGCCAACG CAATCGCCGG CCTTCTGTTA CTAGTACACC CTTTTTTCCC ATCAATTGTA TTCCTACGGA ATTGCTCCAT ACTACAACAT GGACGACAGT CAAGCTGGCC CTTCCCTTAC TGCATATGCA AGCCGTTTCC TTGCGAACAG AATGGGGGAT AGAGGCAGAG AGGTCCAAGG TAGTCAGGTA TGTGTCATGG TTTCTCAGTT GTCGATCGAT CTTGAACTAA CTATCCTTAT CTGTAGATAT TCCGATCTCC ATCACCATCA TCGCCTACCC ACGACCCCTT CATTCCTTCA CCATCTCTAA ACGCATCTCA TCCTCACCTC CCTGGATCAC GATCACCTTC GCGCTCACCG ACTCGTACTC CGCCTTTCCC CGGTGCTGGA ATTGAGGCCA TACCAGATAT TGATGGCACA ATGGGAGCTA GTAGCGTTGG AGTTGGTCTC CTTTTTGACG GACCTGAGGA TGATGACCAA CCAAAACCTC AAGAGAGTTT TACACAGCCA CCTGTCGGGT TCGGCGGAAA CAAAGGGAAA GAACGGTCGA GGATACCTAA TCCCTACGAA TCTTCTTCCG AAAGTGAAGA CGAGGATGAA GGCGAGCCCG AGATGGCCCT AGATGAGGTC GATACTGTGA GACGTTCGTT ACTCCGACCT CATCCGCAGC AACAACGCGA ACCTCTCTCA GAACGCGCAA AGAAGGGTTG GCTGGCACAT CAGAGTGTGT TCCCGCCTTC ATCTTCATCA TCCAGCGATG ATGAGAGTGA TAAAGAGACG GAATCTGAAT CTGGGGATGG TGAATCGAGA TTTACGAATG GTGGTCAAGG ATTGTACGAG GGGGGGGCGG ATAGCTCCAA CTTGTATGAC ACGCGGACGA TACCTGCCGC CTACAATGTC GCTACTAACC TGGAGGAACC TCTTCTTGCC GAAGAAGATG GACAACGAGA GTCAAGAGTT CCTGTCAGGC TTCATGTTTA CCACGGTCGC TTTGGGCATT GGGAGAAAGA AGGGTTGAGG AAGTACAAAG GTCAGTACAT TGACTCATAT AAGGCATGGG ACTGATGGTT CATGGAAGAC TCTGGATTCC TCGCACTTTG GCTAGCGTCA TTAATAGGTA TCATTGTTGG CCTGTTATTT GTATGGGGCT CTACTGACGT GAGTCATGCT GCTGTTTTAC CTTTGCAGAA CGCTCGCTGA CCAGTCTTGT AGCCTTCACC GGACGCTCCT CGTTCGCCCC CGTCGATTCT CCCGCTTCTT CCGCTTCTCC TTATTCTTCT CATTCCCCCT CTTGTCCTTC CACCAGCCTT CCTTTTCTTG CTGCAGAAGA CCGTTCGGCC CGTTCTCATG GGCACGGCCA TAAGCATCCC ATTCTCGTTG TTCGTTTGTG GATGGTGGGC TTTAGGAGCA AGTTTCGAGA CTGCTGGCTT GGATGGGGTG GAACAAGGAG ACAGATGGTG GGGTACGACA GGGCTGAGGG TCGGTGCTTT GTTACTATGG GCATTGGCAG GTTGGTTTGG CAGGCTTGTC TGGCTTAGGA GGAAGAGGCT AGAAAGGGCT GCTTCAGTTG TTGAAGTGAG TCAGCCTCTT GATCAGGTTC AAACTCGATA ACTAATCCCT CTCTAGCTTT CTACCAAGCT TCTCCTATCC CATCCACCTC TCCTTCTTCT GACTCCCATC CTTCTTGGTG TATTCGCTAT TACCTCCATT CCATTCCTTA CTTTACTCAT TCGGTTGGGC ATGATCGGCT ATTGGCGTCA TCCGCGGGAG AACACTTGGG TGTTCCACAT CCGACCTTAT GCTGGGTGGC TCATCTTCTT GGTGACTTTG GTCTGGGTCT GGACCTGGGG TGTGATTCGA GGTGTTGGAA GAGTTGCGGT CGCTGGCGTT ATTGGAGAGT GGTACTTTCA TCGGTAAATT ATCAAATTCC ATGGCTCTAA TGCGCCTGAC TGATTTGTAA TTGATAGTGA GGAGCACAGT CGACAGGATC CTGTCGAAGT CACCACTGCT GCGGTACATC GAGCCACAGG CTCTTCACTT GGCTCCATCT GCCTTGGTGC AGGTATCATT GCCGTCGTGC GTACTGTTGG TCGCGCGGCC TCTACCCTTA AGCAATACAC CTCACCTAAA AACACTCGCC TTCCCTCCTT CCTATCTTTC CTTCACCATC TCGCCCCCGT TTTCACCATT ATCGCGGGCG TTCTCGATCA GCTCAACGGT TATGCCTTGG TATACGTCGG TATCACCGGC GATGCCTTTT GGCCTAGCGC CCGTCGGGCT GTCGGTCTCG CTGGGAGACG TAGGGTGGGC AAGTTACTGG ACTACACCCT CATCAAACTT TTGCTCACTC TGAGTTCGAC GGCGATGGGA TTGTTCACTG CTACAGCAGG CTACTTGTAC ATGGCTCATT CCATGGGTAA CCCGGGATAT GCGCCTTTGG CAGGGATGCT ATGTGGTGGT GTGCCGTTTT TGGCTGTCAG AGCTGGTGCT GGGGTCTTGA CTGATGCGTA AATCTTTCCT TTAACCAAAG CATGCCAAGG CAGGAAACTG ATGTGTTACA GAGCGGATGC GTTATTTATC TGCTATCAAA TTGATAGAGA GCTTGGAGGG CAACATAGCG AGGAAACGAA GGGTGCGTTC TTGGGGGAAC AGCCTCGAGG AGCTGGTGCA GTTTGATGCA GTTTGAAAGT CAGTCACGAC TTACAACACA TCTGTATTTC CATAATAGAC TATTCAGTAT CCGTGGTGTA ATTATTTAGT CCGAGCAAAG ACTCTGTATA CTATAGAGAA GGAAGTGGAG GGTTGCCATC GCGTCAACAA GAGCAAGGGA GTCGTGCATT AATCGAGTGT GTTGTAATGA GCTT
|
Protein sequence | MDDSQAGPSL TAYASRFLAN RMGDRGREVQ GSQIFRSPSP SSPTHDPFIP SPSLNASHPH LPGSRSPSRS PTRTPPFPGA GIEAIPDIDG TMGASSVGVG LLFDGPEDDD QPKPQESFTQ PPVGFGGNKG KERSRIPNPY ESSSESEDED EGEPEMALDE VDTVRRSLLR PHPQQQREPL SERAKKGWLA HQSVFPPSSS SSSDDESDKE TESESGDGES RFTNGGQGLY EGGADSSNLY DTRTIPAAYN VATNLEEPLL AEEDGQRESR VPVRLHVYHG RFGHWEKEGL RKYKAFTGRS SFAPVDSPAS SASPYSSHSP SCPSTSLPFL AAEDRSARSH GHGHKHPILV VRLRVGALLL WALAGWFGRL VWLRRKRLER AASVVELSTK LLLSHPPLLL LTPILLGVFA ITSIPFLTLL IRLGMIGYWR HPRENTWVFH IRPYAGWLIF LVTLVWVWTW GVIRGVGRVA VAGVIGEWYF HREEHSRQDP VEVTTAAVHR ATGSSLGSIC LGAGIIAVVR TVGRAASTLK QYTSPKNTRL PSFLSFLHHL APVFTIIAGV LDQLNGYALV YVGITGDAFW PSARRAVGLA GRRRVGKLLD YTLIKLLLTL SSTAMGLFTA TAGYLYMAHS MGNPGYAPLA GMLCGGVPFL AVRAGAGVLT DAADALFICY QIDRELGGQH SEETKGAFLG EQPRGAGAV
|
| |