Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK01250 |
Symbol | |
ID | 3254665 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 373606 |
End bp | 376639 |
Gene Length | 3034 bp |
Protein Length | 847 aa |
Translation table | |
GC content | 48% |
IMG OID | 638253615 |
Product | hypothetical protein |
Protein accession | XP_567733 |
Protein GI | 58260646 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAGATATC AACAAGAAGA GGAAAATACA GGTTAGCACA GCCGATAATC CATGTGCAGT TCCAGCTTAC ATTTCACAGC GAGCATGCGA CATCTGTCGG AAAAAGAAGA TTAAGTGCGA TGGACCTATG AACAGTCTGA GTAGCCATAG TGAGTAAAAG TAAATAGACC CTCGTATGGA ACCGATCTGG CTAACTCCTA ATGTAGAATG TGCTTACTGT ACTGAACGAG ATTTGGAATG TACCTATATT GAGGAGGCTC AACGAAGAGG TCCCCCTAAA GGGTCTGTTT TGAGATAAGC TTCTTACTTA TACTGACATT TTCCTGTCCA GCTATCTGGA AACCGTCGAG CAACGTTGCG GAAGGCTGGA GAAACTGCTT CAAGAGGCAA GCCATGTTTT CGCCCTTCCT CCTGATATCC ATCTAACTGG TATTTATTTC TGTTTCAGCT TCATCCCGAC TTCAATTTCG ATTCCTACGT AGGCCCGCCT ATCGATCGTC AGACATTTGA CCTTGTTACC TATCACGACC TCCTCGATAC TCTTGAAATC CCACCTTATC CATCCCAAAA GCCTATAGTT ATTGATGATT TACGTCCCGG ATTATCAGCA TCCGATCCTT CAGTAACTCC GTTATCTTCT GAGTCGTCCG CGAACTCATT ATCGAACCTC AAGCGGATTC TGACACGCTT CTACAAGGCT GTCAGTTTAG ATGCCCAGAG AGAGTCTGAT ACGGAAGAAA CTGAGATGGA AGCCCGAACT CAGCTTAGCG TGGCAGAGTC GATGAAACGT ATGGGTGCAC GTGATCATCA CTGGAGATAT CACGGAACCG CCTCTGGTAT TCAATTTATC ACTCATCTGC AAGAACTTCG GTCCCGCGCC GCATCGAGAA GAATGGATTT CATCTCTTGT GTCAACCGTG TAAAGAGGCA GCGATTCTGG GAGGTTCCCG AGTGGGAGCT TGCCATTGCG AGCGAAGGCT TGAGCCCTCT CGACCTATCA AGCTGGCCCG AAAAAGGCCT CGACCAGCAA CTCATTGATG CCTATTTTGA TCACGTCAAC CTACATTTAC CGCTGCTTAA TCGAAAAGTC TTCCAAGAGC AGTATAATTC TTGCAAGTGG AAAAATCATC ACGGGTTTGC AAGGGTTTGC CTGTTACTAT TCGCGAATGG AGCGCGATAC CTTGATGACG ACCCCAGAGT CATTTGGCCA CTTGATCAGT GCGCATCGGA AAGTGGCCGC GATACCATAA TCAACGATCC GTATGATTTG AGGCGGTATT CTGCCGGATG GAGATACGTT CGCAGCTACT TGCGTATGGG TCAAAGTATC GTCCAGGGTC CAAATCTTTT TGAACTGCAG GGTAGAATTC TTACATGCCA ATTCGTTGCA GGAACTGCCA TCCCTCACTT TATCTGGATA GTCGCTGGCT ATGGTCTTCG TTCTGCTCAA GAACTCGGTA TCCACATGCG CGCCACCCTC CTCCATGCCG ACCCTACTGA GCGAGAGCTC TGCAGTCGAG CGTTCTGGTG TCTTTACCAT ATTGACCGCA TTAGTTGCTC TGCCATTGGG CGCACTCTCG CCATCCAGGA CTATGATTTC GATTTGCAAT ATCCTGCGGA TGTTGACGAC GAGTACTGGT CTACCGAAGA TCGAGAAAAG GATTTTCAGC AGCCAATAGG GAAGGGCCCA TCGATACTCG CCACGTTCAT CGAAACTCTG AAACTTGACC ACATTGTTGG AGCTGCTCTT CAGACAATTC ACACGCACAG CAAAGGATGG CTGAAATCCA AGGATATCAC TGCCAACTAC GCCGTCCTGG CTGAGCTTGA CTCTGCTCTC AACAATTGGG AAAAACAAGT CCCGCAGGCT TTAAAGTGGA ACCCCCAGCA GTCTGACACT CGGCTATTCA AGCAATGCGC CTTGTTGTAC TGTCGCTATC ACTATGTTCG AATGCTCATC CACAAGCCTT TTGTTCCGAT CCAATACAAA GCCAAAAAGC AACATTCTGA TGCTTTGCCA TCGCTCCGAA TATGCGTCGA GGCAGCTCGT TCCTTAGCAA GCATTTTTGA CGCGTGCCTC GTGCGTGGAC GGCGGGAACA ATATCAGCCT TCTGTAGATG TTGAGATGGC CCTCCCCATG TGGGATGCAG CCACCATCTT GCTCGTTGAT ATCTTTTCAA GCAAGCAGAC TGCTTCAGAA CGAGAAACGG CGTTGATGTG TCTGAGATGT TGCCAACAAA CGAGTGCTGT ATTGGAAGGA ACTTGGAGGC AAGTTGCAAA ATATTCTGAT TTTCTGGTAT CCTTGATGGA TGAGAGTATC ATGCCCCCCA TAGTTGATGA AAGTCGGGCA GGGAGTGAGA AGCGGACCAG GCAGGATGAT GACAGTGGGC GGTCGAAGAA AAGGGCTTGG AGAGCAGGTA GCGAGCCCAT AATGAATCCG AGAGGTGAAA ATGAGGAGAG CAATTTGCAA TCAAGGTCTC AGTCGAGGAT GAACACTCTC GTCTCTCAAC AAGAACCCAC TCGCACTCCG TTTGTCGGCA CCGACATACC TTTTGTGTCT TCTACCCCTA CTCACGATAT TCCCAACCCC AACATGCAAC TCCCTTCTTT CCCATACTCT CAAGCCCGAG GAGAATATCC TAGTCAACCA TCACCGGCTG CTTTTGTACC GTCTACTCAA GACACTTTTC AACCTTCGGA TATCATGTAT GATTGGTTGA TGAATATGGC CAGTGCTGCC CCACAAACAC TTAGTATGAG TTCGATGGCG TTAGGTGGGG CCGAAGGTGG GATTGACGAC CCAATTTGGG CGCAGCTTTT TGGAGATACG GTGTTTTGTG AGTCCCATCA AAAATATAAC TCCTCCATTG CACCGACTAA CACCCTGGAA GAAATGTATT ATTGCATCTT CACAATAGAT AATCCATGTA CTGTACACGA CCAAAAATGG TTGATGTTCA TTATTACATC ACATTACGTT CAAATTTGCC GCAATTATTT TATCACTGGC CCAAGGTCAA TACAATAGTA AGAA
|
Protein sequence | MNSLSSHKCA YCTERDLECT YIEEAQRRGP PKGYLETVEQ RCGRLEKLLQ ELHPDFNFDS YVGPPIDRQT FDLVTYHDLL DTLEIPPYPS QKPIVIDDLR PGLSASDPSV TPLSSESSAN SLSNLKRILT RFYKAVSLDA QRESDTEETE MEARTQLSVA ESMKRMGARD HHWRYHGTAS GIQFITHLQE LRSRAASRRM DFISCVNRVK RQRFWEVPEW ELAIASEGLS PLDLSSWPEK GLDQQLIDAY FDHVNLHLPL LNRKVFQEQY NSCKWKNHHG FARVCLLLFA NGARYLDDDP RVIWPLDQCA SESGRDTIIN DPYDLRRYSA GWRYVRSYLR MGQSIVQGPN LFELQGRILT CQFVAGTAIP HFIWIVAGYG LRSAQELGIH MRATLLHADP TERELCSRAF WCLYHIDRIS CSAIGRTLAI QDYDFDLQYP ADVDDEYWST EDREKDFQQP IGKGPSILAT FIETLKLDHI VGAALQTIHT HSKGWLKSKD ITANYAVLAE LDSALNNWEK QVPQALKWNP QQSDTRLFKQ CALLYCRYHY VRMLIHKPFV PIQYKAKKQH SDALPSLRIC VEAARSLASI FDACLVRGRR EQYQPSVDVE MALPMWDAAT ILLVDIFSSK QTASERETAL MCLRCCQQTS AVLEGTWRQV AKYSDFLVSL MDESIMPPIV DESRAGSEKR TRQDDDSGRS KKRAWRAGSE PIMNPRGENE ESNLQSRSQS RMNTLVSQQE PTRTPFVGTD IPFVSSTPTH DIPNPNMQLP SFPYSQARGE YPSQPSPAAF VPSTQDTFQP SDIMYDWLMN MASAAPQTLS MSSMALGGAE GGIDDPIWAQ LFGDTVF
|
| |