Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK00810 |
Symbol | |
ID | 3254412 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 257701 |
End bp | 261290 |
Gene Length | 3590 bp |
Protein Length | 1050 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253571 |
Product | hypothetical protein |
Protein accession | XP_567643 |
Protein GI | 58260466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.510095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACCAAGAA AATCATCATC TTTAGCGTTA CCAATTTCGT GCCCCGGCAG ACTTCAATAC CTTTCCGCGG GAGAAACCAC ACGAAGACAA TCTCATGTCG TTGGCAGAAC AGCGTTCGAC TCAGGGTATC GTTCAGGAGC TCATGGAGGA GATCAGAGCT GGTTCTTCCA GAAACTCTCC CACGTCATCC ACCTCGACCA TCGAGTACAT TGGCCAAGTA CGTGTAACAG CAGGCTCTGG CCACAGGCGG ATACATTCCA CTCCCGGAGC ATCAGCGAGC TCGGAGGTTG ATGTCCGTGC GAGGACACTG AGTGATGGGG CATTCGCAAA GACGACTAAT ATGTTGGATG GGGGCAGTGC GAATCCGAGA GGCAGAATAA GAGAAGCAGT GACAGCGCCA ACATCGCCTG TGATGAATAA ACCCAAGCTT ACTTCAACGT CAGTCCCAGG ACCAAAGGAG AAAGTTGAAG CCGGAACAGC ATCAGCGGCC GAGACGAATG ATCCAAAACA GAGCTCAAGC TCATCCCTTC CGCCGTCTAA AATTCCCCTC GCTTTTCCTT CCACTCAGTC AGCTCAACAC GAGCCAAGCG CTTCACATCC TCAGGCGGAG ACGTCCCCAA CCACTCCCGA ATTGCGTATC ATCAAGTCGT CTGTCCCTCC ACCCTGTTCC CTGCCGTCGT CACCTCCGCT CTCTCCAACA AATTTGAGTT CAGGTGATAT GCACCCTTTC CCTGTGGTCG CGGATATTGA AAAAAACGTC TCACCGTCAA AAAAGCGGAG TGGAAATCTT CCTACACCTA GCACGAGAGA AAAGGGTAAG GGGAGTGAGA TAGGTGAAAG ACCAGGCGAA GGCAGAGCGC AGACAATGAG GAGGGGTTTC ATCGGGCCTG GTGGTGTTAT GCGTTAGTCG ATTCGTTTCG CGCAAGCCAC GAGGCTGATG ATCTCAATAG ACAAGATCGC TTCAAGTCAG TCTCTCAACA AGGTTCGAAC GGCCCTACCC TTTACCAGAA ACGTAGAAGC CGAACGTTCC AGTCCGGCCA TCTTGTCTTC AAAGGCCTTC TCCCCAACTA CGCAGGTTCA AACATCTTCT TCACTCAACA TTCCCAACCA AGCTACCTTC CTCCGTCCCG CCCCCCCTCC ACCGCGTACC CCCTCATACC GGCAGCCTCG CCAACCAGCC TTTGCGCGTC CTTCAACTCG CATATCTTCA CCTTCTCCAA GTCCATCACC TCACAAATCC CTTTCGCCTG GTGCAAATTT TGCACCCCGC CAAAGTCGAC CTAGAAAAGG CATGTCTAGT CAGTTGCCGA GGATGGGGCT GCCGGAGGAG GATGTTGAGA TGATCGAAAT ACCGAGATTT AAGAAGAGAG AGTTGAATTT GGGTATCGTG CGGAAACATC CTGGGAAGCA GAGGGCAGCG TGGCTGGCCT TAGGCGGATT ATGGGTTGTG AATGGGCTGT TGAGTCTGGT GAGTGTTAAT CCTGTTCGCC CTTTTGCGCT CGTCGTAGTG TTAAGATGGA GAAGGTTGAC GTCATCATTT ATCCTCAGTT CTTTGACATG AACGTAATTT ATATACTTGT ACAGCAAGTC GAAAGTTCTT TCATAACAGT TGGCAGGGAC CTAATTGAGC TGTAGATGTT CTATACACCC ATCATTTGAT ACCAACAATA CCAAGCTATG GCAATTTGCG GCCGCTGCGT ACGCAATCTT GTGGGCTCTC TCAACGATTG TGGTCTGGTT GGGCTGGGAG TTGGGGTATG AATTTTGGAG GCGTTGGAGA TTGGGTAAGT CCCTTAAAAT TATTATGCAG GTGCAGATTT GGTTAATGTT ACCGAAGAAC GCCCAGCCAT AGAGCCTATC TACCTTTCGC TGCCAGCATC TTTACATCTT TCCCTGAAGT CTTACGACCA CTTCATATTT TTGCTTCATA TCCGCACGTC AGCCTTTGGT ACTGCATACG CCAAAGACAT CATCCCAGAA ACATGTCATG CCCTTATCCA ACTCCTTCCC GGGCTTATAC CCCTTCTTCC CCGAGCAGCT ATTGCTGTCG TCATGTTGAT AAGCTTCTGG AAGCCAAGTG CCGACGTACA AGCGCCATAT GGAGGTCCCG TTGACAGGAC GGCCGATCGG GATCCAAATT TCTTTAGAAG TGACGCTCCC GGAGAGTTGA CAAATTACTC TAAAGGTGTT CTACTCACTT TTACAATGTA CATTGCTTTC AGACTCTTGG TTGTTATCGC TTCTGCGATT GGTTTATGGG TATCATCCGC TAGACCTCTT GGTGGATTCA TCGGGAAGCG TTTCCAACGC AGATCTCCGG TCGGCCCTGC TGGTCCGTCT ACTCCCCGTA GGCACCATCG AAAATCGGCT TTTCAACCGC GCGACCCCAG CCTCACCCAC TCTCCCCAGA AAAACTGGGT CGATGAGAGC TCCTGGGATT GGGCTTGGAA GGAGAGGATG AGGGCTAGGG TGCAAGATGC TTTCGAGCTC TGCATAGTAA GGCTTGATGG CGATGATGGT ATATTCAAGC AGACTGGTGA GACAGGAACG GGACAGGATG TGCCGTGGGC AAAATCTTCG TATAAAGCTA CTGAAAGGAT TCCCATGGAA GAAAGGGAAC AAAAGGACGC TAGTTATTCA GCCACCAATT TTATCGGCCA AATCATTGCG CATGAGTCCA GTCCATCTTG TTCACTATCC GCAATCGACG AGTTCGAAGC CACTCGGCCA GAGTCCATTC TTGATCCCAC TGGTGATGCA AAGGTTCAAC CGTCCTCGTC GAGAGCCGCT ACCAACCCTA CGAGTTCTAC GAACGATCTC TTTTACACTC CCCCCGCTAG TATCACTTCC GCTGCCAAGA AGGAATCATC CGTAGCTGAT GCGATCGTGA AAGGCGGGGT ACCTCTCAGC GCATACGAAA AGCCTTCCGG ATGGATGACC GAGTTTGGCG TCAAGGAGGA AAAAGCTCGA GAGACTCAGG ACTGTGAAGG AAGTGACAAT GAGTCAACAG GTCTCCTTTC TGCTCAGACG AGTCCACGGG AGTCCATGCT TTTCCGAGAA TGGTCTGGCT CAACTGCTTC TCATCTTTCC CACAAAATGT CCGGTGATGT TTCTCAGCAC TCTCACTCCA CGACATCTGG TACAGGATCC GGATCTGATA AGACCTCTAC AACATCTATC CGACAAAGTG CGTATACTAC TTCACATCCT CATGTTACGG ATTTGGCAAG AACAAGGTCA AGTAGTATCA CCATGATTAA AGAGAGTTTG AACGGCGTGG CACTTGCAGC TGCTAATGGG AGCGCGGGAC TTGTTAGAAG GGCAAGAAGT GGGACGATGT TGAGTAACGG GACCAAAGGG AAGAGGTATA ACAAGATTAG CGACGATGAA GGGGGTGAGG AGTTGGCCAA TGATGGTAGG TAAACGTGAT GCCTATCATT CAGAGGTTCT TAAACTGACG TTGCTACAAA GAACTTGTCA CACCACGCTC CAAGCATGAC AAAACTATGG GCCTTGGAGT ACCGTTTGTG TATCTGTCAC CACATGATGA CTATAATCAC CAAGGCACGA CAAGAAACGA TATGACATGA
|
Protein sequence | MSLAEQRSTQ GIVQELMEEI RAGSSRNSPT SSTSTIEYIG QVRVTAGSGH RRIHSTPGAS ASSEVDVRAR TLSDGAFAKT TNMLDGGSAN PRGRIREAVT APTSPVMNKP KLTSTSVPGP KEKVEAGTAS AAETNDPKQS SSSSLPPSKI PLAFPSTQSA QHEPSASHPQ AETSPTTPEL RIIKSSVPPP CSLPSSPPLS PTNLSSGDMH PFPVVADIEK NVSPSKKRSG NLPTPSTREK GKGSEIGERP GEGRAQTMRR GFIGPGGVMH KIASSQSLNK VRTALPFTRN VEAERSSPAI LSSKAFSPTT QVQTSSSLNI PNQATFLRPA PPPPRTPSYR QPRQPAFARP STRISSPSPS PSPHKSLSPG ANFAPRQSRP RKGMSSQLPR MGLPEEDVEM IEIPRFKKRE LNLGIVRKHP GKQRAAWLAL GGLWVVNGLL SLLWQFAAAA YAILWALSTI VVWLGWELGY EFWRRWRLDL VNVTEERPAI EPIYLSLPAS LHLSLKSYDH FIFLLHIRTS AFGTAYAKDI IPETCHALIQ LLPGLIPLLP RAAIAVVMLI SFWKPSADVQ APYGGPVDRT ADRDPNFFRS DAPGELTNYS KGVLLTFTMY IAFRLLVVIA SAIGLWVSSA RPLGGFIGKR FQRRSPVGPA GPSTPRRHHR KSAFQPRDPS LTHSPQKNWV DESSWDWAWK ERMRARVQDA FELCIVRLDG DDGIFKQTGE TGTGQDVPWA KSSYKATERI PMEEREQKDA SYSATNFIGQ IIAHESSPSC SLSAIDEFEA TRPESILDPT GDAKVQPSSS RAATNPTSST NDLFYTPPAS ITSAAKKESS VADAIVKGGV PLSAYEKPSG WMTEFGVKEE KARETQDCEG SDNESTGLLS AQTSPRESML FREWSGSTAS HLSHKMSGDV SQHSHSTTSG TGSGSDKTST TSIRQSAYTT SHPHVTDLAR TRSSSITMIK ESLNGVALAA ANGSAGLVRR ARSGTMLSNG TKGKRYNKIS DDEGGEELAN DELVTPRSKH DKTMGLGVPF VYLSPHDDYN HQGTTRNDMT
|
| |