Gene Information |
Locus tag | CNH03550 |
Symbol | |
ID | 3259137 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 83359 |
End bp | 89046 |
Gene Length | 5688 bp |
Protein Length | 1602 aa |
Translation table | |
GC content | 52% |
IMG OID | 638258128 |
Product | hypothetical protein |
Protein accession | XP_572519 |
Protein GI | 58270726 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
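The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates (the record does not state this, but it is consistent with the listed Gene Length) and that the protein is followed by a single stop codon. The helper names `span_length` and `coding_bases` are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


# Values copied from the Gene Information table.
gene_len = span_length(83359, 89046)
cds_len = coding_bases(1602)

print(gene_len)            # 5688, matches the listed Gene Length
print(cds_len)             # 4809 coding bases for a 1602 aa protein
print(gene_len - cds_len)  # 879 bases in the span that are not translated
```

Under these assumptions, the 879 bp difference is what one would expect for a spliced gene (Is gene spliced: Yes): the span includes introns and any untranslated flanking bases in addition to the coding sequence.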
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.122713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGATCCACA ACACGTAACA CATTCTGTGC CAGCTAAGAT CCAGACAGGA CATAACCATG TTCCCAGTCC CATCACATCT CCACCGGTCA GGAGGGGAAC ATATTTCCTC TGAGGCAGAG GCAGAAGTCA GTGTTCAAGG CAACCAACCT TCCGTGGAGG CACAGTGCGA ACAGGCGCAA TCAAAAGTGC TAGATCTGCT CGAACCCCTT CTTCGGCCGC CTTTAGATAG TAGCGACAAA CGTAATAATG AGCGCAAAAG GAAAGTGTCC AGATGGAATG TTGATGAGAT CAAAAAAGTG AAGGAAAACC TCGAAAATGC AATTGAAGAG AACAAAGTAC GTCCCTCACT GTCTTGTCCA AATCAAGGTC AACAGGAACT AATCTACTCT GACGGTGATG GTAGACGAAA ACCCATGAAA TCCTGATGAA CAATCTTCCG TCCATATCAA GCCATATCCA AATCTCCTCC AACCTCTGTT CAGACTTTGC GGAACTCAAA ATCAAACTCA AAGCGCTGGA GAGTCAAGTC GATGTCTCGG ACCCAGCGAC ATCATTCATG CCGCCCTTAC TTTCCCTCTT GAACAAGCAT TTCAACTCCA TATCCGCTCG CGATACTTCG CAAGGACATA TAGCAGCCCT GAAAGCTTTG AAAGCCAAGG TGGAAAAGGT GAAAAAGTTG GAGGAAGTTG TGTGGGCGGG ACAAGGCGTG GAGGACTGGG TAGTGGACTC TGCTCCTTTG GCAAAGAAAG AGACTGGGGC AGAAGCAAAT GAAATAGCAG TGGATGGGCT GGAAGCTACC GCTATTTGCA AGGCGTTGGA AGAGAAGGAG AGTGTTTTGA GGACATTACT GAGGGATCAG TTGGTAGATG GGTTCAAGCG GGCTGTGGTG GTGCAAAAGG ATGGGATGAA ATTGGATGTC AGGCAGGAAA TCAGGTTGAG CAGCCCCAAT GGTGAGTCTG CTATGAAGCT CAGTATGTCT CTTAGCAACT AACCGACCTG GTCAATCAAG CATCGCGCCC GGATTACCTT CAGACCACAA CTGCTTCACC ATATCCTCTT TCATCGCTCT ACAAAGCCCT ATCGTCCCTC TCGCTGTTAC CAAATCTTCT CCAAGCGCTT CAAACGCAAT TATTAACAAC AATTATCCTA CCCGTCGTAT CCTCTCAACG CCGAATAAGC ATCTCTTCAT CTTCCGGAAT TTTCTCTCTT AAATCAGACC AAGTCAGCGG AGACATGGAG GACCAAGAGG TTTTGACTGG CTTAAAGCAA GTTTTGGACT TTACCTCTCG TACCCTGTAC CCCGAAGAAG CCAACACTCC AGAGCGCGAA CCCTTCATCC GCGAGATCAC GGTCAAAACC CTCGAATCCC TCCTTGATCA CCTCATCATC CCCTCCATTC CATCTTCCCC TTCGTCTATC CCCGGATGGC TCAATCTAAT TGTTTCATCC GTCGCTATCG AGGATACTGT TCTTCCCGTC TCGTCAGCTA ATGATCAGCT CCATCACAGT CGGCCGTTGA AGACGTTCTG GGAGAAACAG GCAGGGAAGG AGTATGCGAA TAAGAGACGA TATGATACGG CCGACCTTGT GAGGGTGCTT GTGTTAAACG AGTGGGGAAG ATGGGAAGGG ACGGAGAAGA AGAGAGAGAG AGAGGTCAGT GTGGTAGTTG AGGTGGAGGT CAGTAGTGAT GATGAGGATC AGCCCGTGAA GGAGCAAAGC AAAGAGCAAG GTGAAGCGGG AGAGGCGCAG GGCGACGGCT GGGGATTCGG AGAGACCAGT GAAAATCAAA TTGAGCAGGC 
GGGGGTGAAG CAAGTAAACC GGGAAGATGC GGAAGAAGCC GAAGACGGGT GGGGATTCGA TGAGGAGTCT ATCCCACCCG CGTCCTCGGC TGATCAAGCC CCACAAGAAG AAGACGAAGC CGATGGATGG GACCTAGACG TCGCCGCCAG TACGAGTTCG TCGCTGCCTG AGTCTAGCCC TTCTCTTCCC CAGCTTGATC CCTCGGTAGA GCTCGAAGCT AGGACTGAAC ACGCACCTGA ACCTCAACCT CAACCTGTAT CTCAACACGA ACCTCAACAC GAACCTCAAC GAGAGTCTGA GCCACAGCCA GAACCTAAAC CTGTATCAGC ACCTGCACCA ACATCGGCAC CTACTAAACC TCTTCGAGAA GCAAAACGTT TGGGCAAGAA AGTGGCCAAA AAGAGCAAAA CAGAGGAATA CGATCCCTGG GATCAACCGT TTGATAACGA AAACTCCCTC TCAGCGTCCA CTTCTACATT CACGTCAGCG TCGACTTCGA GTCTGTCGAA CCCAACACGG GCACTGTCAG CGGAAACTCC TTCCAATAAC GATATACCAC CTATTAAAAA ACCCCCTAAG GAAGCCAAGA AGTTGGGGAA GAAGGTGGGG AAGAAGACGA ATAAGGAAGC GGAGAGAGAT TTTTGGGATA CAGATATTCC GGTAGAGGAT TTAGGTGGTG GAGAAAGCAG TGTTCACCAT ACAGCTGGTG TTCTGGGCGA TAAAGTGGAA GGTGGCGGCG GTGGTAATGG TGATGGACGG AGCCGGGACG ATGATCCCAC GTCCACCTCT CCACAGATGA CGGCACCCCC CCAAAAAAGG AGGACTGAGC TTCGTGAAGA GAAAAGGGTT ATTGAGGAGA AATACTTTGT ATCCACTGCA TGCGAAAAGT TGTTGGATAT TGGAAGAGGT CTGCTCAAGG AGGTGGAGGA GTTACAATCT TCTAGGTAAG AACAGGCTTC AGGCGCGAAA AAAGTGATTT TTTTTGGATA TTAACGGTGG ATCGCAGTTT TGCCTCACCT TCCTTTACTT TTGATTCTAC AAGCCCAATC ATCATCCAGT CTTTGACAGA CATCTTCTCC CTTTATAGAG CCCTTCTTCC CATTCGGTAC GCTCAACAGC TGCAAGACAT CCCGGGTATC ACTATGCAAG CGTACGACGA TGGCAACTAC CTTGCTTCGC AACTTGGTTC ATTCGCTCTT CCCGCTTCAT CAACGCTTTC GTTGGACGAT GAGATTGCTC GCCTGAGAGC ATTGTCTGAG CGTATCTATG AGGATTTCCT AAAAAATCAG AGGGAGGGCA TAGACGAGGA GCTGGATAGC CTTAAAGGCG TGGAAGGAAC AGCGGATTAT AGTATCTGAT ATATCAATAT AGCTTGGCCA TGCATTGTAC CGAGCGAGTA CGCGACGCGT TTTCTTGTGT GAGGTGGAGA AAGATGAATC TGACGCGACG CGTTTTGGTG CACAGACCAC TTGCTCATTT CTTCCCACAA ATCTCTCTTT CCTTTATATT TCTTGCTTAT ATTTCACATC CCGCCATGCA GCACAGGACC CCCCTCACGC CCATCCCGCA GAACCGCTCC CCCCCCTCCG CCACATCCTC CAATGCCCCC AGCACTCCGA CTCCCAAGCG CCCCTCGTCC GCCAAACGGC CCCCGAAACT TGCACCCATC TTCTTGGCCA AGCGGGCCAG GCCGGAAAGG GAACGTCAAG AGAACACCGC ATGGGGTGTG TCTGCAAAGA AACGCGTGCA CCTGGCTGGG CCGTCGTCCA TCCCCCCGCT GGCCGCTGGC TCACGCTCCG CACCCATAGC 
GTCGGACGGG GAAGACGCTT CGCAGAGCAG CGATGACCAG TCGGATTCCG AGGATGACCG CCGGCTATAC ACACAGCGCA AATCGCAGAG TATACATGAC TACTTCCTGC CCAGTACCTC TCGATACGGA CGCGTCAAGG ACGAACGGCA GGACAAGGAG AACACACGCG TCATTGAGCC AATCACCAAG GAAGAATGCG TTTGGAAACG AAAGAGAAGA AGACTCGGAG TCAAGGGACA GGGGTCTTCT CTTGCAAACC AGCGTCGTAA GCTTTTGGCT TGCCCTGTTC AACGGCCAAC GCGCTGACAG AGCACATAGT GATACCCCCG CCCCAAGCAT ACCTGTCCAC ACTTTTACAT TCTCTCCTAC CTTACCATCC CCTCAGGCCT CCCGCTTCCA TCCTGTTACC CTCAATCCAT CCACCCACCG GCCGTCCTCG AGACTTTGCT CCTCCTCTCG CCGTCTCGTT CAACCATATC GCAAAGCAAT ACACTAGTTC CGACGCGCGT AGTCAAGGAT TACGCCGGTT GATCGGTATC GCGGGAGAAG AAGGCGGGGT GAGGATTTTA GATGTAGATG AGGGACTGGG GATGCATAGG GAAGAGAAGG GATGGTGGTG GAGAGCGCAT GGGAATGCGA TATTTGATTT AAAATGGTCC CCGGACGATA CGAAAGTGGC AAGTCGTCTT TCCTTCAAAG TCCCGAGTAG ATTTAACTGA AAATGTTACC ACAGCTCACT GCATCGGGCG ACCAGACATC ACGCTTACAT GCTCTGACCA CCCCCGTCCC CACTCTCCTC GCCACACTCC GTGGTCACAC ATCATCTGTC AAGACCGTTA CCTTTCTCGA CCCTTCTCGC TCCGCCAACA ACCCTTCACA GTCATCGGTT ATTGCTTCGG GCGGAAGGGA TGGGAACATT CTGATTTATG ATGTTCGAAC AAAAGGGCGT GATCCCGAAC TGGGCGAGTA TGACGCCGGG CCAGCGAGGA GGGAAGGCTC AAGAGAGAGG TACAGGGATG GTATACCGGG ATTTGCACCG CAAACGAGTG GCGAGGTGCT GGATCCGGTG ATAGTCATCA AGGGAGCTCA CGGGGACGGT AAACGCTCAG GTGTAAGTTA ATCCCCCACC TTGCATACTA TGAACACTTT AACATCCAAC CTCTTTCATA GCGAACTGCC ACACGATCCG TAACCTCCCT CCTCGCCCTC TCCTCCATCC CTGGCACCCT AGCCTCTGGC GGCTCATTTG ACGGTATCAT CAAATTATGG GACCTCCGTT TCCCTGCCCC AACCAACCGC TCTTCCAACC CACGTCCGAC GTGTACGCCA TCTGGCAGTT TACCTGATGC CACGCTGAGC GGCGAATCGC CCGCAAAACG AGCAAGAAGT ATAAACGCAA TGGTGGAATC ACCGGTGAAT GGGGATGTTT ATGCGCTTTG CGGTGATTCC AAGGTCCATG TTTTACGTCC TTCTGCCGCC CTCATTCAGC ACCACCCGAC CCACTCCTCT GGAGCGTTTA ATGCGTTTAC AGAAGAAGAG ACATACGCCG AAGCCGTCCA ACCGGCCACT TACTCGGACC CATCCATGCT CATTTCCAAT TTTTACATCC GTCTTTCTCT CTCGCCTGAT GGCCGGTATC TGGCTGCTGG GAGCTGTAAA GGCGGGCTGA TGACTTGGGA TACGCAGGAA CGAGGCCAGG ACATGAACAG AAGGAGAGAT GCGTCGACGA CGGTGGCGAA GAAGCTTGGG ATGGGGTTCG AGGTAGGAAT GAGTGAAATA GGGGGAAAGG ATAGAGAGGT TTGTGCGGTA 
GAGTGGGGGA AGGATATCGT GAGTCTGCTC TTCGTCTCTC ACCTTTCCTT TCGGTCTCCC GCATTGCTGC GCTGCTCTTC TATTATTCCA CTTTATTATA TACGGAGAGC TGACAACTTG TTTTATTATT ATTATTAATT TAAATCTCTA GCTCGCGGCA ACTTCAGATG GGTCCTTGAC ACGGATATGG AGGTCCGAAC CAGAGATTGC AAAGCAAATA GCAGAGAACC CTGGTGCCTA CGCCAATGAG TGGGCGGGTG CAATTTGA
|
Protein sequence | MFPVPSHLHR SGGEHISSEA EAEVSVQGNQ PSVEAQCEQA QSKVLDLLEP LLRPPLDSSD KRNNERKRKV SRWNVDEIKK VKENLENAIE ENKTKTHEIL MNNLPSISSH IQISSNLCSD FAELKIKLKA LESQVDVSDP ATSFMPPLLS LLNKHFNSIS ARDTSQGHIA ALKALKAKVE KVKKLEEVVW AGQGVEDWVV DSAPLAKKET GAEANEIAVD GLEATAICKA LEEKESVLRT LLRDQLVDGF KRAVVVQKDG MKLDVRQEIR LSSPNASRPD YLQTTTASPY PLSSLYKALS SLSLLPNLLQ ALQTQLLTTI ILPVVSSQRR ISISSSSGIF SLKSDQVSGD MEDQEVLTGL KQVLDFTSRT LYPEEANTPE REPFIREITV KTLESLLDHL IIPSIPSSPS SIPGWLNLIV SSVAIEDTVL PVSSANDQLH HSRPLKTFWE KQAGKEYANK RRYDTADLVR VLVLNEWGRW EGTEKKRERE VSVVVEVEVS SDDEDQPVKE QSKEQGEAGE AQGDGWGFGE TSENQIEQAG VKQVNREDAE EAEDGWGFDE ESIPPASSAD QAPQEEDEAD GWDLDVAAST SSSLPESSPS LPQLDPSVEL EARTEHAPEP QPQPVSQHEP QHEPQRESEP QPEPKPVSAP APTSAPTKPL REAKRLGKKV AKKSKTEEYD PWDQPFDNEN SLSASTSTFT SASTSSLSNP TRALSAETPS NNDIPPIKKP PKEAKKLGKK VGKKTNKEAE RDFWDTDIPV EDLGGGESSV HHTAGVLGDK VEGGGGGNGD GRSRDDDPTS TSPQMTAPPQ KRRTELREEK RVIEEKYFVS TACEKLLDIG RGLLKEVEEL QSSSFASPSF TFDSTSPIII QSLTDIFSLY RALLPIRYAQ QLQDIPGITM QAYDDGNYLA SQLGSFALPA SSTLSLDDEI ARLRALPLAH FFPQISLSFI FLAYISHPAM QHRTPLTPIP QNRSPPSATS SNAPSTPTPK RPSSAKRPPK LAPIFLAKRA RPERERQENT AWGVSAKKRV HLAGPSSIPP LAAGSRSAPI ASDGEDASQS SDDQSDSEDD RRLYTQRKSQ SIHDYFLPST SRYGRVKDER QDKENTRVIE PITKEECVWK RKRRRLGVKG QGSSLANQRL IPPPQAYLST LLHSLLPYHP LRPPASILLP SIHPPTGRPR DFAPPLAVSF NHIAKQYTSS DARSQGLRRL IGIAGEEGGL TASGDQTSRL HALTTPVPTL LATLRGHTSS VKTVTFLDPS RSANNPSQSS VIASGGRDGN ILIYDVRTKG RDPELGEYDA GPARREGSRE RYRDGIPGFA PQTSGEVLDP VIVIKGAHGD GKRSGRTATR SVTSLLALSS IPGTLASGGS FDGIIKLWDL RFPAPTNRSS NPRPTCTPSG SLPDATLSGE SPAKRARSIN AMVESPVNGD VYALCGDSKV HVLRPSAALI QHHPTHSSGA FNAFTEEETY AEAVQPATYS DPSMLISNFY IRLSLSPDGR YLAAGSCKGG LMTWDTQERG QDMNRRRDAS TTVAKKLGMG FEVGMSEIGG KDREVCAVEW GKDILAATSD GSLTRIWRSE PEIAKQIAEN PGAYANEWAG AI
|