Gene Information |
Locus tag | CNL04700 |
Symbol | |
ID | 3254838 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 314756 |
End bp | 317861 |
Gene Length | 3106 bp |
Protein Length | 944 aa |
Translation table | |
GC content | 52% |
IMG OID | 638253941 |
Product | expressed protein |
Protein accession | XP_568013 |
Protein GI | 58261206 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCCTTCCAA CTCCGTCTAT CATGTCATGC CCACCCGCTC CCAACCGCAT TCACCAACTA CCAGGTACGT CATCGCGTCC CTTTTTTACC GCACTGTTCG TGTGCTAACA ACGTCAACCA ACCACAGTCC GCGTCTCCTA CTACATCCCT GCTTCCTCAC AGACCTACTC TACAATATTT CCGTCATTAC AACAAGTGTA CATACATCCT CAATCTGCTG GTGGAGACGA CGAGACATGG GGATCCATAT ACCTAAAAAC TGTGGTAAAG GGGGTTCTGG CCGCCAGGTG GGTCTTGTTT GAACACGTCG AAAAGGCGCC CATATCACTG AACCAGCTGA CCATCTCTAT TGCAGTCCTG AATTACATCC TGCCTATCCC GGAACATCCG ACTTATCTCT CTACGTCCTG GATCCCCGCG AAACATATTT TCGTCGTGCA CGCGTCGGCT CAGCCCACCA TGGCAACACC GACACGACTG GTAGCGAAGT ATGGACAGGT AAAGGTCTTG TTTCTTGGAG CCTCGCGGAA CATGGCCAAG GCAAAAACCT TATCGCCGGT CGGCTAGAAC GAGACGTAGC GTTTACTACG GCGATCAGAA CTCAGGGCAT GTCTGCTCTT GAAGCTTTGA TGGCGGCAGA CCAGATGGGT AATGTAAGCG AAGAAAGCTG GGGTATTGAC ATCTCATTGG GCTTGAACAT GGGATTAGGT GCGCCTGGAG GCTTCGCACG TGCACGTGCG AGCTCGCCGG CCGTGCGGAG GAGGCGCTCT AGTTTAGAGA AGGAAGAAAG GGGACATATA ACACCTCCTT ATCCTAGACA ACAAAAACGC GCGTCAGTCC CGCCCGCACC GCAAACTTTT GACATCCAAG AAAAACTTCC CGCCCACCCA TCATCATCTT CTTACTCCCA CGACCAGCCC TTTAGACCCG GTCACAATAT CAGTCACGCA AGCTCCCCCA TCGACCCCCC TCGTGCTCGT GGTCGACCAA CCAAATCTGC TAGTACTGGC GGCCGTCCTC GCAAGGTCAC TGCTTCGTCA CACAGCTCCT CCACTTCTAT TGATGGTGCC GACCCAAGAC AACGCCCTGC ACATAGTCAA CGACCATACG AAATCCCGCT TCCGTCCTCT GAAGGCCCTA ACGTCTACGA TGCCCTATCT TCCATTCCCG AAAATATCCT GGCACGTCCC GAATCTCTCA CACGCGAGCA AGCACAAAGA CTACTGGCAA GCCCTGCATT CATCGATATG CTTGGAAAAA TCACTGGCAC CTCGATTCCT ACGGGCAAGC CCAACCCCAA CAACAAGCGC CTGAGAGATG GAGAGGAACC GGAGTTTCAA CCTCCGAAGC GAAAGAGAGG AAGACCGACG AAGGCTGAAA AGGCGGCAAA GGAGGCTCAA GAAGCTGCCG CGAGATTGGC GGAAGCTGTC AAACAAAAGG CCGAAGCTGT CGATGGTCCA CAAGAGGACT CGAGAAACCC AGTATGCTGG AATTGCGGCA GAACGAAGTC TGCCATCTGG AGAACCAAGG TGATGGAAAA TGGACAGAGT GTGAGAGTGT GCAATGGTAC GTGATCATTA GTAGTGCATG TGGATTTTAT ATTGACCTTA AACAGCTTGT GGTCTTTACT GGAACAAACT TGGGACTATG AGGCCACCCA ACCTTTGGGC AGACGGCGAT GACGATCGTC GAAGCGAACG CGCGCAAAGT ACATTCTCCG AGGCTCCATC AGAACATGGA CCAGCACATC GAGTTGATAA ACCGCTTGCT CGGGCCCAAG AGTCATTTAA GCGGACATTA 
TCTGCAGCGG TCGAGCAAGA TGCTAGGCGT ATGGCCACCC GCCACAAGGG TCGGGCGCCT ACTTCTCCCA ACAAACATTC AAAACTTGGT CCAATGACTT CCCCGCCTCG TGGGTCTGCT TCTGCAACAA AATCGTTAAA GCAACGCAAA TACACTGCGG CCAGTTCTCC CGGTGGCTTC GTAGAGACGA TGCACGATTC GTTCGAGTCG GAAGTTAATG AGGCTAGTCC TGAGGATGAA GATGAATCTC CTCATTTCCG TGCCCGCCCT GCTCCTCATC CCCATCGCCT CACTCGTAAT CCACCTTCCA ATACTGATAC CAGGACGCTC GACCTTCCTC TCAGCGACGA CGGTTTAGGC AGCGGCGCAT CTCGCGGCAA CAATGAATGG AACGATGAAG TCTCTGCATT TTTCGATGTG GAGGGCTTCT CAATGTCCAC CATGGCCCAA CATGAAATTC CAGAATACAG TCGTCGTCAC CATCGTCAAG AGGTTATTCA AGAAACGAAG CTCGGTGCCC ACCGCTCAAA TCCTATCCAC GTGACTTCGT CGTCGATGAT GGAGAATGGG CATGTTGGCA CGGATTCTAC TCTTGAGGAA GATACCGTAC TTTCACAGTT ATTCAACCGT ACCTCTTCCA TTGCAGGGCC GGGCTCTTCT CCTTCAGGGT TTGACTTTTC CCAATTACCC CCTTCTTCGC CGCCCATGTC CGTTCTCAGC TCAGATGCTC TTCCCCATTC TGCACTCCTT TTGTCGAGCC CTGTGAAGAA GAACACTCCT AGTGTATCTG GACAGACGCC CAGTGCGTCT GGCTTGACGC CCGTAGACGG CAACCGCTTG CCTCAGAGCA GCTCCAAGTT GAGACACTCT GTCAACGCGG GTGACAGTCA CGATAACGAT AGACAGGCTG GTGGAAACGG TGGTGTACAG CAGCTAGACT TTGAGGGTAT TCAGAGGATG TTTAACTTGA TGTCCCATCC CGATTACGTC TCGCAGGAAA ATACGTCTGC CGGTACGAAT CACACCCCTC TTTCGACGTT TGAAGACCCT CAATACGGTG CTTTGAACGA GTTGATTGAT GGTCTCGGTG GAGGTGCGTC CGGGATGAAG GTGAATGTAG GAGGGGAGGT GGTTTGTGGA GGAGAGGGCG AAACGAGCAG TGCGAGTGCA GTCTTGGCAG ACGGGAATGG GGAAGACATC TTTGCATCAT TCTTGGACGG AGGAGCGTTT GTGTGAGAGA GGGTAAAACT AGCATTGGAC TTTACTTTTT TTAATACGAA TAACAACAGA CTCGGCATTA ATATGA
|
Protein sequence | MSCPPAPNRI HQLPVRVSYY IPASSQTYST IFPSLQQVYI HPQSAGGDDE TWGSIYLKTV VKGVLAASPE LHPAYPGTSD LSLYVLDPRE TYFRRARVGS AHHGNTDTTG SEVWTGKGLV SWSLAEHGQG KNLIAGRLER DVAFTTAIRT QGMSALEALM AADQMGNVSE ESWGIDISLG LNMGLGAPGG FARARASSPA VRRRRSSLEK EERGHITPPY PRQQKRASVP PAPQTFDIQE KLPAHPSSSS YSHDQPFRPG HNISHASSPI DPPRARGRPT KSASTGGRPR KVTASSHSSS TSIDGADPRQ RPAHSQRPYE IPLPSSEGPN VYDALSSIPE NILARPESLT REQAQRLLAS PAFIDMLGKI TGTSIPTGKP NPNNKRLRDG EEPEFQPPKR KRGRPTKAEK AAKEAQEAAA RLAEAVKQKA EAVDGPQEDS RNPVCWNCGR TKSAIWRTKV MENGQSVRVC NACGLYWNKL GTMRPPNLWA DGDDDRRSER AQSTFSEAPS EHGPAHRVDK PLARAQESFK RTLSAAVEQD ARRMATRHKG RAPTSPNKHS KLGPMTSPPR GSASATKSLK QRKYTAASSP GGFVETMHDS FESEVNEASP EDEDESPHFR ARPAPHPHRL TRNPPSNTDT RTLDLPLSDD GLGSGASRGN NEWNDEVSAF FDVEGFSMST MAQHEIPEYS RRHHRQEVIQ ETKLGAHRSN PIHVTSSSMM ENGHVGTDST LEEDTVLSQL FNRTSSIAGP GSSPSGFDFS QLPPSSPPMS VLSSDALPHS ALLLSSPVKK NTPSVSGQTP SASGLTPVDG NRLPQSSSKL RHSVNAGDSH DNDRQAGGNG GVQQLDFEGI QRMFNLMSHP DYVSQENTSA GTNHTPLSTF EDPQYGALNE LIDGLGGGAS GMKVNVGGEV VCGGEGETSS ASAVLADGNG EDIFASFLDG GAFV