Gene Information |
Locus tag | CNF00780 |
Symbol | |
ID | 3258187 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 244341 |
End bp | 249701 |
Gene Length | 5361 bp |
Protein Length | 1431 aa |
Translation table | |
GC content | 49% |
IMG OID | 638257202 |
Product | cleavage and polyadenylation specific protein, putative |
Protein accession | XP_571490 |
Protein GI | 58268668 |
COG category | [A] RNA processing and modification |
COG ID | [COG5161] Pre-mRNA cleavage and polyadenylation specificity factor |
TIGRFAM ID | |
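The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the 5361 bp gene length reported in the record). The function name `inclusive_length` is hypothetical and used only for illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both 1-based and inclusive.

    Assumption: the record uses inclusive coordinates on the replicon.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = inclusive_length(244341, 249701)

# 1431 amino acids need 1431 codons of three bases each (stop codon excluded).
coding_bases = 1431 * 3

print(gene_length)   # 5361
print(coding_bases)  # 4293
```

The 1431 codons account for 4293 bases, fewer than the 5361 bp gene length. That gap is what one would expect for a record marked as spliced, since introns and any untranslated flanking sequence do not contribute to the protein.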
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAATACACA CGACTCCAAC ACAGGAGAGT CCCCACGCCA TGCACGCTCT CCACCAGACC CTTCTCCCCT CATCCTCCAT ACACCACTCC CTCTTCCTCC CACACTTCAC CCCGTCAACC ATCTACCCTC TCCCAAAACC ACCTGCAGCG CTCGATACTC TCGATGTCAA GGTCATCGGC AACCTTGTCG TCGCCGGAGC AGAAGTCCTG AGAGTGTTTG AAATACGGGA AGAGAGTGTC CCGATAATAG AAAATGTCAA ACTGGAAGAA GATGTGGCTG AGGGCGAGAA GGATGTGCAG ATGGAAGAAG TCGGCGACGG ATTCTTTGAC GATGGTCATG CAGAAGTGGG TCGCATGCAT CTTGTTGTAT ATGGAATATA TTGCTCATCA TAAAACCTGT CACAGCGAGC TCCGTTAAAA TACCAAACGA CTAGGCGGCT GCATCTCTTG ACGCAACATG AGCTCAATGG CACGATAACG GGTTTGGCAG CTACGAGAAC GCTTGAGAGT ACCATCGACG GTTTAGACAG GTTGATTGTG TCGTTCAAGG ACGCAAAGGT AAAAAAGGTT ATTGTATGGA GGGCGAGCTT TGTTTGCTGA CTCTCACTAT CAGATGGCAC TTTTGGAATG GTCAAGAGGG GATATCGCAA CAGTGTCTTT GCATACCTAC GAGCGATGCT CTCAAATGAA CACTGGTGAC TTGCAGAGCT ATGTCCCACT GCTCAGGACA GATCCGTTAT CGAGATTGGC TGTTCTTACT CTTCCTGAAG ACTCGTTGGC GGTTTTGCCT TTGATTCAAG AGCAGTCAGA ATTAGACCCA TTATCAGAAG GCTTCTCCCG GTGAGTTGCA TGTGTCTTTT TCGAACCAAA GCAGAACTGA CATTTTATAG TGATGCACCT TACTCGCCCT CATTTGTTCT CTCCCTTTCA GACATGTCTA TAACCATTAA GAACATTCAA GATCTCCTTT TCCTTCCCGG CTTCCACTCC CCCACTATCG CACTCCTCTT TTCACCCATG CACACTTGGT CAGGTAGATT ACAAACTGTT AAAGATACAT TTTGCTTGGA GATTCGTACC TTTGACCTTT CTTCCGGCAC TTCCTATCCC CTCCTCACAT CTGTCTCTGG TCTCCCTAGC GACAGTCTGT ACCTTGTCGC ATGCCCTTCT GAACTCGGCG GCATCGTCCT TGTGACCAGC ACCGGTATCG TGCATGTCGA TCAAGGTGGT CGAGTGACTG CTGCTTGCGT CAATGCCTGG TGGTCCCGAA TTACATCCCT CAAATGCTCA ATGGCTTCTG TATCCCAGAA ACTTACCCTC GAAGGATCTC GGTGTGTCTT TGTCACTCCT CATGACATGC TGCTCGTCCT TCAGAATGGT GCCGTCCATC AAGTGAGGTT TAGCATGGAA GGCAGGGCGG TCGGAGTGAT TGAAGTGCTA GATAAGGGAT GTGTCGTTCC TCCCCCATCA GATTTGACTG TGGCGGGTGA TGGTGCCGTG TTTGTCGGTT CTGCGGAGGG TGATTCATGG TTGGCCAAAG TCAATGTTGT CAGGCAGGTG GTAGAAAGGT CAGAGAAGAA GAAGGATGAG ATGGAAGTAG ACTGGGATGA AGGTGAGTAA TACCCTTGTT AACTGCGTAC CATTGCTGAT AAAGAATAAA AAATACAGAT CTGTATGGTG ATATTAATGA TGCTGCACTT GACGAAAAGG CCCAAGAACT GTTCGGTCCC GCTGCAATCA CTCTTTCTCC TTATGACATT TTGACAGGTG TTGGGAAGAT TATGGATATT GAGTTTGGTA 
TCGCTGCTTC CGATCAAGGA GTGAGCTTAT CCTTGTGGAA TCAATCATAA GAGGTGGCGG GACTGATCTT ATTTACAGTT ACGCACTTAT CCCCAATTGG TGGCTGTCAG TGGTGGATCT CGCAACTCAA CCATCAACGT CTTCCGTGTA AGTCGCCTCT TCCTCCTGCA TAAGCACGAA CATCCATAGA CTCCTGGCTC ACACAAGTCG ACAGCGAGGT ATCCCTATAA CAAAGCGCCG CCGATTCAAC GAGCTGCTCA ATGCCGAAGG TGTCTGGTTC CTCCCTATCG ACAGACAAAC GGGGCAAAAA TTCAAGGATA TCCCCGAAGC GGAGAGGGCA ACAATATTGC TGAGCAGTGA AGGAAATGCC ACCAGAGTAT GTTTTGACAT AATTTGAACT GCCCATGTTG GACGAGTGCA AAGATTAATA CTGTCTCATA GGTCTTTGCG TTGTTCTCTA AACCTACGCC CCAGCAGATT GGTCGGCTTG ACGGCAAGAC CTTGTCAGCA GCACCATTCT TCCAGCGATC ATGCATCCTG CGTGTTTCTC CCCTAGAGGT CGTTCTCTTG GATAATAGTC GGTCCCTCCC TTACTTTCTT CGCTCATCGC TGACCTTCAG GTAGATGGTA AAATCATTCA AACCGTCTGC CCTCGTGGTG ATGGTCCCAA AATTGTAAAT GCCAGTATCT CGGATCCGTT CGTGATCATC CGCCGAGCGG ACGATAGCGT CACTTTCTTT GTCGGTGATA CTGTTGCAAG AACGGTTGCT GAAGCTCCCA TCGTCTCGGA AGGAGTAAGC CATCTTCCAT TCAAGTTCGC TTTTGGTCAT AGCGCTGATG ATGAACGTTT AGGAATCTCC CGTTTGCCAG GCGGTAGAGG TCTTTACAGA CACCACCGGC GTCTACCGTA CTTTCGAGCC TTCCAAATCC GAGTCCTCGG AGCCCATCTC CCACCAAATT GACTCTGAAA ATAAACCCAA CATCACGAAC GGCATAAATG GCACCACTGC GCGTTCCGCT CGCCAAACCC AGCTCACTCC TCAGCAGATC AAGCGATTAC AAGAACAAGA ACCTGCCATC ACCACAGAAG CACCTAGCAT GGAGACTGCA ATCAACTCTC CACATGGCAC TCAATGGCTT GCGCTTGTGA CGAGGGGTGG CGAGCTACAG ATACGGTCGT TGCCCGATTT GCAGATTGTT TTGCAGAGTG AGGGTTTGGC ATCGTCTGCA CCGAGTTTTA CGGATGATTT GGGTGAGAAT CCGGGGTATG TACTTGGTGA GAAAAGGGAA GAGGGTGAAG AGGAGGATGA GATTATACAG ATGGTGTTCT GTCCCATTGG AAAGGGAACT GTCAGGCAGC ACTTGCTCGT AAGTTGTGCC CTCCGGAACT CGAAACTGTT TGGAAAACTG ATATCGATGG ATGTTTCAGG CTCTTCACCA CTCGGGTCGA TTAAACGCCT ACGAAGCTCA ACCTCGCTTC ACCGTCGATG CATCATCCCA TTCTCGTCGT TCGCTCGCAG TTCGATTCCG TAAAGTTCAT ACTCAACTTC TTCCCATCTC GGGCGGTGTC GGTACCACCA ACGGGAACGC TCGACTGCCT TACACCATCG TCCCCTTCAA CAACATTGAG GGTTTGACCG GAGCGTTCAT TACTGGAGAG AAACCGCATT GGATTATCTC GAGTGAAGCT CATCCTCTGA GGGCGTTTGC GTTGAAACAG GCGGCGATGG CGTTTGGTAA AACGACTCAT TTGGGTGGAA AGGGAGAATA CTTCATTAGA ATAGAAGATG TGAGTTAGTT CATAGATTTC 
GTTTTGTTTG CTGACACTGC CTTCTAGGGC TCCTTCATCT GTTACTTACC CCCCACACTC AACACTGACT TTGCTATCCC TTGTGATCGG TACCAAATGG AACGAGCTTA TACCAATATA ACATTCGATC CAACATCTGC TCATTACGTC GGAGCTGCTT CCATTGAGGT GCCGTTTCAG GCGTATGATG AAGAGGGAGA GATCCAGCTG GGGCCAGATG GTAAGTCGAG ATTCTCAATC TCAAATAAAG GTTTTTCCTG ACGGAACTAT AGGTCCTGAC CTCATTCCCC CGACTAACCA ACGATCAACT CTTGAACTTT TCTCGCAAGG TTCTGATCCT TGGAAAGTTA TTGATGGTTA TGAGTTTGAT CAAAACGAAG AAGTCATGAG TATGGAAAGT GTGAACTTGG AATCGCCTGG TGCGCCGGGT GGCTATAGGG ACTTTATCGC CGTGGGAACT GGTTTCAACT TTGGTGAAGA TCGAGCTACG AGAGGGAACG TAAGTTGACC CCTGCCGGCA GTACAGTAGA TGAATGATTG GCTGATAGTG TGTTATAGAC ATATATATTT GAGATCCTTC AAACTGTCGG ACCGCAAGGA GGGGGAGGTC CTGGTAGCGT ACCAGGGTGG AAATTGGTTA AGAGGACGAA AGACCCCGCG AGACATCCCG TGAATGCTGT AAATCACATT AACGGGTACT TATTGAACAC CAATGGTCCG AAGGTGAGCA ACAAAACGTT CTTCTGTTCC GATTGAGCTG ACAATTTGAA TTGCCCAGCT CTACGTAAAA GGCCTTGACT ACGACTCTCA GCTGATGGGT CTCGCTTTCC TCGATATCCA ATTATACGCT ACCACTGTCA AAGTGTTTAA GAACTTTATG CTCATTGGCG ACCTTTGCAA GAGTTTTTGG TTTGTTTCGC TTCAAGAAGA TCCCTACAAG TTCACGACCA TCAGTAAAGA TTTGCAACAT GTCTCGGTAG TGACTGCAGA CTTCCTTGTA CACGATGGCC AGGTGACATT TATCTCCAGC GACAGGAATG GTGACATGAG AATGCTTGAC TTCGATCCCA CTGGTGAGTA TGAACGACTG TTGTCTGGAA GCAGCGGCTG ATATATCTAA GACCCCGACT CGCTGAATGG TGAACGTTTG ATGCTGAGAA CCGAGTACCA TGCTGGATCT GCCGCCACAG TATCGAAGGT GATTGCCAGG AGGAAGACAG CAGAGGAAGA ATTTGCACCG CAAACACAGA TCATCTATGG TGGGTCGTGC TTTTAGGCGT ACATAACGCA GGATGCTAAC ATTTTGCCAA AGCTACGGCC GACGGAGCGT TGACAACTGT TGTATCTGTC AAGGATGCGA GGTTCAAGCG ATTACAGCTC GTATCTGATC AACTGGTCAG GAACGCCCAG CATGTTGCGG GCCTCAACCC TCGGGCCTTC AGGTATGATT TCGCCTGCCA TATACACGAG CTAGCTTGTG CTGATTGCTT CCGACAGAAC TGTTCGTAAC GACCTCTTAC CACGCCCTTT ATCGAAGGGT ATCCTGGATG GTCAACTTCT CAACCAATTC GCTCTCCAAC CGATAGGAAG GCAAAAAGAG ATGATGCGAC AGATTGGTAC GGACGCCGTT ACTGTCGCTA GTGATTTGCA GGCCTTAGGC GGTTTCTGGT AAAGAGGAAA AGCATATGAA AAGGGTTGTT GAGAAGGATA TATCCTCTAT TTTACAATTT CAATGAGCGC ATGTTAGTAA A
|
Protein sequence | MHALHQTLLP SSSIHHSLFL PHFTPSTIYP LPKPPAALDT LDVKVIGNLV VAGAEVLRVF EIREESVPII ENVKLEEDVA EGEKDVQMEE VGDGFFDDGH AERAPLKYQT TRRLHLLTQH ELNGTITGLA ATRTLESTID GLDRLIVSFK DAKMALLEWS RGDIATVSLH TYERCSQMNT GDLQSYVPLL RTDPLSRLAV LTLPEDSLAV LPLIQEQSEL DPLSEGFSRD APYSPSFVLS LSDMSITIKN IQDLLFLPGF HSPTIALLFS PMHTWSGRLQ TVKDTFCLEI RTFDLSSGTS YPLLTSVSGL PSDSLYLVAC PSELGGIVLV TSTGIVHVDQ GGRVTAACVN AWWSRITSLK CSMASVSQKL TLEGSRCVFV TPHDMLLVLQ NGAVHQVRFS MEGRAVGVIE VLDKGCVVPP PSDLTVAGDG AVFVGSAEGD SWLAKVNVVR QVVERSEKKK DEMEVDWDED LYGDINDAAL DEKAQELFGP AAITLSPYDI LTGVGKIMDI EFGIAASDQG LRTYPQLVAV SGGSRNSTIN VFRRGIPITK RRRFNELLNA EGVWFLPIDR QTGQKFKDIP EAERATILLS SEGNATRVFA LFSKPTPQQI GRLDGKTLSA APFFQRSCIL RVSPLEVVLL DNNGKIIQTV CPRGDGPKIV NASISDPFVI IRRADDSVTF FVGDTVARTV AEAPIVSEGE SPVCQAVEVF TDTTGVYRTF EPSKSESSEP ISHQIDSENK PNITNGINGT TARSARQTQL TPQQIKRLQE QEPAITTEAP SMETAINSPH GTQWLALVTR GGELQIRSLP DLQIVLQSEG LASSAPSFTD DLGENPGYVL GEKREEGEEE DEIIQMVFCP IGKGTVRQHL LALHHSGRLN AYEAQPRFTV DASSHSRRSL AVRFRKVHTQ LLPISGGVGT TNGNARLPYT IVPFNNIEGL TGAFITGEKP HWIISSEAHP LRAFALKQAA MAFGKTTHLG GKGEYFIRIE DGSFICYLPP TLNTDFAIPC DRYQMERAYT NITFDPTSAH YVGAASIEVP FQAYDEEGEI QLGPDGPDLI PPTNQRSTLE LFSQGSDPWK VIDGYEFDQN EEVMSMESVN LESPGAPGGY RDFIAVGTGF NFGEDRATRG NTYIFEILQT VGPQGGGGPG SVPGWKLVKR TKDPARHPVN AVNHINGYLL NTNGPKLYVK GLDYDSQLMG LAFLDIQLYA TTVKVFKNFM LIGDLCKSFW FVSLQEDPYK FTTISKDLQH VSVVTADFLV HDGQVTFISS DRNGDMRMLD FDPTDPDSLN GERLMLRTEY HAGSAATVSK VIARRKTAEE EFAPQTQIIY ATADGALTTV VSVKDARFKR LQLVSDQLVR NAQHVAGLNP RAFRTVRNDL LPRPLSKGIL DGQLLNQFAL QPIGRQKEMM RQIGTDAVTV ASDLQALGGF W
|
| |
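The sequences above are displayed in space-separated blocks of ten characters and wrapped over several lines. A minimal sketch of how such text could be normalised and its GC content computed is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the way the page derived its own rounded 49% figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, after whitespace removal."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Small illustrative input in the same blocked layout as the record.
example = "GGCC AATT"
print(clean_sequence(example))  # GGCCAATT
print(gc_fraction(example))     # 0.5
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared with the GC content field. Calling `len(clean_sequence(...))` on the protein sequence gives a count that can be compared with the reported protein length.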