Gene CNF00780 details


Gene Information

Locus tag: CNF00780
Symbol:
ID: 3258187
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 244341
End bp: 249701
Gene Length: 5361 bp
Protein Length: 1431 aa
Translation table:
GC content: 49%
IMG OID: 638257202
Product: cleavage and polyadenylation specific protein, putative
Protein accession: XP_571490
Protein GI: 58268668
COG category: [A] RNA processing and modification
COG ID: [COG5161] Pre-mRNA cleavage and polyadenylation specificity factor
TIGRFAM ID:
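
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but it reproduces the listed gene length) and that the coding sequence includes one stop codon. The function names `span_length` and `coding_length` are hypothetical helpers introduced for illustration only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the Gene Information fields.
gene_len = span_length(244341, 249701)
cds_len = coding_length(1431)

# Positions in the gene span that are not coding
# (introns and any untranslated sequence).
noncoding = gene_len - cds_len

print(gene_len, cds_len, noncoding)  # 5361 4296 1065
```

Under these assumptions the 1431 aa protein accounts for 4296 bp of the 5361 bp span, which is consistent with the gene being marked as spliced.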


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAATACACA CGACTCCAAC ACAGGAGAGT CCCCACGCCA TGCACGCTCT CCACCAGACC 
CTTCTCCCCT CATCCTCCAT ACACCACTCC CTCTTCCTCC CACACTTCAC CCCGTCAACC
ATCTACCCTC TCCCAAAACC ACCTGCAGCG CTCGATACTC TCGATGTCAA GGTCATCGGC
AACCTTGTCG TCGCCGGAGC AGAAGTCCTG AGAGTGTTTG AAATACGGGA AGAGAGTGTC
CCGATAATAG AAAATGTCAA ACTGGAAGAA GATGTGGCTG AGGGCGAGAA GGATGTGCAG
ATGGAAGAAG TCGGCGACGG ATTCTTTGAC GATGGTCATG CAGAAGTGGG TCGCATGCAT
CTTGTTGTAT ATGGAATATA TTGCTCATCA TAAAACCTGT CACAGCGAGC TCCGTTAAAA
TACCAAACGA CTAGGCGGCT GCATCTCTTG ACGCAACATG AGCTCAATGG CACGATAACG
GGTTTGGCAG CTACGAGAAC GCTTGAGAGT ACCATCGACG GTTTAGACAG GTTGATTGTG
TCGTTCAAGG ACGCAAAGGT AAAAAAGGTT ATTGTATGGA GGGCGAGCTT TGTTTGCTGA
CTCTCACTAT CAGATGGCAC TTTTGGAATG GTCAAGAGGG GATATCGCAA CAGTGTCTTT
GCATACCTAC GAGCGATGCT CTCAAATGAA CACTGGTGAC TTGCAGAGCT ATGTCCCACT
GCTCAGGACA GATCCGTTAT CGAGATTGGC TGTTCTTACT CTTCCTGAAG ACTCGTTGGC
GGTTTTGCCT TTGATTCAAG AGCAGTCAGA ATTAGACCCA TTATCAGAAG GCTTCTCCCG
GTGAGTTGCA TGTGTCTTTT TCGAACCAAA GCAGAACTGA CATTTTATAG TGATGCACCT
TACTCGCCCT CATTTGTTCT CTCCCTTTCA GACATGTCTA TAACCATTAA GAACATTCAA
GATCTCCTTT TCCTTCCCGG CTTCCACTCC CCCACTATCG CACTCCTCTT TTCACCCATG
CACACTTGGT CAGGTAGATT ACAAACTGTT AAAGATACAT TTTGCTTGGA GATTCGTACC
TTTGACCTTT CTTCCGGCAC TTCCTATCCC CTCCTCACAT CTGTCTCTGG TCTCCCTAGC
GACAGTCTGT ACCTTGTCGC ATGCCCTTCT GAACTCGGCG GCATCGTCCT TGTGACCAGC
ACCGGTATCG TGCATGTCGA TCAAGGTGGT CGAGTGACTG CTGCTTGCGT CAATGCCTGG
TGGTCCCGAA TTACATCCCT CAAATGCTCA ATGGCTTCTG TATCCCAGAA ACTTACCCTC
GAAGGATCTC GGTGTGTCTT TGTCACTCCT CATGACATGC TGCTCGTCCT TCAGAATGGT
GCCGTCCATC AAGTGAGGTT TAGCATGGAA GGCAGGGCGG TCGGAGTGAT TGAAGTGCTA
GATAAGGGAT GTGTCGTTCC TCCCCCATCA GATTTGACTG TGGCGGGTGA TGGTGCCGTG
TTTGTCGGTT CTGCGGAGGG TGATTCATGG TTGGCCAAAG TCAATGTTGT CAGGCAGGTG
GTAGAAAGGT CAGAGAAGAA GAAGGATGAG ATGGAAGTAG ACTGGGATGA AGGTGAGTAA
TACCCTTGTT AACTGCGTAC CATTGCTGAT AAAGAATAAA AAATACAGAT CTGTATGGTG
ATATTAATGA TGCTGCACTT GACGAAAAGG CCCAAGAACT GTTCGGTCCC GCTGCAATCA
CTCTTTCTCC TTATGACATT TTGACAGGTG TTGGGAAGAT TATGGATATT GAGTTTGGTA
TCGCTGCTTC CGATCAAGGA GTGAGCTTAT CCTTGTGGAA TCAATCATAA GAGGTGGCGG
GACTGATCTT ATTTACAGTT ACGCACTTAT CCCCAATTGG TGGCTGTCAG TGGTGGATCT
CGCAACTCAA CCATCAACGT CTTCCGTGTA AGTCGCCTCT TCCTCCTGCA TAAGCACGAA
CATCCATAGA CTCCTGGCTC ACACAAGTCG ACAGCGAGGT ATCCCTATAA CAAAGCGCCG
CCGATTCAAC GAGCTGCTCA ATGCCGAAGG TGTCTGGTTC CTCCCTATCG ACAGACAAAC
GGGGCAAAAA TTCAAGGATA TCCCCGAAGC GGAGAGGGCA ACAATATTGC TGAGCAGTGA
AGGAAATGCC ACCAGAGTAT GTTTTGACAT AATTTGAACT GCCCATGTTG GACGAGTGCA
AAGATTAATA CTGTCTCATA GGTCTTTGCG TTGTTCTCTA AACCTACGCC CCAGCAGATT
GGTCGGCTTG ACGGCAAGAC CTTGTCAGCA GCACCATTCT TCCAGCGATC ATGCATCCTG
CGTGTTTCTC CCCTAGAGGT CGTTCTCTTG GATAATAGTC GGTCCCTCCC TTACTTTCTT
CGCTCATCGC TGACCTTCAG GTAGATGGTA AAATCATTCA AACCGTCTGC CCTCGTGGTG
ATGGTCCCAA AATTGTAAAT GCCAGTATCT CGGATCCGTT CGTGATCATC CGCCGAGCGG
ACGATAGCGT CACTTTCTTT GTCGGTGATA CTGTTGCAAG AACGGTTGCT GAAGCTCCCA
TCGTCTCGGA AGGAGTAAGC CATCTTCCAT TCAAGTTCGC TTTTGGTCAT AGCGCTGATG
ATGAACGTTT AGGAATCTCC CGTTTGCCAG GCGGTAGAGG TCTTTACAGA CACCACCGGC
GTCTACCGTA CTTTCGAGCC TTCCAAATCC GAGTCCTCGG AGCCCATCTC CCACCAAATT
GACTCTGAAA ATAAACCCAA CATCACGAAC GGCATAAATG GCACCACTGC GCGTTCCGCT
CGCCAAACCC AGCTCACTCC TCAGCAGATC AAGCGATTAC AAGAACAAGA ACCTGCCATC
ACCACAGAAG CACCTAGCAT GGAGACTGCA ATCAACTCTC CACATGGCAC TCAATGGCTT
GCGCTTGTGA CGAGGGGTGG CGAGCTACAG ATACGGTCGT TGCCCGATTT GCAGATTGTT
TTGCAGAGTG AGGGTTTGGC ATCGTCTGCA CCGAGTTTTA CGGATGATTT GGGTGAGAAT
CCGGGGTATG TACTTGGTGA GAAAAGGGAA GAGGGTGAAG AGGAGGATGA GATTATACAG
ATGGTGTTCT GTCCCATTGG AAAGGGAACT GTCAGGCAGC ACTTGCTCGT AAGTTGTGCC
CTCCGGAACT CGAAACTGTT TGGAAAACTG ATATCGATGG ATGTTTCAGG CTCTTCACCA
CTCGGGTCGA TTAAACGCCT ACGAAGCTCA ACCTCGCTTC ACCGTCGATG CATCATCCCA
TTCTCGTCGT TCGCTCGCAG TTCGATTCCG TAAAGTTCAT ACTCAACTTC TTCCCATCTC
GGGCGGTGTC GGTACCACCA ACGGGAACGC TCGACTGCCT TACACCATCG TCCCCTTCAA
CAACATTGAG GGTTTGACCG GAGCGTTCAT TACTGGAGAG AAACCGCATT GGATTATCTC
GAGTGAAGCT CATCCTCTGA GGGCGTTTGC GTTGAAACAG GCGGCGATGG CGTTTGGTAA
AACGACTCAT TTGGGTGGAA AGGGAGAATA CTTCATTAGA ATAGAAGATG TGAGTTAGTT
CATAGATTTC GTTTTGTTTG CTGACACTGC CTTCTAGGGC TCCTTCATCT GTTACTTACC
CCCCACACTC AACACTGACT TTGCTATCCC TTGTGATCGG TACCAAATGG AACGAGCTTA
TACCAATATA ACATTCGATC CAACATCTGC TCATTACGTC GGAGCTGCTT CCATTGAGGT
GCCGTTTCAG GCGTATGATG AAGAGGGAGA GATCCAGCTG GGGCCAGATG GTAAGTCGAG
ATTCTCAATC TCAAATAAAG GTTTTTCCTG ACGGAACTAT AGGTCCTGAC CTCATTCCCC
CGACTAACCA ACGATCAACT CTTGAACTTT TCTCGCAAGG TTCTGATCCT TGGAAAGTTA
TTGATGGTTA TGAGTTTGAT CAAAACGAAG AAGTCATGAG TATGGAAAGT GTGAACTTGG
AATCGCCTGG TGCGCCGGGT GGCTATAGGG ACTTTATCGC CGTGGGAACT GGTTTCAACT
TTGGTGAAGA TCGAGCTACG AGAGGGAACG TAAGTTGACC CCTGCCGGCA GTACAGTAGA
TGAATGATTG GCTGATAGTG TGTTATAGAC ATATATATTT GAGATCCTTC AAACTGTCGG
ACCGCAAGGA GGGGGAGGTC CTGGTAGCGT ACCAGGGTGG AAATTGGTTA AGAGGACGAA
AGACCCCGCG AGACATCCCG TGAATGCTGT AAATCACATT AACGGGTACT TATTGAACAC
CAATGGTCCG AAGGTGAGCA ACAAAACGTT CTTCTGTTCC GATTGAGCTG ACAATTTGAA
TTGCCCAGCT CTACGTAAAA GGCCTTGACT ACGACTCTCA GCTGATGGGT CTCGCTTTCC
TCGATATCCA ATTATACGCT ACCACTGTCA AAGTGTTTAA GAACTTTATG CTCATTGGCG
ACCTTTGCAA GAGTTTTTGG TTTGTTTCGC TTCAAGAAGA TCCCTACAAG TTCACGACCA
TCAGTAAAGA TTTGCAACAT GTCTCGGTAG TGACTGCAGA CTTCCTTGTA CACGATGGCC
AGGTGACATT TATCTCCAGC GACAGGAATG GTGACATGAG AATGCTTGAC TTCGATCCCA
CTGGTGAGTA TGAACGACTG TTGTCTGGAA GCAGCGGCTG ATATATCTAA GACCCCGACT
CGCTGAATGG TGAACGTTTG ATGCTGAGAA CCGAGTACCA TGCTGGATCT GCCGCCACAG
TATCGAAGGT GATTGCCAGG AGGAAGACAG CAGAGGAAGA ATTTGCACCG CAAACACAGA
TCATCTATGG TGGGTCGTGC TTTTAGGCGT ACATAACGCA GGATGCTAAC ATTTTGCCAA
AGCTACGGCC GACGGAGCGT TGACAACTGT TGTATCTGTC AAGGATGCGA GGTTCAAGCG
ATTACAGCTC GTATCTGATC AACTGGTCAG GAACGCCCAG CATGTTGCGG GCCTCAACCC
TCGGGCCTTC AGGTATGATT TCGCCTGCCA TATACACGAG CTAGCTTGTG CTGATTGCTT
CCGACAGAAC TGTTCGTAAC GACCTCTTAC CACGCCCTTT ATCGAAGGGT ATCCTGGATG
GTCAACTTCT CAACCAATTC GCTCTCCAAC CGATAGGAAG GCAAAAAGAG ATGATGCGAC
AGATTGGTAC GGACGCCGTT ACTGTCGCTA GTGATTTGCA GGCCTTAGGC GGTTTCTGGT
AAAGAGGAAA AGCATATGAA AAGGGTTGTT GAGAAGGATA TATCCTCTAT TTTACAATTT
CAATGAGCGC ATGTTAGTAA A
 
Protein sequence
MHALHQTLLP SSSIHHSLFL PHFTPSTIYP LPKPPAALDT LDVKVIGNLV VAGAEVLRVF 
EIREESVPII ENVKLEEDVA EGEKDVQMEE VGDGFFDDGH AERAPLKYQT TRRLHLLTQH
ELNGTITGLA ATRTLESTID GLDRLIVSFK DAKMALLEWS RGDIATVSLH TYERCSQMNT
GDLQSYVPLL RTDPLSRLAV LTLPEDSLAV LPLIQEQSEL DPLSEGFSRD APYSPSFVLS
LSDMSITIKN IQDLLFLPGF HSPTIALLFS PMHTWSGRLQ TVKDTFCLEI RTFDLSSGTS
YPLLTSVSGL PSDSLYLVAC PSELGGIVLV TSTGIVHVDQ GGRVTAACVN AWWSRITSLK
CSMASVSQKL TLEGSRCVFV TPHDMLLVLQ NGAVHQVRFS MEGRAVGVIE VLDKGCVVPP
PSDLTVAGDG AVFVGSAEGD SWLAKVNVVR QVVERSEKKK DEMEVDWDED LYGDINDAAL
DEKAQELFGP AAITLSPYDI LTGVGKIMDI EFGIAASDQG LRTYPQLVAV SGGSRNSTIN
VFRRGIPITK RRRFNELLNA EGVWFLPIDR QTGQKFKDIP EAERATILLS SEGNATRVFA
LFSKPTPQQI GRLDGKTLSA APFFQRSCIL RVSPLEVVLL DNNGKIIQTV CPRGDGPKIV
NASISDPFVI IRRADDSVTF FVGDTVARTV AEAPIVSEGE SPVCQAVEVF TDTTGVYRTF
EPSKSESSEP ISHQIDSENK PNITNGINGT TARSARQTQL TPQQIKRLQE QEPAITTEAP
SMETAINSPH GTQWLALVTR GGELQIRSLP DLQIVLQSEG LASSAPSFTD DLGENPGYVL
GEKREEGEEE DEIIQMVFCP IGKGTVRQHL LALHHSGRLN AYEAQPRFTV DASSHSRRSL
AVRFRKVHTQ LLPISGGVGT TNGNARLPYT IVPFNNIEGL TGAFITGEKP HWIISSEAHP
LRAFALKQAA MAFGKTTHLG GKGEYFIRIE DGSFICYLPP TLNTDFAIPC DRYQMERAYT
NITFDPTSAH YVGAASIEVP FQAYDEEGEI QLGPDGPDLI PPTNQRSTLE LFSQGSDPWK
VIDGYEFDQN EEVMSMESVN LESPGAPGGY RDFIAVGTGF NFGEDRATRG NTYIFEILQT
VGPQGGGGPG SVPGWKLVKR TKDPARHPVN AVNHINGYLL NTNGPKLYVK GLDYDSQLMG
LAFLDIQLYA TTVKVFKNFM LIGDLCKSFW FVSLQEDPYK FTTISKDLQH VSVVTADFLV
HDGQVTFISS DRNGDMRMLD FDPTDPDSLN GERLMLRTEY HAGSAATVSK VIARRKTAEE
EFAPQTQIIY ATADGALTTV VSVKDARFKR LQLVSDQLVR NAQHVAGLNP RAFRTVRNDL
LPRPLSKGIL DGQLLNQFAL QPIGRQKEMM RQIGTDAVTV ASDLQALGGF W