Gene Information |
Locus tag | CNB02560 |
Symbol | |
ID | 3255985 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 744835 |
End bp | 748626 |
Gene Length | 3792 bp |
Protein Length | 1064 aa |
Translation table | |
GC content | 50% |
IMG OID | 638254906 |
Product | cleavage stimulation factor, 77kDa subunit, putative |
Protein accession | XP_569179 |
Protein GI | 58263803 |
COG category | [A] RNA processing and modification |
COG ID | [COG5107] Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCTTTCTCCG TCTCGCTCCA GCTCCCATGT CACACGAACA CCCCGCAGAG ATCGTCCACC AGCTGCAATC AATAGACTCC ATCCAGCAGG ACCTCGCAGA CACAGCCGCT GCCGTCGTCG ATGCTGCCTC CCAAAGCCTT CCCCCCCAAC ATGCTGCACA AGATCAAAAA TCCCACTCAA CCCTGTTGGA CAATGCAGAG GCAATGGAGT CTATTCTTGA AAGTGCCATA CCGGCAGCTC CAGGTTCATC GTCTACCGCT GCAGCATCTG ATGCCATTGG AGATAACAAT TTTACTGCGT CTGGTCCTCC TCCTATTTCC AATACTGTCG TTGCTCCATC TGCTGCAGAT GACGAAACTG GAGACGTGAT TGTCGTTGAA AGCGAGACTC CTACAACTAC CAATGTCGTA GAGGCAGTTG TCAACCCAGA AAATGATGGC GATGTCCCGA TAGAGTCCAC AGAACAGTCC TCAGGACAGC CCGCAGAACA GCCCACAGAA CAGCCCACAG AACAACCCAC AGAACAGCCC GCAGAACAAC CTGCTGTCGA AACACCTCAA GCACCCCAAT CCACTCCTGC TGCACCCGCC GCATCTGCCG TTGAAAGTAT CCCAACCACC TTGGAGGAAA CTCATCTTGA GCAAATAGTG CAAGCACCTG CCCCAACTGT TCATACTGAG CCTCTGACGC CTGTGGTCGA CATCAAAATG GAAGAAAAGC CTATGATCGA ACACGTCGCA TGGATACCAC CTCAAGGAAT CCACTCTGAT GTATTTCTCC CGGAAGGTCT GACGGAATAT TCCCCAAGTG TGAGCCAAAA TGGCGAACTG ATCAGGTCTT GGCGTGCTGG TGAATTTCGC ATCTCATTTC CATATTGTAA CCTGGCTGAT TAATGGTGCA GATCCTAGTA ACCCGACCCT GTTACTTTCT CTCTTCAACT GGGCCGTCCA AAAAACGGAA GTGGAGGATG CTAGGGCTTG GTATCGGGTT CTGGCGGTTG ATAATCCGAC TGCGGTACGT ACAAGCTGAT ATTTTAGAGC TATATTGTCT TACTTCCCCT TGTCAGGCCC AACCGCTGCT CGCTTTGATC AATTTGGAGT TAGCTCTTTC CAACTTCGCC GAAGTAGAGG CTATCTTTGC CAGTACACTC AAGGGAAGCG CTGGGATAAC AACTGCGGCC GACGTTAGTA TCTGGGCCGC GTATCTCCAT TACATCCGGC GACAAAACCC TCTTACTGAG GGTTCAGCCA ATGCTGCTGA TGTCAGATCA ACGATCACTG AGGCTTATGA GTTTGCGCTT CGAGAGTGTG GATTTGACCG AGAGAGCGGA GATATCTGGG ATGAGTACAT CAAGTTTGTT GCGAGTGGTC CTGTATGTCG ATTCTGATGG ACTACCTTTT TTTGACTGAC TTTTTGTAGG CTACCAATCA ATGGGACACG CAAGCCAAAA ATGATAACCT TCGAAAGATC TATCAGCGAG CTGTTTGTAT TCCCCTCAAC AACATTGAGG CTCTATGGAA GTCTTACGAC AATTTTGAAT CTTCCCTCAA TAAGCTTACC GCCAAGAAAT ATCTCGCTGA AAAGTCTCCT GCTTATATGA CAGCTCGTAC CGCCCTTCGC GAGCTTCGTG CGCTTTCGGA CCCGATCCCC AAACCTATCC TGCCTCCGTA TCCCACTTTC ACAGAACAGG ACAGGCAGGT TGTTGGTGCT TGGAAAGCAT GTCTGAGATG GGAAGAAGGG AATCCACTGG TTATTGAGAA TCACGAGCTG TTGCAATCCA GGATTGGGTA TGCATTGAGA 
AAGTGTTTGG GTGAAATGAG ACATTTTCCA GAGCTCTGGC ACTATGCCGC TAGCTATTAC TCCAAGTTGG GTAAACAGGA CGAGGCTGCA GAGATTCTCG AAGCCGGTGT GAATGCTTGT CCTAAAAGGT AAGCTTATTC CTCTTCATCC CCAGACCTAG AACTAATCAA CCTCAGCTTC CTCCTCACGT TTGCTTACGC TGAGCTTCAA GAAGAGCGCA AAGCTTTCCC GACTTGTCAT TCACTCTATA CTACCCTCAT CTCTAAACTG AATCCCGAAG TCGATGAGCT CCGCCAGAAC GTTGCTCGTG AAATTGACAT TGCTCGCGGT CCCCCTATTC CTGGCTCTGA AAAGGCTGCA GTAGCTGCTG CCGTTGGCGA CAGCATTGAC GCTGACGGTA ATGATATAAG CGATATCCAG AGGCTCGTGG AGGAACGGGA ACAGAGAGGG GCGCTTGTGG CGCAAAGGAG AGGGAAGGAC ATCGAAGAAC TGATGGTTGG CATAAGTGTC GTATGGATAA TGTACATGAG GTTTGCTCGC AGGGCAGAGG TCGGTTAAGC TTAATTCCTT GAGGCACATT CGCAAGAAGT AGGCTGATAA ATAGAAACAG GGTATCAAGG CCGCTAGAGG AGTATTTGGG AAGGCTCGGA AGTCGCCTCA TCTTACGTGG CAAGTATTTG AAGCATCAGG TGAGCAATAC GGTATAGTTG TCTTCGCAAC ATAATGAGAC TAACGGCCGT AAAGCTTTAA TGGAATACCA CACCAACAAG GACGCCGCTG TTGCTATTAG AATTTTCGAA TTGGGTCTGA AGCAGTTCTC CGAGGATGTT GACTATGTGA TCAAATACCT TCAATTCCTT TTGTCGATTA ACGACGATAA CAGTGCGTTT CATCTCACAT GCTTAATTCC TGCAAATATT GATTCTCTGC ATTTAGACGC TCGAGCTCTC TTTGAACGTT CGGTTGTCAG GATTATGGGC GACAAGGCTC GACCTCTATG GGATGCTTGG GCTCGTTATG AATACACCTA CGGTGACCTG TCTGCGGTAC ACAAACTTGA AGCTCGCATG TCTGAAGTCT TCCCCGAGGA TGCTCCTCTC AAGCGTTTCG CACAAAGATG GTCATACAAC GGAATCGATC AAATTGCTAT TCGGGATCTT GGCTTCAACC GTGCTCGAAT GGGCGTTGCT GCGCCTCCTG CATTTGCCAT CGCTCCTGTT CTCCCGCCGG TTCATGCTTC TATTCCCGCC CCTATTGCCG TCCCTACCCC AGTGCAGCCT CCTCAAGAAT CATACAAACG TCCTGCTCCC GAAGATATCC CCCCACGACG GCCTTCCTCT GCAGAATTCT CTCGTTCTCC CAAGCGTCAT CGTGCGCAGT CTCCCCCTCG CCGTTATCCT GAGCGTGATG ACCGTCCTCC TCCAGGCCGA TATCGTGACT CACTCCCGCC TGTCAAGGCC CCATCAAGCA TTCCACCACC TCCTCTCGCA GGACCAGCGT ATGCAACACC TTCCAGTGGA GCGTACGGGG GCGACAAAGA CAGGAGTGGT CTAGAAAAGC CTCTGGCGTG GTTCATGGCA CAACTGCCCA ATGCTCGTTC ATTTGATGGT GAGTTATAGC GTGACGCAGC GTTGAATGTA TAGGTGCTCA TTGCTAATGA CCCCAATCAG GCCCTGTTTT CCGTCCTGAC GATATAATGA AGCTTTTCGG TGGACTTTCT CTGCCAGGTG CAGGTATGCC CCCTGCACCT CCTATCAGCA GAGGTCCTCC CCCGCCACCC ATGCAAAGTA GAGGATATTA TGAACGTAAG 
TTTCGCCGTT GTCACAGCCT CTCAGCATCA GATACTAATC ATTGTGCACA GCTGAAAGGG ACAGAAGATA TGGGGGACAT GGAAGCGGAA GGTACTGATT GAGGAGGTAA TTGTTCAATG TTCATCATGT TGGAGTGAAA AATGTGGCAA GATAGACAAA GTTGAATGCA TCAACACAAG AC
|
Protein sequence | MSHEHPAEIV HQLQSIDSIQ QDLADTAAAV VDAASQSLPP QHAAQDQKSH STLLDNAEAM ESILESAIPA APGSSSTAAA SDAIGDNNFT ASGPPPISNT VVAPSAADDE TGDVIVVESE TPTTTNVVEA VVNPENDGDV PIESTEQSSG QPAEQPTEQP TEQPTEQPAE QPAVETPQAP QSTPAAPAAS AVESIPTTLE ETHLEQIVQA PAPTVHTEPL TPVVDIKMEE KPMIEHVAWI PPQGIHSDVF LPEGLTEYSP SVSQNGELIR SWRADPSNPT LLLSLFNWAV QKTEVEDARA WYRVLAVDNP TAAQPLLALI NLELALSNFA EVEAIFASTL KGSAGITTAA DVSIWAAYLH YIRRQNPLTE GSANAADVRS TITEAYEFAL RECGFDRESG DIWDEYIKFV ASGPATNQWD TQAKNDNLRK IYQRAVCIPL NNIEALWKSY DNFESSLNKL TAKKYLAEKS PAYMTARTAL RELRALSDPI PKPILPPYPT FTEQDRQVVG AWKACLRWEE GNPLVIENHE LLQSRIGYAL RKCLGEMRHF PELWHYAASY YSKLGKQDEA AEILEAGVNA CPKSFLLTFA YAELQEERKA FPTCHSLYTT LISKLNPEVD ELRQNVAREI DIARGPPIPG SEKAAVAAAV GDSIDADGND ISDIQRLVEE REQRGALVAQ RRGKDIEELM VGISVVWIMY MRFARRAEGI KAARGVFGKA RKSPHLTWQV FEASALMEYH TNKDAAVAIR IFELGLKQFS EDVDYVIKYL QFLLSINDDN NARALFERSV VRIMGDKARP LWDAWARYEY TYGDLSAVHK LEARMSEVFP EDAPLKRFAQ RWSYNGIDQI AIRDLGFNRA RMGVAAPPAF AIAPVLPPVH ASIPAPIAVP TPVQPPQESY KRPAPEDIPP RRPSSAEFSR SPKRHRAQSP PRRYPERDDR PPPGRYRDSL PPVKAPSSIP PPPLAGPAYA TPSSGAYGGD KDRSGLEKPL AWFMAQLPNA RSFDGPVFRP DDIMKLFGGL SLPGAGMPPA PPISRGPPPP PMQSRGYYEP ERDRRYGGHG SGRY
|