Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE04060 |
Symbol | |
ID | 3257987 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 1145159 |
End bp | 1148140 |
Gene Length | 2982 bp |
Protein Length | 615 aa |
Translation table | |
GC content | 49% |
IMG OID | 638256989 |
Product | conserved hypothetical protein |
Protein accession | XP_571060 |
Protein GI | 58267808 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | [TIGR01622] splicing factor, CC1-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.639663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAATATCTCA ATCTTCCTTC CTCACTCCTT AAACATTCTT AAAGACTTTT TATATCTAAT TTTAGCGAAG TCCATAATAG ATGTCAGCAA CTCCACCTAG GTCTTCTTAC CAGTCTGCCA ATGGTGGCGA CAAGACTCCT GAGCTCACCA CGTAAGTTGC CATCGTCTGT CTTTCGTCCC CAGTCTCTGA CTGCTCTGTA CAGTAAACGT GAGAAACGCC ATCGTGAGGA TGATGAAGAT GATCGTGACG TATCCGGCAG GTCTCATCGA TATCGTCCTG AAGACGTCCG CGACTACGAC CGTGACAGAA GTGATAGGGG AGACAGGGAC CGGGACAGAG AGAGGCGACA CCGCCACCGA CGCCGTGACG AAACTGAAGA GGAACGTAAG GAACGCCACC GTCGACGTGA GGAGGAGACC GAGGAAGAAC GCGAGGAGAG GCATCGACGA AGGAGGGAGC GTGAATTGAG GGAACGCGAG CGTGAACGAG AGGGCGAGGA CGATTATCGT CGAGGATCGA GAGAAATGTC TGTGGGTAGA CCCTTGAGCC ATAGGGACAG ATCAAGAGAG AGTCACCGCA GTTATAGGAG CCATCGCGAC AGGGACGAGA TGCCCCCCAT TAGGCCTATG AGTAGGGAGC CTAGGGACAT GGCAGAAGAG CGTAGGGAAA ATGACAGGCG TAGGGAAATG CATTTTGCAG ACGTAAGTCC CCAAACATCC TTATGCAGTA TCATACTAAT TGTCGTCGTA GCTTGAACGT GATGGTCGGA TGCGGTCTCC GCCCCCACGA CGCCGTCGTC TTTCGCCCGA GTATGGGGGT CCTCGCGGTC CGCCTCCCCG TCGTCCGCCT CCTCCCCCTC GTGACCCAGC TACGGCTTTG ATTGAAGAGG TAGACTCTGA AGCTCGTTCC ATCTTTGTCT CACAACTTTC TGCTAGGATG ACTTCCCAGG TTCTCGGCCT GTTCTTTGAA GACAAACTTG GCAGGGGTGC TGTTAGAGAC GCCAGAGTTG TGACCGACAA GGTTGCTAGG AGGTCCAAAG GGTAAGTCTT CAGGTATATA TGCCTTGCAC AGTTGATGCT GACATCTCAT AGTATCGGAT ACGTTGAGTT GGATAGCGTT GACCTCGTAA ACAAGGCTCT TGCTGTAAGT CCAATGATTA GTCAATTAGA TATCCATCTG ACCTATTTAG CTTTCTGGGA CGGTTGTCAT GGGTATTCCT ATTAATATCA TGCTTACCGA GGCAGAAAGA AACCATTCCG GCACTGAACT TATCACTGCT ACTGCCCTTG CGAGCAACGC GTGAGTATAG ATTGAGTCCT GCCAGTGGTA ACTGACTGAC TTTATACATA GGCGATCTCA TGGTGGTGGT GGTGGTCGTT CTTCTGTTCC GTTCACTCAG AACTATCCTC CCCTCTCCAC CGGTCTGGCT CTTCCCCCAG GTCTTGACCC CGACGCTCAC AAGGACGCTG CTATCCCTTA TCACCGTCTT TTCGTCTCCA ATCTCGCTTT CTCTCTGACC GCTGATGATG TGAGGCAGGT GTTCGAGCCG TTCGGCGAGA TTGAGTTTGT CGATCTTCAC ACGGATCTTG TAAGTTCATA TGGTTTGATT ACTTGTCAAG GCCTAGCTGA TGTACGATTT AGAGCGGACT GAGGAAGGGT ACAGCTTACG TCCAGTTCAA AGATGTCAAG TCTGCACAAA TGGCCCTTGA TGCGATGGCT GGATTTGACC TCGCAGGACG TCTCATCAAG GTCCAGACTA TTCAAGAACG CGGTACTTAC CAGACACCAG ACTTGATCGA AGATAGTGGC AACTATGGCA CCCGACTTGA CGCCAACCAG AGGCAACAGC TCATGTTTAA GCTCGCCAGG ACCGAGCCCA ATGTCAACTT GTCGTTGTCC GCCCCCAAGA TTAACGGTTC TCAGTAAGTG CACCGATATC CGTTTACTGG AGTCGAGTGT TGACGCCTGT GAATTAGGTC CAAGATACCA GCGATGGACC CTACTCCTCG AATCGTTGTT CATAACATGT TTAATCCCGA AGAAGAAACC GAGAGGAACT GGGATCTGGA CCTTGCCGAA GACGTCAAGG GCGAAGTTGA GTCCAAGTAC GGCAGAGTCA AGAGGATTAA GGTCGAGAAG ATGTCTGCAG TATGTCGTTC TGTTTCCCCA ATAAAGTAGC ATGTTCTGAT GTCGTTTTCT AGGGCGAGGT GTACATTGAA TTCATTGACA CTGACTCCGC TATCAAAGCT GTCAAGGGCC TCAATGGTCG ATTCTTTGGT GGACGCCAGT TGCAAGCAGG ATACATCACC GAGGCTTTGT TCAATGCGCA CCTCTAAACG GTATGCAGCA AAGCATCGTT TCACGGATGT GTAGTATGGT GGAACAAAAA TATTCCTTTT TTGTTTTCTG ATCTTTTGCT TAAAAGTATG GACCGTGTTA TTTATGGATC TTTGTGTTTA CAGGTTATTT CCAACCCCAA CTCATATTGG GTTATAACGC CGATCGCCTA CAAAATACAA ACTTCCATCC TCGATCCTCC TTCTCACCAT TCCGTGGCTG ATTCGACAAT CGTTGATGGT AAATTATAGG CGAGTTATAT CCCCCACTTT TTTCAGTTGC TTTCCTTTTC TCGAAGTGAA TCGCCTGTCT GGGATAAGTG GTCGTGTATG ACTTGCCTCC CTGCTGACTT GATACTGAAG GCTTATATAA ATCGTCACAA ATCTGTGTAG CGAGAAGGAG GTGAGTGTGG CACACGTAAA TCCCGTCATG CCATTCAGGT ATGGGCTTCC GGGTTGCATT TATTCTAGGC TCGACCCAAA TCCCTCGACG GACATCGGCC GGTCTCCGAC GAGGTACGAA TGGTTTGTCG CAAGTTGCAA CTACTTTTCT GTTCGTTGTC GGGGTGATTT TGGCCTTTTT TCGTTGGACC ATTTTCAAAC GGTCATTTGA TTGACGGTAA CTGGAACACA TTGACTGACC CT
|
Protein sequence | MSATPPRSSY QSANGGDKTP ELTTKREKRH REDDEDDRDV SGRSHRYRPE DVRDYDRDRS DRGDRDRDRE RRHRHRRRDE TEEERKERHR RREEETEEER EERHRRRRER ELRERERERE GEDDYRRGSR EMSVGRPLSH RDRSRESHRS YRSHRDRDEM PPIRPMSREP RDMAEERREN DRRREMHFAD LERDGRMRSP PPRRRRLSPE YGGPRGPPPR RPPPPPRDPA TALIEEVDSE ARSIFVSQLS ARMTSQVLGL FFEDKLGRGA VRDARVVTDK VARRSKGIGY VELDSVDLVN KALALSGTVV MGIPINIMLT EAERNHSGTE LITATALASN ARSHGGGGGR SSVPFTQNYP PLSTGLALPP GLDPDAHKDA AIPYHRLFVS NLAFSLTADD VRQVFEPFGE IEFVDLHTDL SGLRKGTAYV QFKDVKSAQM ALDAMAGFDL AGRLIKVQTI QERGTYQTPD LIEDSGNYGT RLDANQRQQL MFKLARTEPN VNLSLSAPKI NGSQSKIPAM DPTPRIVVHN MFNPEEETER NWDLDLAEDV KGEVESKYGR VKRIKVEKMS AGEVYIEFID TDSAIKAVKG LNGRFFGGRQ LQAGYITEAL FNAHL
|
| |