Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC02980 |
Symbol | |
ID | 3256140 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 957390 |
End bp | 959286 |
Gene Length | 1897 bp |
Protein Length | 569 aa |
Translation table | |
GC content | 61% |
IMG OID | 638255520 |
Product | small nuclear ribonucleoprotein hPrp3, putative |
Protein accession | XP_569946 |
Protein GI | 58265580 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.326844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGTTG AAATAGCGAT GGATGTGTGT TACGTAATGT CGCGGATGTG TATATTGCCA GCGAATCGAG TGTTTGCGAG TGCACATGCG CCATGTCCCC CCCGAAGCGC CCCGGCTCCC CCGCGGCCCC CCCCCCCGCC AAGCGCCCGC CGCCCGCCGC CCCGCTCGAC ATCGCCGCCA TCCGCGCCCA GCTCGCTGCG AAAAAGGCCG CCCTCCAGCC CCCCCCCGGC CCCGCCGCGC TCCCCCCCCA GCCCCCCGTC CACGCCGACG TCGCACACAA GCTCGCAGCC GCCAAGGCCC GTATAGAAGC CCTCAATGCC CGGGCCGCAA ATCCCTACCT CTCCGGCGCC CCAGCCCCTG CGCCAGCGCC GGCCGCCCCC CAGCCGGGCG TCACCTCCAT CGCGCTCCAC CCCCTGCTCA TGGGCGGCGA TGCTCAGCAG CCCGCCGCCC AGGCAGACAG GAACGAAAAG CGGGCCATGC GCGACCGCTA CAAGACGATG GCGCCCAAGT TCACGTCCGT GAGGGCCAAC GCCAGCGCGG CCGCATCCCA GGCGCCGTCC CGGGCCTCGC CTGCCGTCAC CGCGGCGCCG GTGCTCAACC CGTATGCGTC CGCGTCGGCC GCCAACTCGC CTGCGCCAGA CGAGGAGAGA GCGCCCACGC GCAAGTCCAA AAAGCTCCAG TTCAGCAGGG CGGGTAAATA TGTCGAGCAG GGCGAGCAGC TGAGGAACGA ACAAAAGATG GAGGCTCTGC GGCAGAGGAT CGCAGAGGCC AGCCGGAAGG CTGGTCTCGA CAGCGAATTC GATACCCTCG AAAGGAGTCT CAAGGTGTGT CTTTTTCGCT CGCAAAGAGC CGGGCTGACC CGTTTCCGCA GCGACAACCT CCGCCGGCCG TCGAATGGTG GGACGAAGCC ATTCTTCCCA AGGGCGTCAC GTACGAAGAC GACCTCGAGT CTGCCTACAA CAACCTGTCC ACCTCGTCCG ACTCTCTCAT CACCCACCTC GTTCTTCATC CTATCCCTAT CCCCGCCCCC ATGGACCGCA GACAACCCGA GCGCGGTCTT ATGCTCACCA AAAAAGAGCA AAAGAAGATG CGGCGACAGC GACGCCAGGC TGAGCTTGAA GACAAGCGCG ACCGTCAAAA GATGGGTTTG CTGCCGCCCG ACCCGCCCAA GGTCAGGCTC GCTAACCTGA TGAAGGTGCT GACCTCGGAC GCCGTCCAAG ATCCGACAAA GGTGGAGGCC AAGGTCAGGA AGGAAGTCGC GATGAGGGCG TACAAACATG AAAAGGATAA TCAAGAGCGA AAGCTGACTG CGGAAGAGAG AAAGGAAAAG GAGTATAGTC AAATGGTCGC CAGAGAGAGG AATGGTATCC GCGGTGCCGT CTTCAAGTAC GTCAGCCCCA CCCATCATTT GATTCATCCA GCTCATTATA AACTTGTTAT TAGGATCAAG TATCTGACCA ACGGCCGACA CAAATTCAAA GTCCGCGAAA CCGCCAAAGC CGATCTCCTT TCCGGTATCT GCATCTTCCA TCCCTCCTTT GCCCTCGTCA TGGTAGAGGG AGTCGAAAAG TCCATCAAAC ATTTCAAGCG TCTCATGCTC TCACGTATCG ACTGGACCGA ACAAGCGCGG CCCATGGCTG ACGGTGACGG CGGCGAGGAC GCGCCAGGTT CGGACGAAGA CATGGACGGC CGCACGAACA GCAAGGAGGG CGAACAAGAC TTGGCAGATA ACAAGTGTGA ACTCATTTGG GAAGGCGAGC TGCCCGAGCG CGTGTTTAAA ATGTTTAGAG CGAGGCATGT CGAGACGGAT AGCAAGGCGA AAGAGTGGTT GACGCCGAGG TTTGAAGCCA TGTGGGATTT GGCCAAAAGG TGGCAGTGGG CAGGGGAGGA CCTCTAG
|
Protein sequence | MHVEIAMDRP GSPAAPPPAK RPPPAAPLDI AAIRAQLAAK KAALQPPPGP AALPPQPPVH ADVAHKLAAA KARIEALNAR AANPYLSGAP APAPAPAAPQ PGVTSIALHP LLMGGDAQQP AAQADRNEKR AMRDRYKTMA PKFTSVRANA SAAASQAPSR ASPAVTAAPV LNPYASASAA NSPAPDEERA PTRKSKKLQF SRAGKYVEQG EQLRNEQKME ALRQRIAEAS RKAGLDSEFD TLERSLKRQP PPAVEWWDEA ILPKGVTYED DLESAYNNLS TSSDSLITHL VLHPIPIPAP MDRRQPERGL MLTKKEQKKM RRQRRQAELE DKRDRQKMGL LPPDPPKVRL ANLMKVLTSD AVQDPTKVEA KVRKEVAMRA YKHEKDNQER KLTAEERKEK EYSQMVARER NGIRGAVFKI KYLTNGRHKF KVRETAKADL LSGICIFHPS FALVMVEGVE KSIKHFKRLM LSRIDWTEQA RPMADGDGGE DAPGSDEDMD GRTNSKEGEQ DLADNKCELI WEGELPERVF KMFRARHVET DSKAKEWLTP RFEAMWDLAK RWQWAGEDL
|
| |