Gene Information |
Locus tag | CNK00330 |
Symbol | |
ID | 3254441 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 102711 |
End bp | 105521 |
Gene Length | 2811 bp |
Protein Length | 712 aa |
Translation table | |
GC content | 57% |
IMG OID | 638253527 |
Product | cytoplasm protein, putative |
Protein accession | XP_567602 |
Protein GI | 58260384 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
|
|
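The Start bp, End bp, and Gene Length fields above are related by simple arithmetic. The sketch below checks this, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed length). The function name `gene_length` is hypothetical and is not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 102711
END_BP = 105521
PROTEIN_LENGTH_AA = 712

length_bp = gene_length(START_BP, END_BP)
print(length_bp)  # 2811, matching the Gene Length field

# Each amino acid is encoded by three bases, so the translated codons
# account for at most this many bases of the gene span.
codon_bases = PROTEIN_LENGTH_AA * 3
print(codon_bases)  # 2136
```

The translated codons cover fewer bases than the full gene span, which is what one would expect for a gene marked as spliced.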
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.362081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGTAACGA CCCCCGCCAT GAACACGCAC TCCCCCCCCC TCCTCGGCCT CCAGGCGCTC GCCGTCTCCC CGCCCGCCTA CCCGCCCGCC TACCGCTACA ACCCCGTGTT CGGCGCCCCC CCCGCCGCCA GCCCAGACTG GTCCCCCGCG AACAAGCGCA GCTCCCGTGG CCTGCCTAAT GTGAGCCCCC CTCCCCCCCC CAGACCACGC TGACAAGCCC AGAACTGGTA CGACCCCCAC GAGTACCGCC CCGACGTCCC CTCCCCGCCC CCCACCCTCT CCCCCCCCTC CTCCGCCCCC AACCCCCGCC AGCCGTACCC GTACCAGCCC TACCCCGACG GCGCCTTCGC CACGCCCATG GACATCGTCC GCACCCCCTC GCCCCCCATC GGCTACGCGC CCATCGGCTT CGCGGGCGCC TACGCCGTTT CCCGCCAGCA CCCGCAGGCG TCCCACCCCG CCGACGGGTG GAGGAACGGC TACGCCATGC CCGCCATCCC CACGCAGGAC GACGATGTCA TCCCGACCGC CATCGTGATC AAGAACATTC CATTCGCCGT CACCCGCGAG ACACTGCTGG GCGTGATGGA GTCCCTTGGC GCGCCGCTCC CCTACGCATT CAACTACCAC CACGACAACG GTGTCTTCCG CGGACTGGCG TTTGCAAATT TCCGCGCGCC AGACGAAGCA GCGAGTGTGG TTGCCGCGTT GAACGGGTAC GATGTCCAGG GGCGGAAACT GCGGGTCGAA TACAAAAAGG TCCTGCAGCC CGGCGAAAAG GACAAGATTG AGCGGGAAAA GGCGTTGAAG CGGATGAGGA GTTTGCAGTT TGACCGTAAA GAGATGCCTC CTCCGCTCAC CCTTCCCACC AGACACCAGC AGCCCCCGAC GCTCGCCGGC TACGACCAGT CGCCACCCCA TTCCGCATCC GCCGCGTCGA CCAACTCCAA CTCGCAGGCC GACAGTCTCC CGTCCACGCT CGACATGAAC GACCCCATGA CGCTTGACAT TTACTCCAGG GTCTTGGTCT TCAAAGAAGA CCGGATGCGG GACGAGCTCG CGTTCTCAAA GAACCTCTCC CCGACCGAAC GTCGTATAGT CCATCTCGTC GCGCAAAGGC TTGGACTCAC AAGCGTCACC CGTGGCGAGG AAGAACAGTG CGTCGTGGTC ACCCGCGAAC CGCAACGTCC CGCCCTCCTG TCGACCACCT CTCTCACATC CCCCACCTAC CTCCTCCCTT ATGGCTCCCA ACAAAATTCC GCTTCCTCTA GCGGCGGCGG CGGCGGCGGA TCGGGCGACC CGTCACCGAA TTTGAGGTAC AAGAAATCCA TGCCCGACTT GCGAGGCTTC AACGGCCCCG CCGTAGCGCG TGATCCTGCG AGGAGCTTGA ACCCACAGAG AAGTTCAGGA AACATTCGAG AGATGGGTAG AGAGTATGCG AGCATGGGCG CAGTCGGAGG AAGGAGGTTG AACGGACATG CTTCCGCAGC GACGCTGAAT GTCAACGGCC AGAATCATCC CTATGGAAAC GGGAGCGGAC GAGAACAAGG GTATGGAGCG TTCAACGGCC TGTTTGGAAA CTCGGTCAAT GATATCCCGC CCGTCCCGCC TCTTCCCAGT GGATTGGGGA TGCATAACAA AATGTACTCC GTCAGCAGCG TCCATAATGT GAACGGGATC GGCCATGGCC ATACCCACTC CCATTCATAT TCCCAAACCC ATTCGCATGC GCAGAATGGA CGAGTTTCGC CCGAGGACTT GCTCTCCCCA ACTTCTACCA CCAGTTTCTC GGGACAGCAG 
CAAATACCCT CTCCCCAACC TCAGCCTTCT CAATCCCAGC AGTCGCAGTC CTCCCAACCT CCATCATCTC AGACCGCTTC TCAGTCCCAG TCCCAGTCCC AACCACTTCG CAACCCGCGT GGACCAGCCG GCGAATCCCG CGGATTCGGC GGTACCCAGT CATCTCTCAA CGGTATCAGT GGCCTCGGCG GTCTGAAATC TGTGGCGAGT AGGAACATGT TGGGCGGCGG CGGCAGCGGC GCCGGTTCGG GCTCCGCCCC CGGTCCTCCC GGTTCAGTTA TCGGTGGAGC GCCCGTCTCG TCTTCGCCCG CTTCGACAGG TAGTGCGAGC GTTGGAACCG GAGGGTACGA GAGGGATACA GAGGAAACGA TGAGGACGAG GGAGAATTTG GATATTTAAT TGTTAATGTC TCGTACATCT TTTTTTCTTT TTCAACAGAG GTGAGTTTGC GTTTTTGGGA TAACGCTTTG GGATCTATAG GCTGATTTCA TGATAGAACT TTTCCCTTCT TTTTTATGCC CCTTCAACAC TATTATTATT AATCATCTGT TGACGTTCAA TGTCCCTCGC AACGCAAGTA CGCTAGAATA CGCTTCTCAT GCCCCAACTC AATCTTGCTA CGAATTTATA TAAAAAAACA AAGACGCCAT TCGCCATTCG CCATTCCACT TTATACCTTT TTTTTCCCTC TTTTACTTTT TTCTTTTCCT TGGACGTGCC GGGGCGGAGG GAGAGGCAAA CGGGATGATT TTGGGGCACG GGGCGAAACA ATGAAAAAAA AAAAAAAAAC GAAACGATGA AATGATCAAG ACCAAACAAC ATCTTATCTA AACAACTAGA TACCTCTGTT GCCTTTTTAA CGACATTCCA AATGTAGTTC TTTTATATCT TTTCACGGCG ACGATTTGAC AAGATGCTTT TTTCTTCTAC AGTACTTTTG CCTCTTACTG ATCTCAAATT ATCCATCACC TTTTCTTGCA CGATTGATTT TACCTATCTA TACATACAAT G
|
Protein sequence | MNTHSPPLLG LQALAVSPPA YPPAYRYNPV FGAPPAASPD WSPANKRSSR GLPNNWYDPH EYRPDVPSPP PTLSPPSSAP NPRQPYPYQP YPDGAFATPM DIVRTPSPPI GYAPIGFAGA YAVSRQHPQA SHPADGWRNG YAMPAIPTQD DDVIPTAIVI KNIPFAVTRE TLLGVMESLG APLPYAFNYH HDNGVFRGLA FANFRAPDEA ASVVAALNGY DVQGRKLRVE YKKVLQPGEK DKIEREKALK RMRSLQFDRK EMPPPLTLPT RHQQPPTLAG YDQSPPHSAS AASTNSNSQA DSLPSTLDMN DPMTLDIYSR VLVFKEDRMR DELAFSKNLS PTERRIVHLV AQRLGLTSVT RGEEEQCVVV TREPQRPALL STTSLTSPTY LLPYGSQQNS ASSSGGGGGG SGDPSPNLRY KKSMPDLRGF NGPAVARDPA RSLNPQRSSG NIREMGREYA SMGAVGGRRL NGHASAATLN VNGQNHPYGN GSGREQGYGA FNGLFGNSVN DIPPVPPLPS GLGMHNKMYS VSSVHNVNGI GHGHTHSHSY SQTHSHAQNG RVSPEDLLSP TSTTSFSGQQ QIPSPQPQPS QSQQSQSSQP PSSQTASQSQ SQSQPLRNPR GPAGESRGFG GTQSSLNGIS GLGGLKSVAS RNMLGGGGSG AGSGSAPGPP GSVIGGAPVS SSPASTGSAS VGTGGYERDT EETMRTRENL DI
|
| |
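The gene sequence above is printed in space-separated blocks of ten bases. The minimal sketch below shows one way to strip that grouping and compute a GC percentage for a sequence string. The function name `gc_percent` is hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not expected to match the GC content field, which describes the whole gene.

```python
def gc_percent(grouped_sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (the ten-base grouping and any line breaks) is ignored.
    """
    seq = "".join(grouped_sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "CGAGTAACGA CCCCCGCCAT"
print(gc_percent(first_blocks))  # 65.0 (13 of 20 bases are G or C)
```

Passing the complete gene sequence string in place of `first_blocks` gives a value that can be compared against the GC content field.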