Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA04020 |
Symbol | |
ID | 3253394 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 1076587 |
End bp | 1078981 |
Gene Length | 2395 bp |
Protein Length | 678 aa |
Translation table | |
GC content | 52% |
IMG OID | 638252722 |
Product | nucleus protein, putative |
Protein accession | XP_566748 |
Protein GI | 58258671 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3751] Predicted proline hydroxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.201261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCCCTTCCT TCACATCAAC CACTTCCCCA ACCCCACAAC ACCATGCCAG CAGCAGTTAG AGAACGCTCG CCCAGCAGCC CATCTGGTAG GGCAGCAAAG AAGTCCCGGA ACAACACAGA ACACGCCGTG TTGAGCTCCA TCAACCACCC CTCGGCCGAG CAAGTGGCCG CCTACAGGGA AAAATACGTC AACGCAGCTC CCTTTAAACA TGCCGTCCTT AGTGACCTTT TGAGCGATGA CCTGGTATGT GCACGAGCCC TTCGGCCTGG CCAAGGCATG AAGGGATCGC ATAGCTGACG AGTACATGAA GCTTGAAGGT GTCGTGGAGG AGTCCAAGAA GTTCGGCATG AGAGGAGAGG AAGGCAGCCT CCCCGGATGG GGCTGGGAGC AAAAGGAGAC AGACATTTAT AAAATCCACC AAACTCCTGA TCTTTCTTCT CTCAGTCCTG AACACCTTCC TGATGAAACG CTCGAGGCGT TGCCATTGTT GACACGGTTG AAGGACGCTT TGTATTCCCA GGAATTTAGA AATTTGGTCC GTCAGGTTAC TGGTTGTGGT CCTCTTTCCG GTACAAAGAC CGACTTGTCT GCCGCCCTCT ACACCAAAGG GTAAGTCAAG TCATTATCCC TATATCTCTT ATTCCTAGGC TGACCTTCAT GACCAAGTTC CCACCTTCTT CTACACGACG ACTCCATCTC CACCCGTCTC ATCTCTTACA TTCTCTATCT CCCCTACTCC ATCGAAGAGG CCCCCGAGTC CCAGAACGTG GCTCTTCAAC GTTCTACGAA CGGGAAGTTC CTCAAGGGAT GGGACCCTGC TTGGGGTGGC TCTCTGGAGC TTTTTTCCGT AGAAACCGGA GAAGAAGTTG GTCCTCCCAG CGTGAAGCGA TTTGCAAAGG TCTCTGCTAC TTGGGGTCAA ATTGTCTTCT TTGAGGTACG TTTACAGTGA GAAATATAGG ATTCGAAGCT AATTGTTATC CAGGTGCAAC CGGGAAGAAG TTACCACTCT GTGGAGGAGG TTGTAATCGA TGAAGGCCGC AGGAGGTTCA GTGTCAGTGG TTGGTTCCAC CGACCCGTCG AAGGCGAGGA GGGTTATGCT CCCATTGACA AGGAGAAGGA GCAAAAGCAG CTCTCTTCTC TGGCTCAGAT TGTGAGTTTA CTATTTCAAA TATTATTATC CGTATCTTAC AGTCGCTACA GACAGCCGCT CCTTCAATGC CCTTCACCCC TTATAACACC ACTCCTCCTC CCGGCCTCAA GCCCTCCGAC ATTGCCTTCC TTTCCAACTA CCTATCTCCA TCCTACCTCA CTGTTGCCAC TCTTGAGCGA CTTTCTGGGC AATTCGTTGA AGCCTCCGAG ATTGTCTTGC ACAATTTCCT TCAGCCCGAA CTTGCGGCGA AACTCAAAGC AGAGACTGAA GGTGTTGACA AAAAGGACCA AGCTTCTTAT GAAGGCCTTC TTCCTCCTCA GGAGCTCGGT GAAGGTGACG GGTGGATCAT CCAAGGTCCT TCCTCTAAAC ACCGATACCT CAATCTCACC TCTCTTACCA CCTCCACTCC TATAGTCCAG TCTATCCATA ACGTGTTATT CCCCTCTGAG GCTTTCCGAG CATGGCTCTC TGTGGTCTCT TCTCTTGCTC CCACTGGCCA CCGCAACGAA GCTCGCCGAT TCCGAAACGG TCTCGATTAC ACTCTCGCCA ACGGTGAAGG CAAGGATGGA GATGCTAGAC TGGACGTCTC TTTGGGTATG ACATGGTGGG CCGATGTTCC GGCGGGAAGT GATGAGGAGG ATGCTTTGGT TGAAAACGGT GGTTGGGAGG CTTACCTCGC CGCTCCTGAT GAGGATGAGG ACCCTACTGT GTACCAAAGC TCTGTGGCAA AGAAGGCTGT CAAGGAACAC TCCCAGGAGC CCAAGGAACC CAACGGAAAG AAGGTTGAGG AGAAATCTAA GCCTCAGGCG AACGGTAGCA GCGAGAAGAA AGATGGACCT TCAATTTCAA TCGGCGGCCA AGAGCTTGAG TTCGACCCCG ACCAATTCTC TCCTTCTGAC TTTGACTCTG ATTCTGAAGC TGGCGACGAG GATGATGGGC CTTTGTTGAC CCAACCTGTG GCGTTCAACA AACTCTTGCT TGTTCTTCGT GATCCAGGGG TTATGAAGTT TGTAAAGTAC TTGGGAGCGA ACGCGCCAGG AAGCAGGTGG GATGTATCGG GTGAGTTCGA GGTAGGCGTC CTTGAAGAGG AGCCAGCTGA GGACGGTGCA CCCGAAGCTG AGGGTTCTGG TGAGGGCAAG GCGGATGCGT GATGTGTATG AAAGAAGTCT TCGTCATAAT TGGTCGTATT GCCTGTAGAT TGTATTTTTT TTCTACGCTT CCTTCATATT CATGCATCCT TACTT
|
Protein sequence | MPAAVRERSP SSPSGRAAKK SRNNTEHAVL SSINHPSAEQ VAAYREKYVN AAPFKHAVLS DLLSDDLLEG VVEESKKFGM RGEEGSLPGW GWEQKETDIY KIHQTPDLSS LSPEHLPDET LEALPLLTRL KDALYSQEFR NLVRQVTGCG PLSGTKTDLS AALYTKGSHL LLHDDSISTR LISYILYLPY SIEEAPESQN VALQRSTNGK FLKGWDPAWG GSLELFSVET GEEVGPPSVK RFAKVSATWG QIVFFEVQPG RSYHSVEEVV IDEGRRRFSV SGWFHRPVEG EEGYAPIDKE KEQKQLSSLA QITAAPSMPF TPYNTTPPPG LKPSDIAFLS NYLSPSYLTV ATLERLSGQF VEASEIVLHN FLQPELAAKL KAETEGVDKK DQASYEGLLP PQELGEGDGW IIQGPSSKHR YLNLTSLTTS TPIVQSIHNV LFPSEAFRAW LSVVSSLAPT GHRNEARRFR NGLDYTLANG EGKDGDARLD VSLGMTWWAD VPAGSDEEDA LVENGGWEAY LAAPDEDEDP TVYQSSVAKK AVKEHSQEPK EPNGKKVEEK SKPQANGSSE KKDGPSISIG GQELEFDPDQ FSPSDFDSDS EAGDEDDGPL LTQPVAFNKL LLVLRDPGVM KFVKYLGANA PGSRWDVSGE FEVGVLEEEP AEDGAPEAEG SGEGKADA
|
| |