Gene CNA04020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA04020 
Symbol 
ID3253394 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1076587 
End bp1078981 
Gene Length2395 bp 
Protein Length678 aa 
Translation table 
GC content52% 
IMG OID638252722 
Productnucleus protein, putative 
Protein accessionXP_566748 
Protein GI58258671 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3751] Predicted proline hydroxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.201261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCCCTTCCT TCACATCAAC CACTTCCCCA ACCCCACAAC ACCATGCCAG CAGCAGTTAG 
AGAACGCTCG CCCAGCAGCC CATCTGGTAG GGCAGCAAAG AAGTCCCGGA ACAACACAGA
ACACGCCGTG TTGAGCTCCA TCAACCACCC CTCGGCCGAG CAAGTGGCCG CCTACAGGGA
AAAATACGTC AACGCAGCTC CCTTTAAACA TGCCGTCCTT AGTGACCTTT TGAGCGATGA
CCTGGTATGT GCACGAGCCC TTCGGCCTGG CCAAGGCATG AAGGGATCGC ATAGCTGACG
AGTACATGAA GCTTGAAGGT GTCGTGGAGG AGTCCAAGAA GTTCGGCATG AGAGGAGAGG
AAGGCAGCCT CCCCGGATGG GGCTGGGAGC AAAAGGAGAC AGACATTTAT AAAATCCACC
AAACTCCTGA TCTTTCTTCT CTCAGTCCTG AACACCTTCC TGATGAAACG CTCGAGGCGT
TGCCATTGTT GACACGGTTG AAGGACGCTT TGTATTCCCA GGAATTTAGA AATTTGGTCC
GTCAGGTTAC TGGTTGTGGT CCTCTTTCCG GTACAAAGAC CGACTTGTCT GCCGCCCTCT
ACACCAAAGG GTAAGTCAAG TCATTATCCC TATATCTCTT ATTCCTAGGC TGACCTTCAT
GACCAAGTTC CCACCTTCTT CTACACGACG ACTCCATCTC CACCCGTCTC ATCTCTTACA
TTCTCTATCT CCCCTACTCC ATCGAAGAGG CCCCCGAGTC CCAGAACGTG GCTCTTCAAC
GTTCTACGAA CGGGAAGTTC CTCAAGGGAT GGGACCCTGC TTGGGGTGGC TCTCTGGAGC
TTTTTTCCGT AGAAACCGGA GAAGAAGTTG GTCCTCCCAG CGTGAAGCGA TTTGCAAAGG
TCTCTGCTAC TTGGGGTCAA ATTGTCTTCT TTGAGGTACG TTTACAGTGA GAAATATAGG
ATTCGAAGCT AATTGTTATC CAGGTGCAAC CGGGAAGAAG TTACCACTCT GTGGAGGAGG
TTGTAATCGA TGAAGGCCGC AGGAGGTTCA GTGTCAGTGG TTGGTTCCAC CGACCCGTCG
AAGGCGAGGA GGGTTATGCT CCCATTGACA AGGAGAAGGA GCAAAAGCAG CTCTCTTCTC
TGGCTCAGAT TGTGAGTTTA CTATTTCAAA TATTATTATC CGTATCTTAC AGTCGCTACA
GACAGCCGCT CCTTCAATGC CCTTCACCCC TTATAACACC ACTCCTCCTC CCGGCCTCAA
GCCCTCCGAC ATTGCCTTCC TTTCCAACTA CCTATCTCCA TCCTACCTCA CTGTTGCCAC
TCTTGAGCGA CTTTCTGGGC AATTCGTTGA AGCCTCCGAG ATTGTCTTGC ACAATTTCCT
TCAGCCCGAA CTTGCGGCGA AACTCAAAGC AGAGACTGAA GGTGTTGACA AAAAGGACCA
AGCTTCTTAT GAAGGCCTTC TTCCTCCTCA GGAGCTCGGT GAAGGTGACG GGTGGATCAT
CCAAGGTCCT TCCTCTAAAC ACCGATACCT CAATCTCACC TCTCTTACCA CCTCCACTCC
TATAGTCCAG TCTATCCATA ACGTGTTATT CCCCTCTGAG GCTTTCCGAG CATGGCTCTC
TGTGGTCTCT TCTCTTGCTC CCACTGGCCA CCGCAACGAA GCTCGCCGAT TCCGAAACGG
TCTCGATTAC ACTCTCGCCA ACGGTGAAGG CAAGGATGGA GATGCTAGAC TGGACGTCTC
TTTGGGTATG ACATGGTGGG CCGATGTTCC GGCGGGAAGT GATGAGGAGG ATGCTTTGGT
TGAAAACGGT GGTTGGGAGG CTTACCTCGC CGCTCCTGAT GAGGATGAGG ACCCTACTGT
GTACCAAAGC TCTGTGGCAA AGAAGGCTGT CAAGGAACAC TCCCAGGAGC CCAAGGAACC
CAACGGAAAG AAGGTTGAGG AGAAATCTAA GCCTCAGGCG AACGGTAGCA GCGAGAAGAA
AGATGGACCT TCAATTTCAA TCGGCGGCCA AGAGCTTGAG TTCGACCCCG ACCAATTCTC
TCCTTCTGAC TTTGACTCTG ATTCTGAAGC TGGCGACGAG GATGATGGGC CTTTGTTGAC
CCAACCTGTG GCGTTCAACA AACTCTTGCT TGTTCTTCGT GATCCAGGGG TTATGAAGTT
TGTAAAGTAC TTGGGAGCGA ACGCGCCAGG AAGCAGGTGG GATGTATCGG GTGAGTTCGA
GGTAGGCGTC CTTGAAGAGG AGCCAGCTGA GGACGGTGCA CCCGAAGCTG AGGGTTCTGG
TGAGGGCAAG GCGGATGCGT GATGTGTATG AAAGAAGTCT TCGTCATAAT TGGTCGTATT
GCCTGTAGAT TGTATTTTTT TTCTACGCTT CCTTCATATT CATGCATCCT TACTT
 
Protein sequence
MPAAVRERSP SSPSGRAAKK SRNNTEHAVL SSINHPSAEQ VAAYREKYVN AAPFKHAVLS 
DLLSDDLLEG VVEESKKFGM RGEEGSLPGW GWEQKETDIY KIHQTPDLSS LSPEHLPDET
LEALPLLTRL KDALYSQEFR NLVRQVTGCG PLSGTKTDLS AALYTKGSHL LLHDDSISTR
LISYILYLPY SIEEAPESQN VALQRSTNGK FLKGWDPAWG GSLELFSVET GEEVGPPSVK
RFAKVSATWG QIVFFEVQPG RSYHSVEEVV IDEGRRRFSV SGWFHRPVEG EEGYAPIDKE
KEQKQLSSLA QITAAPSMPF TPYNTTPPPG LKPSDIAFLS NYLSPSYLTV ATLERLSGQF
VEASEIVLHN FLQPELAAKL KAETEGVDKK DQASYEGLLP PQELGEGDGW IIQGPSSKHR
YLNLTSLTTS TPIVQSIHNV LFPSEAFRAW LSVVSSLAPT GHRNEARRFR NGLDYTLANG
EGKDGDARLD VSLGMTWWAD VPAGSDEEDA LVENGGWEAY LAAPDEDEDP TVYQSSVAKK
AVKEHSQEPK EPNGKKVEEK SKPQANGSSE KKDGPSISIG GQELEFDPDQ FSPSDFDSDS
EAGDEDDGPL LTQPVAFNKL LLVLRDPGVM KFVKYLGANA PGSRWDVSGE FEVGVLEEEP
AEDGAPEAEG SGEGKADA