Gene CNA02310 details


Gene Information

Locus tag: CNA02310
Symbol:
ID: 3253579
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 601422
End bp: 604832
Gene Length: 3411 bp
Protein Length: 1033 aa
Translation table:
GC content: 52%
IMG OID: 638252563
Product: hypothetical protein
Protein accession: XP_567149
Protein GI: 58259473
COG category:
COG ID:
TIGRFAM ID:
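
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the listed 3411 bp) and that the coding sequence includes one stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 601422
end_bp = 604832
protein_length_aa = 1033

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is the protein codons plus one stop codon.
cds_length = (protein_length_aa + 1) * 3

# Whatever is left over is non-coding sequence inside the listed span
# (introns and any flanking untranslated bases).
non_coding = gene_length - cds_length

print(gene_length, cds_length, non_coding)
```

The leftover non-coding bases agree with the "Is gene spliced: Yes" field: the listed span is longer than the protein alone would require.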


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.22008
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTATGAACAA CCAAAACCAG CAGCCACCGG TCAACCTCTC GGACCCGCAT CTACAGGCCC 
GCCTCGCCCT CCTCGCAGCC CAGCAACAGG GCCTTGCAGA CCGGCAGCCA GTCTCCCAAG
ACCACATCGC AGCAATGGCT GCTGCCATGG GCTCCATGCA AGGACAGAAC AGGGATGCCA
TCATGAAGCA GGCAAGCTAC TTCTCCCACC GTCCACGGCC ACCGCTCACG TCTCACAGTT
ACAAGCGCTC CAGAACTCAC AAGCCCACCG CGCAAAGCTC GCGGCTCAAC AGACAGCCAA
CCAGCAGCCG CAGCCCGGAC CTAGCGCCCC ATCGCCTGCA AACCAGCCGG GTTTATCTGC
ACCATCGCCG ATTACCGCAC CCTCCCCCTC CAACCCCCTC CATCAAGATA GCTTTCAACA
TCCGGGCATG CCCAGATCTG GCTCAGGCGA TAATCTTTTA AATCAGCAAC AACAACAGCA
GCAGCAGCAG CAACAACAAC AACAACAGCA ACAACAACAA CAACAACAAC CGTTCATGTC
GCAACAGCAA CGCGAACTGT TTCTAGCACA ACAACGAGCT CATCAGCAGC AAAGCATGGC
TGGACCATCT GATGGCTCTG TTCGTCCACC ATCACAGGCT CAGCCAAATT TACCCGGTCA
ACCTCAAGGC CAGTCACCAC GACAACCGCT CAGCCACGTG CAACAGCGAC AAAACTTTTT
AAAGACATTT ATGGGCTACT TTCAAAATAT CGGACAACAG CCGCCCCCAG CCATATTCGA
TAACGGCGAA AGAGAAGGAG CGTTCAAAGT TGGAGACGGA TGGATGGACG TCATTGATCT
GCTGATGGCC GTCATGAAGG CCGGTGGTAT TATGAATGTA GGTGCCTTCA TCGAGTCTTT
TAGCATTGCC TTGATCATCC TTTCTACAGG CGATGCAACA GCCAGCAGAC AGTCCCACAT
GGCGCAATCT TTTAGCAGTC AAGAACATCC CCACGACTCT CCCTTACCCC ATCCCTGCTC
CCAAGCCCCC CAATTCAGAT CCTAATGCCC CTCCTACAAT GACGACCGAT CCTGTCCAGT
ACCTCACCGC TGCCTACTTT GCCTACATCC ACGGGTTTGA GACACATATG CAAAAAACAA
GGCAGGCTTC GTATGCTAGG CAACAGGCAA TGGCCATCGC CCAAGGTAGA CCGCCTCCAC
CGCCACCCGT CATGCCTAGC CTGGCCGGTT TCAGACTTCC AGGACAAGCC CCTAGCCCTG
CAGAAAGTTG GTCATCAGCC CAGGCACAGA CACCAGCTCC GGCCGTCCCT TCTTTACCGC
CACCCCTCAA TACATCAGCA CAACCACCAC CTGCCTCCAC CGCTCCCACG CCTACTCATC
CCGCGCCTTC ACCAAGTGAT TCTTCTACAA AATCTTCGAC TACATCTGGC CCACCTCGAG
CGAGAAAGGC ATCCAGCAAA AAGGAAAAGA AGGATCTGTC TGTCAACACC AATGTGCCAA
GCCCAATTGA CGGAGAAGCA GAGACGCCTA CGGGCAGTTC TGGCGGGAAA AAGAGGAAGA
GGAAGAATGC GAATGCACCT CAACCAGTAT GTTCAAGGCA TGCTTCAATG CGACAGTTTC
TAAATTTTTA ATTTTTTATA ATAGGAAACC GCCTCTACAC CCGCTCCGCC CCCAATTGTT
TCTACTCCTG CCCCCGAACC TCCTGCGGAA CCAACCAAGC GAGCACGCTA CCGTGTCGAA
TACCGTCCCA TCAACTTTCC TGTGCAGACA TTTGCCGGCT GGGAACCTTC CATGGTCTCC
TCCACTTTCC CTAAACACTC CCTTCGACAA GGTACCCGCC CTATCCATGA TCTCGCAGTG
GTCGACATGG AGGCCATATT GATGGGCCTG AGAAGCCGTA TGCCAAAGGA ACTGGGTTAT
GCGGTCACAG TTTTAAATAT GCTGTCAATG TCGCACCCCG AAGAGAATAT CAACGGCCTG
CCGCTGCATC ACCTGAGAGA GATTTTCATC GAGCTCTTGG ATTTGACGGA AGAGGCCGCA
TTTGGGGATG GAGGGCGGAG TGGATGGCTG AAAAGTTGGC ATGATCTGAA TGACGTCAAG
GAAGAATCGG CTGCCGACGA TAAGAACAGC TGTATGGACA ATCTGAACAA GATGCCGTTT
TTCGAACTAG AACGATTGGG AAGGGATTTT GATTTCTCGG TATACAAGGA TGAGGAAGAG
TATCAATGGA GAAAAGAGGA AACGAGCGGG AGCACTGGAA TCGTTCTCGC GTGTATCAAT
ATGCTCCGCA ACTTTTCGAT GCTTCCAGAC AATCAAGAAC TCATGGCATC CTATCCTCAA
ATGATCAACC TCTTAGCTTC CATCTCAGAT GCCCGATTGT GTCGTTTGCC CGGAGAGAAT
TGCACCAAAA TCAAACGACC ATTTTCCATC ATAGAACTCG CCCGAATCCG CCGTGATTGC
GTCAGCATCC TTGTCAACAT TGGCGAGTAC GTTGAGCTCC CTCGGGTACC CTCGTCTTCA
AGCCTTGCCA TCTTCCGACT CTTATCCACC TTTATCGCAT CAGGCTGGGA GTCCAATGCC
CTTAGTGAGC CTGTTTATGG TCCCACGCTT TCATCATCCA TCCGCGATGT CGGTCCGCCC
ACCGTCGTCC CATCCATTGA CCGTGCCCTC GCCGCTTTTT CCCTTCTTGC CCAACCAGAT
GCAAATCGCG AGGCACTGGG GTCGTCTGTT CCCCCTTCAG AGCTGATAGA CATGTACGAG
TCCCTTCTCA AGCTCTTACC CGTCACCAAA CGCCAATTCG AAGCGATGCA TAGCATCGAA
GAAACCTTGG GCTACAACGA AACACTAGCC CTATGCCTTT ACTCCCTGGC CTTTCTCTCT
CCCTTGCATG TCAGAGCAAG TATGCGCAAC GTACCGGGAA GCGTACCGCT TCTCACCCGC
ATCATCTTTG ACACCGCTCT TCAAAAATCC GACTATCGCT CAAACCCTTT CGGAATCCTC
TGCCGTCGCT TATGCGAAAC CCTAGGCGTC CTCAACGGGA CCGTCTCGCC AGCAGGCACT
GTTGAAGGGC CATCTGGGAT GGGATTCGGC GCAGGAGGGA TCGAAGGTAG CGGCTGGAAG
TTTGCTAGTG GAAGGGTGGA AAACGGATGG TTGGCAGGTA AAGAAGAAGG TGTGTTGGGT
GCTATTTTAG GCGTAAAAGG AATGAGTTGG GCCGCGTTGG GAGAGCTGGA CGGTATGGTC
TGGGGTGGTG ATTCGATATA AATAAAATAA TGATAATAAT AATAATAGAG TGGATAATAA
TAGAGTGGGC TATAATTCAT TCATGTCGTG TAGTATAATT ATATGTAGTA GTCAGCCACG
CACGGATGTG GTGTAATATA GCTTAGCTGG GGTATAAAAT AGTGGAAGTC A
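
The sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how such text could be cleaned and its GC content computed is shown below; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample is only the first line of the gene sequence, so its value will not equal the whole-gene GC content listed in the record.

```python
def clean_sequence(text):
    """Drop spaces, newlines and any other non-nucleotide characters."""
    return "".join(ch for ch in text.upper() if ch in "ACGTN")


def gc_percent(text):
    """Percentage of G and C among the unambiguous bases (A, C, G, T)."""
    counted = [base for base in clean_sequence(text) if base in "ACGT"]
    if not counted:
        return 0.0
    gc = sum(base in "GC" for base in counted)
    return 100.0 * gc / len(counted)


# First line of the gene sequence, exactly as printed above.
sample = "CTATGAACAA CCAAAACCAG CAGCCACCGG TCAACCTCTC GGACCCGCAT CTACAGGCCC"

print(len(clean_sequence(sample)))      # number of bases in the sample
print(round(gc_percent(sample), 1))     # GC percentage of the sample only
```

Passing the full gene sequence text to `gc_percent` would give a figure comparable to the GC content field.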
 
Protein sequence
MNNQNQQPPV NLSDPHLQAR LALLAAQQQG LADRQPVSQD HIAAMAAAMG SMQGQNRDAI 
MKQLQALQNS QAHRAKLAAQ QTANQQPQPG PSAPSPANQP GLSAPSPITA PSPSNPLHQD
SFQHPGMPRS GSGDNLLNQQ QQQQQQQQQQ QQQQQQQQQQ PFMSQQQREL FLAQQRAHQQ
QSMAGPSDGS VRPPSQAQPN LPGQPQGQSP RQPLSHVQQR QNFLKTFMGY FQNIGQQPPP
AIFDNGEREG AFKVGDGWMD VIDLLMAVMK AGGIMNAMQQ PADSPTWRNL LAVKNIPTTL
PYPIPAPKPP NSDPNAPPTM TTDPVQYLTA AYFAYIHGFE THMQKTRQAS YARQQAMAIA
QGRPPPPPPV MPSLAGFRLP GQAPSPAESW SSAQAQTPAP AVPSLPPPLN TSAQPPPAST
APTPTHPAPS PSDSSTKSST TSGPPRARKA SSKKEKKDLS VNTNVPSPID GEAETPTGSS
GGKKRKRKNA NAPQPETAST PAPPPIVSTP APEPPAEPTK RARYRVEYRP INFPVQTFAG
WEPSMVSSTF PKHSLRQGTR PIHDLAVVDM EAILMGLRSR MPKELGYAVT VLNMLSMSHP
EENINGLPLH HLREIFIELL DLTEEAAFGD GGRSGWLKSW HDLNDVKEES AADDKNSCMD
NLNKMPFFEL ERLGRDFDFS VYKDEEEYQW RKEETSGSTG IVLACINMLR NFSMLPDNQE
LMASYPQMIN LLASISDARL CRLPGENCTK IKRPFSIIEL ARIRRDCVSI LVNIGEYVEL
PRVPSSSSLA IFRLLSTFIA SGWESNALSE PVYGPTLSSS IRDVGPPTVV PSIDRALAAF
SLLAQPDANR EALGSSVPPS ELIDMYESLL KLLPVTKRQF EAMHSIEETL GYNETLALCL
YSLAFLSPLH VRASMRNVPG SVPLLTRIIF DTALQKSDYR SNPFGILCRR LCETLGVLNG
TVSPAGTVEG PSGMGFGAGG IEGSGWKFAS GRVENGWLAG KEEGVLGAIL GVKGMSWAAL
GELDGMVWGG DSI
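
As a consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below assumes the standard genetic code (the Translation table field is blank in this record) and uses only the first printed line of the gene sequence; `translate` is an illustrative helper. Because the gene is marked as spliced, translating the full genomic sequence straight through would be expected to diverge from the protein after the first intron.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate complete codons from the start of dna; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, exactly as printed in this record.
first_line = "CTATGAACAA CCAAAACCAG CAGCCACCGG TCAACCTCTC GGACCCGCAT CTACAGGCCC"
dna = first_line.replace(" ", "")

# The printed sequence has bases before the first ATG, so locate it first.
start = dna.find("ATG")
peptide = translate(dna[start:])

print(start, peptide)
```

The resulting peptide can be compared by eye with the first residues of the protein sequence above.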