Gene CNB04940 details

Gene Information

Locus tag: CNB04940
Symbol:
ID: 3255765
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 1410356
End bp: 1413269
Gene Length: 2914 bp
Protein Length: 570 aa
Translation table:
GC content: 50%
IMG OID: 638255138
Product: expressed protein
Protein accession: XP_568986
Protein GI: 58263152
COG category:
COG ID:
TIGRFAM ID:
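
The gene length listed above is consistent with the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal Python sketch of the arithmetic, where `span_length` is a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(1410356, 1413269)
print(gene_length)  # 2914
```

Because the gene is spliced, this genomic span includes non-coding sequence, so it is not expected to equal three times the protein length.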


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.336379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCCACATTTT CTCATCTCAA TCATACATAG ACCCATAAGG ACAGCTAAGA CCATGGCTCA 
AAATAATGCA GCGCCTTTTG CGGGATCAAG CTCCTCAACT CCATCGATGG CCCACCCTAC
TCCCGCCAGT TCTTCTTCGG GCTTGATTCC GGGTGTGACT CGGAAGCAGA ACATTGCCTG
CGATCAATGC CGGTCAAGAA AGATCAGGTG TTTGAGAGCT GATAAGAAGG ATGTGGTAAG
CTTTACGATT TTGATATGAG AAGGCGAAAC TCACTCCAGT CCCATTGTTC TACAGTGTGA
ACAATGTAAA TCCAAAGGCA CTGAGTGTAC CAGTAATTAT ATCGAAGCCC TGGCAGAGAA
GAAGAAGAAA GCAGACGAAG AACATCTCAG CAGCTCTCGT AGGAAGAGGA GAAGAAAAAG
CAACCAAGAT CAAAGTCAAT CTCAACCACC ACCGGTACCA CTGAGTTCTG CTTCGTGTTC
GGTTCGCCCA GAGCTAGAAC GGCACATCAG CAACGAAAGC CGGGATAGCG AAGCATCTTC
TACCCCCAAG ATGGATCACA GTGGGATGCA GTCCCCTGCG ATGGGCATCG CATCAAGTAT
GGCTGGGGAT GGAGTTGATC CGGATAAGCT TCATCGGGTG CAAGCCGCTC AGCATGTCAG
GGAGAATGCC AGTGCCATAG CAGAGTGAGC CTGACCGCAG GCCATATGGG CGCGTCGTGA
TGTTAACAAA CTCCTTAGCT CCCTCCACCT ACTGGCGAAC CTAAGATCTA CCCAAAATTT
ATCCCTCAAT CTTCCTTTAC CACTCCCACT ATCTCTTCCT AATCACCCGC CACCGCAACC
ACCACCATTT CTCACTCCTG GTGAGAAGCA GCATGACCTT ATTCGCTATC TCCTCTCTCC
TTATCCCATT CTCACTCCCG TATTGGGATA TTCCGATGTA GCCAGTATTG AATCTTGCAA
GCGCGGCGAA AGCGATTTAT GGGAAGAAAT GGGAGGCAAG GTCTGGGAAG AAGAACCGAG
TGAGGTGCAT AAGAGTCTGG AAGCAGACGA ACTTACCAAG TAAGCTGTTC CTCATCACCT
TCTCTATATC TATACTAATG GCAATGCGGC TAGACTGGCC GATGACCTAA TCGACTCGTT
CTTTTCCGTA GCACACATTC GTAATCCATG TTACGATCCG CCTACATTCC GTGCACGCCT
CTACACACCC AACACCCACC CTCAAGGCCC TATCTCCCAT CCTATCCTCG CTACTGCTCT
CGCATGGGGA GCTCGTTTTT CTGATCATCC TGTTATCCAA CGAGATAGAG ATGAGTGCTC
TCAACGCCGC AGCGATGGTG AGGAACATGG AGTTCGGGAA AAGGGCAGGG GTATGAAGAG
GAGTCGGTTA GTGCAGATGG CTGTGATCCG GGCAAGGGAA GTGTGTGAAA TGTGCAGGAT
CTGGAGGATA CCGAGTATAG ATAATATCAA AGCTTTGGTC AACCTCGAAG GCTTGCTGGG
CCGTAAGTCT TCCTACCTCT TCACAAGAAT GCTCATACTG AACTCATCCT TAATTATCAG
AGGTTCTCGT CAAGAAAAAC AGTAAGTCCG CCACCGTCAT AAGAGTTTTC CGTAAGACTA
AACAGTCATG AAAAGACTAT CAAGCGGTCT ATGCCACTGC CGCTGTTAAA CACCTCTTAT
CGATGGGATA TAACTCCACA CGAGCAATCC TCACTATTCC CGACGAGAAA GAACGGACAG
ATATTGTCTT GATATGGTGG ATCCTCATCG TTACTGATAG TTATCGGTCC GTATTCTATC
GGATAAAACC TTGTTTGTAA GCCTTTTCAC TTTTCCCCCA TGGAGAACGA GGATATCGAT
CTGACGATCA CGTATCATAA CAAGATTAGA CGATGACTAT GATCTCGAAC CTCCTGGGAA
TACGGCAAAT CTCTTAAACT TTACCCCAGT AAGCCTTTCT GAGCCAAACC AAGCAATCGT
AAGTACATCT CCAACTATCC TAATCCCGCA ATGCTAAGTG CAACCGTCCC CCCTCGTCAC
TTGAAGCTAT GGTTCTCCGC CGCCCAATCC GCCGCCTCCA TGTGTCGCGC CCTCTCCCTC
CGCCTTTGCT CTCCCCATCT CCAAACTTCC GGTATCCCAT TATCCCTCCT CCGCACATTC
ATCCACTCTG CCTCTGTCTG GCGGACAAAC TACCTTTCAA AATTGGGCGT CCCACCCGTT
TGGCCCGAAA GCTGGGATTT TCTCCAAGCT ATTGCGGCAT GTTCGGCAGA CGTTTCGCAT
CATGCTCTGT GGCTTGTGCT TTATCGCGCG CTAGAAGAGT CGGGGGTTGA GGAAGAGAGG
AAAGGCGCGA TGGAAATGGG AGGTGTAGGG ATGGGGATAG GGGTTGAAGT GGAAAGTGTG
AAGAGGAGGA TAAAGGAAGA GAGTGAACAT GCGGCATTGA GGATCGCGGC GCTGGTTGCT
GTATTGGCAG AGAATGAATA TTTGAGTCTA GATCCGTGAG TCCATTCTTT CAACCTCATC
ATTAATTACA GGGCGACCAT TGATTAAAAA AAATCGTTAT TAGGCTCAGT ATCCACCATC
CAATATATGA AGCGGGCCTA CATTTAGCCC AACGCGGACG GGGGGAATGT CTGGCCTGTG
TAGTCGGGCT GAAGCAATAC GCAATCACGT TCCCTGCCAC GTGGGACAAT GCTGAGGAAC
TTGAAAATAT TTACGCAGAC AGTCAGCAGG GCGATCCACG ACTCGATAGC CGATTTCAAT
CGAGGGCCCA TAATGTTTCC ATGAATGTCG TGTCGAATGG CGAGACAGCA AGCAGCCTTA
CGGGAACTGG ATCCGCCTCC GCCGAGCCAG CCGCTATGCC GTCGATTGAG GAGCAGATGT
GGCAAGGAGA AAACATGTGG CATTTGTAGC GAGT
 
Protein sequence
MAQNNAAPFA GSSSSTPSMA HPTPASSSSG LIPGVTRKQN IACDQCRSRK IRCLRADKKD 
VCEQCKSKGT ECTSNYIEAL AEKKKKADEE HLSSSRRKRR RKSNQDQSQS QPPPVPLSSA
SCSVRPELER HISNESRDSE ASSTPKMDHS GMQSPAMGIA SSMAGDGVDP DKLHRVQAAQ
HVRENASAIA DSLHLLANLR STQNLSLNLP LPLPLSLPNH PPPQPPPFLT PGEKQHDLIR
YLLSPYPILT PVLGYSDVAS IESCKRGESD LWEEMGGKVW EEEPSEVHKS LEADELTNLS
EPNQAILWFS AAQSAASMCR ALSLRLCSPH LQTSGIPLSL LRTFIHSASV WRTNYLSKLG
VPPVWPESWD FLQAIAACSA DVSHHALWLV LYRALEESGV EEERKGAMEM GGVGMGIGVE
VESVKRRIKE ESEHAALRIA ALVAVLAENE YLSLDPLSIH HPIYEAGLHL AQRGRGECLA
CVVGLKQYAI TFPATWDNAE ELENIYADSQ QGDPRLDSRF QSRAHNVSMN VVSNGETASS
LTGTGSASAE PAAMPSIEEQ MWQGENMWHL
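
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before their lengths or base composition can be checked. The following is a minimal Python sketch of that clean-up. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the two sample strings are single lines copied from the blocks above rather than the full sequences.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups over one or more lines."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and last line of the protein sequence above.
dna_line = "GCCACATTTT CTCATCTCAA TCATACATAG ACCCATAAGG ACAGCTAAGA CCATGGCTCA"
protein_line = "LTGTGSASAE PAAMPSIEEQ MWQGENMWHL"

print(len(clean_sequence(dna_line)))      # 60
print(len(clean_sequence(protein_line)))  # 30
print(round(gc_fraction(clean_sequence(dna_line)), 2))
```

Applying `clean_sequence` to the complete gene and protein blocks and taking `len` of each result gives values that can be compared with the Gene Length and Protein Length fields of the record.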