Gene CNC00040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC00040 
Symbol 
ID3256521 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp11105 
End bp14777 
Gene Length3673 bp 
Protein Length785 aa 
Translation table 
GC content46% 
IMG OID638255224 
Productexpressed protein 
Protein accessionXP_569355 
Protein GI58264398 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.685236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCTGTTTTC CGATTATCTC CAAGCATTCA CCATCCCCTA TCTTCCGGTC ATGTCCATCC 
ATGACTCCAC TTTCTCGGCC CTCAAGGACA TGGGCATCGA CCCAATGTTG GCTCGAGAAG
CAGCTTCTCG TTTTCATTCT GTTGAGCCGG CCGTGAATTG GTGTTTTGGT GATGGCGCCA
ATGTGAGTTC TTCGCGATAC CTGGTCGCAG AAAGACATTA ACCCCTAGCA GTGGCAACCA
ACTACGGTAG CGGAATCTAT GTACGGTGGA TTCAACCCCA GACACCGTGA AGGAACATCA
GGTTTGTGCT TATCAGTCCG ACATTTTGAA ACCCAACATC TGATGTATAC CCGCAGTGGA
ACATCGCGAG GTCATTGAAG TCTTTGACTC CCCATCTGGC TCTCCCAAAC CACGACACTC
ACATATTCCT CTCGGTTCCA ATAACCCTTT CCTTCCCACT CAACCTAGAT CGCCACATCC
GGCTTCCCAC CCTTTACCCC AACCGCCTTT GCCTCCTCGT TCACCGTCCG TGCGTCTCGG
TACGCAGGGC CTACCAGTTG AGGTAGACGA TGGTAATGAT TCCGAAGGTG AGGAACTAAG
AAAAGCTATT GCTCTTTCGC AAAATGTGGA ACATTCTGTG TCAGAAGCAG CACCATTGCA
AGATGCGATG GATGTTGATG GGAAGCGAGA TTCAAGAGAA AGAAGCGAAA GGGCTAGCGG
AGCGCCGCCA CCTAGTCCGA AATCTGATAT GCCTGGGACC ATTGGAGTGA CTTTCGGTCC
GAGCACGAAA GACGATGTTG ATGGGAAGTT AGCATTGGTA CCGACATCGC AGGTAGGTCA
CTTTTCGCCA TGAAAATATT GTTGGTGCTA ATGCTAACCA CATCATTGAA GAATAATAAT
ACGACTTCAA ATGAAGACGC AGACCTGGAT AAAGCCATAC AAGAGTCGCT TATGACGGCC
AGTTTCCATT CCGCATCTGC TGAAGCGAAT AAGGAAAGAC CGCAGCCAAC AAAAAGGGGC
GATGGAGTGT GAGTCCTTAA CATTTTTTCT ATATAAACAT CTGACAAACC CGTTCAGCCC
ACTTGTTTTC TATTCCCCGT CCGGACGTTA CACATACGTT GCCCAGATTT TGCAAGCTTT
CGCCGCTGTC CCGCGTATCC AGACTGCCTT TCAGGAAAAT GTTGATTCTG CGTCTAGTGG
AGATGACATT CTAACTGAGC GTAAGTACAC TGTGACGAGC TATCGTATTA TATCAACGAA
ACTAATTGAG CGTTCAGGAA CTCAAGCATT AAGTGACTCA TTCGGACGTA TGATTGATGA
ACCGTTCAGC TTTTACGAGA TTGACGATAC GGCCAAAATT ATCACTGAAG GCTCTGAGAG
GAATCAGCTT CCGCCCCTGT TACCTACAAT GGGTGCGTCC AAGTCCCCTT TATGACTAGT
ACTCGATCCT GAAAGTTGAA CAGATTTTCA ACTGATATTC ATTGATCTCT TTGAGCGTGC
GGCGATACCG AGGTTGCAAG GGATGGCCAA GGACGAAGAC GAGCTCAATC GTCTTCGAGA
AGAGTGTCGT CGGTACGTTT GCATTAACGG CATTTCTCTT CATTCTTTTG CAACTGACCT
CTGTTGCGTC CCGTGTGCAG CGTATTTGAA ACTAAAGTCA CCGGCCATAC TCCAGATTCC
ATTACGTCGG CCTCCTACGT CACATTCGTC CGCCAGCCTA CAGGCAATGA CGTGTACACC
CAACTCGCGG ATGTGTTCTG GGGTCCCGAA GCCCAAACAA CGTCGATCAA AAAGTTGGGA
GATGTTTTGA CGGTGTATCT GGATTGGAAG CCCAGTTCAG CGAGAGAGGT TTGGAAGTTG
GATGAAGAGA TTACGCTTGG TAAGTGGCAT TGGCCTCGCA CTTTCGGAAA TACAAATGAC
TATGCAGTAT TGATATATGG GATATAGATC GGTTTTTACA GAGCAATGCG GCGTGGGCTG
CTAATAGGAG AGGGTTACAA GCCGTTTCCG CAGGTAACGC GAGAAGGATC AAGGAAAAGA
TTGAGTCTCT GACTCTGCAC GATGTATGTG AATAAACTAT TTGCACCAGT TCGATTTCTA
ACGGAAATTC CGGCCGTCTA GGGCAATAAT TACCTCGATA GTTTTGCAGC GTAAGTAATT
ATTACCTGAG AAATAAAGAT GTATTATACT GACTCGTCAT TAGACTTATA GAAAACCTGC
GGACAAAACC GAAGAACGTC GATATAACGC GAGCAGTATC TCGCGAGGGA ATGAAGGGCA
AGGTCAGTTG CTTCACAGCG AGTTCTTGTT GTGATGATGG CTGACTGGAT TGCTGTTAAT
AGATTGAGAC GATACTCACG GCTCTCAAAA CAAAAGTCGC AGATTTGGAG GAAGAGCTAA
AAGAACAGGA AAAAGCAGCT TCTGCAGAGG CGTTTGAGAC AGACGATCCT CAGTATAACC
AGCATGTCGT GAGTCATACC TACGTGTCGA TGTATAAGAC AGATGTTGAT CGTATACCCA
GTTCAATTTG CGAGCAATAT TGTTCCATGA TGGTGCTATG GTTGGTCAAA AACATCTGTA
TATGTATGTT AGAGGCAAAG ATGATAGATG GTGGAAGATA CAAGAGCATG AGGCTACAGA
AGTGAGTACT GGAGTCTTTT CATGGTAGGT AGCTGACTGT CGTTTGTAGG TCGAGTGGGA
AGCGATCATT GGAGATAGGA CTGGTATTTG GATGGAAGGG GGACCGTATA TGGTACGTCT
TCGAGATCTT TGAACGGCTG GTTAACAATG CCACAGTTGT TATATGCTCG TCAAGGCGAT
CGACCTTCAT CACCTGAATC ATCTCCAACC CGACCAAGTG ACGTCGACGT CCCTCCTCAC
TCCAACTCTA TCATCCTCGA AAGCAACAAC ACCTTAACTC CTACCCTTCC TCCTGACTAT
TCAGAATCCA CATCTACTTC GGCATCCAAT GGATCGAAAT CCCAGTCGTC AGACACTAAT
GAAAAGAATG GTGAAGGGCT ACTGTTGATG GATATGGAAA CCCCTGAGGC AGAGACAGAA
TGGGTGGAGG ATGCAGGGAG CGAGGCCAAT AAGGATTTGA TGGACATCGT AATGGGTGAA
GGTGGGTCAC CAAAAAAGGA AGAAGAGATA AAGGCCACTG TGCACGAGAA GGCAGATGAG
GAAAATAAGG AAGTAGAGGC GAGGAGCACA AATAAGGACG AAGATACTGT CATGTTGTAA
TGTGGTTGCT TTGTATAATG TGAGCAATAG TTACCTATGA TAAGAGGCAA CCTGTTTAGC
ATCTCTCTCA TCCTTGAATG GTTTATCTCA TGGTCAATCA AGCTCCGTTC TCATCAGAGA
CCCACGGGCT TTTGGTCGAA TATGTAATTT AATTTACCAA CATAGCTCAT CGACATTTTA
ACAATATGGC TCGTTTATTT GTTTCGAACT GTACATTACT TGTACATGTC AGCCGTCGGG
CAATTTGAGA AAGACCGCCT ACCGATCAGC AAGAAGGACT GATTCGCTCT CTCCAACTCC
AATTGCCCTT AGGTGAGTAT AGTTGAGCGA TAGAGCGGCT GACAGTACAA CAGGTTAGTT
AAACGTATGA AGTGATAGTA TTGCGATCTC TTATGCATAC TCATGATCGT ATCGGTTTCT
TTGCTTATAC AAT
 
Protein sequence
MSIHDSTFSA LKDMGIDPML AREAASRFHS VEPAVNWCFG DGANWQPTTV AESMYGGFNP 
RHREGTSVEH REVIEVFDSP SGSPKPRHSH IPLGSNNPFL PTQPRSPHPA SHPLPQPPLP
PRSPSVRLGT QGLPVEVDDG NDSEGEELRK AIALSQNVEH SVSEAAPLQD AMDVDGKRDS
RERSERASGA PPPSPKSDMP GTIGVTFGPS TKDDVDGKLA LVPTSQNNNT TSNEDADLDK
AIQESLMTAS FHSASAEANK ERPQPTKRGD GVPLVFYSPS GRYTYVAQIL QAFAAVPRIQ
TAFQENVDSA SSGDDILTER TQALSDSFGR MIDEPFSFYE IDDTAKIITE GSERNQLPPL
LPTMDFQLIF IDLFERAAIP RLQGMAKDED ELNRLREECR RVFETKVTGH TPDSITSASY
VTFVRQPTGN DVYTQLADVF WGPEAQTTSI KKLGDVLTVY LDWKPSSARE VWKLDEEITL
DRFLQSNAAW AANRRGLQAV SAGNARRIKE KIESLTLHDN VDITRAVSRE GMKGKIETIL
TALKTKVADL EEELKEQEKA ASAEAFETDD PQYNQHVFNL RAILFHDGAM VGQKHLYMYV
RGKDDRWWKI QEHEATEVEW EAIIGDRTGI WMEGGPYMLL YARQGDRPSS PESSPTRPSD
VDVPPHSNSI ILESNNTLTP TLPPDYSEST STSASNGSKS QSSDTNEKNG EGLLLMDMET
PEAETEWVED AGSEANKDLM DIVMGEGGSP KKEEEIKATV HEKADEENKE VEARSTNKDE
DTVML