Gene CNM01040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM01040 
Symbol 
ID3255217 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp311991 
End bp315142 
Gene Length3152 bp 
Protein Length882 aa 
Translation table 
GC content49% 
IMG OID638254255 
Producthypothetical protein 
Protein accessionXP_568305 
Protein GI58261790 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.149993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGGGC CAACCCCACA CGACGAAATA CTAGCGGACC AGAGCCTTGC GCCTCTGAAA 
CGTAACCACG CATGTCGGCA ATGTAAAAAA CGCAAAACAA AATGTGACGG TGCCCATCCC
GTGTGTTCAC CATGTCTTCG ATCGCATGCG CATGCCGCTC GTTCTGCGAA TCGGAATGGA
ACAAGTGTGC CTGTGCTTGT TTGTACTTGG GCCGACGGCG AAGGTGGGGA GAATGGGAGT
CCTCCTATTC AGGAACCCAT GCAACGCCTT TCTAGGCCTG GGTCTGCATC TGGAGTGAAA
AGACCAGCAG TCTCTCAAGG TTCAAGACCA ACTCGCGACG AAGAGAATGA GATTCTCAGA
CGGAGGATCG GTGAGTAGCT GAAAAGGACT TTTGTGTGGT CGACATTAAC TTTTACGTTA
CTAGCCGATC TTGAAGCCAA ACTCGTCAAT CTATCATCCG CCACCAGACA ATCCGGGCCT
GAACCCATTG GACCAGAGCT CTCGATGCCA GAAAACACCC TTTCACCACC ACGCTCCGAC
AGTATCGTAG ATAATTGGAT CACAGAAGTC GGACATATTA TCAGATATGA TCTCAATCAT
TTCGGGACCT TTGGGAATAA GGAGGCGCCT TCGAGCAAAT CTTCTCAGCC CTCCGTTTCT
TCTTTGAAGG CCCCGCCTGG GGGTAATGGG GGTGGTATCG GAACTACCCA GACACGGGAG
AGCAGCGGTT CCGGCACTCG CCTCAATATC GGCACTATCC CTAAAGATGG GTTCTCCAGC
AGCTTTGGAT TAGACGACCT TTTCGTCATA CCTGCTGATT GGCCACGCGG TCTGCCATCA
CCTTGTCAGT CAAACATTCT TTTTTCGAGA CCGTATGTCG CTAATACACT TCCGTAGTCC
TCCTAGAACA CCTCGTCGAG ACCTTCTTCA ACCACGTCCC TCAAACTCCC CGAATGCTCC
ATCGCCCAAC TCTTCTCACT CGTATCAAAC TATCCCCCAC CTCTGGCAAC TTCCCCTTCC
CCGGTCTGCT CCATGCCATC TGTGCAACTG CTTCCAGCCA TACCGCATGG GTGAATAATC
TCTCTCCTCA TCAGATCGAA GCTGCTGTGC AGAGGCATGT CATCACCGGT ATGGATTTGA
CTAGTATCGA AGATTTCGGG TTGGCTCAAG CCGAAATGGC AAATAGGTCA GTTGACCTTG
TTGCTTCTGC TTGTGTGATG GGTGGAGGGG ACCTGATCTT CCAAGTCACT CAAACCTGCG
TGAGTTCATC TCCACCAATT AACTGAAGGG TTTTTACTCA TAATATCTAT CTGTAGATCC
TTCTTAGTGA TATTTACTTT TGCAAAGGTT TCCCTCTCAA AGGATGGTTG CTTGGTGGCC
AACCGGCACG ACTTATCAAC ACTCTCCAGC TCAGTGACCG CAACCCGCGG AAGTCATACA
AGGAGCCTCT CTTGCCGCGT CCTCGAAATT CAAGAGAGCG CGAGGAACGA TTGGCTACTC
TGTGGATGGC GTTTATCAAT GATTCTGGCT TGGCTTGCAA CAGCACTTGG GTGCCTAGTA
TGTCTCTTGC CGATATAAAA TGCAATTTAC CTACTACCGG TCAGGAATGG TCAAAGGTAC
GCACGTTTCG ATTTGCACAA TGACTTTTTA ATGCTGAGCA CAAAATACAG CTGGACGATA
TGCTGGAAAA TCCCCAGAGT CCAGAGTCGG GAGATCTCTT CACATCGTAA ATATCTTGTG
ATCCTGTTGC CGACTAGCTT ACTGAGGTTT CATCTTTTAT CTAGTCATCC CATGGAAGAC
GCATTCGTGT TGGTTATCAA GTCTACTATT CTGCTGAGCG AAGTTGCACA GTAAGCCCTG
GCCGATGAAG TTGTCCCACA AGTCGGATGA TGACTAATTA CTTGTCTAGA TGGCTTCGCA
ACTGGTCTCA ACGAACACAG GTCCCAGGAG ACGAACTGGC AGGACCTGAA ACGGAATCAT
TCAAGACTGT TGTCCGACAT ATTGAGGACT TCATGTGAGT GTACCATTAT ATCCATGAGT
CTAGGTCTAA TTTTCCATAC TTCCCAGTTC AACCATCCCT AATGCGCTGA AGAACGTATT
TAAACTTGTG GACTCTGCCA ACTCTGGCGG CCTCAACGTT AATCTTCTTT CACTGCATAT
TTTCCCCAAC GTCGCGCTCG CTCTCATGTT CGAACCTTTC ATTGAATGGA AGCCGTCGAA
TCAGTGTCTG AAAGCTACGC AACAAGCGTA TGAAGCCATC CTCGGTGTCC TGCACCTTAT
CCCGAGTAAC TTGGATGTCA CTATGGTCTT TACTCCCCTG ATCGCTTGGT GAGACAGTTA
GATATTACAA AGAGGTATAA TGTACTGACA GACGATGAAT AGCTCTTTAT ATACTGTTGG
ACGAATCATA GCGGATTATG TCAAGTATAC CATGAGGTCT CATCAATACA GTCTGGCCGT
CCGTTACCGT GCCGATCTCA CCACTATCCA GAACCGTGAG TAGATACTCT GTCCGACGCA
AACATAGTAT TCAGCAAACT TATCGCGACC TTGCAGTGCT CGAACGGTAT GGCCAACGCC
ACTCTCTCGG TAGCGCCATG TCCCATTTCC TCGAGAACTA TGTCCAGTAT CTCGGAAACG
AGTGCATGGA CCCTGCAGCG ATGTGCAGCA AACTCGAACG TCAATTGGCT TACCCTACCA
ATAACGGTAC TTATGTCATG GGCGCGGCCA ACGATGGTTA TGCTCATCTT AACGACCCCG
GAAGCTCTTG TTCGGGACCC AACGACCCTC CTTCAACCAA ATCATTCTCT GATTCATGGT
CAGCCAGCGG TCCTAGTCCA AGCGTCTCTA CGCCTGCTAT ATCAAAAACG AACGAACCCT
CTCCTGCACA AGTATATTCC ACGGCTGAGG GGAAAAGTCA AGATCCAATA TCGAATTGGG
ATTGGGGAAG AGAAGCCATC AAGATGATGG GTGTGGATGC GAGTAGATCT GTTTCGGGGC
TAGCGGCGTT GGATGGTATG CCAATGTATA TGGGCGAGAG GTCGAGCTTG AATATGGATG
GTCGGTTACC GGTTGGACCG TTTTCGGATG TAGCAGAGAT TGGGGGATTT GAGGGCTTGC
ATTGGAAGAC GGGCAATAGT GATACCATTT AG
 
Protein sequence
MNGPTPHDEI LADQSLAPLK RNHACRQCKK RKTKCDGAHP VCSPCLRSHA HAARSANRNG 
TSVPVLVCTW ADGEGGENGS PPIQEPMQRL SRPGSASGVK RPAVSQGSRP TRDEENEILR
RRIADLEAKL VNLSSATRQS GPEPIGPELS MPENTLSPPR SDSIVDNWIT EVGHIIRYDL
NHFGTFGNKE APSSKSSQPS VSSLKAPPGG NGGGIGTTQT RESSGSGTRL NIGTIPKDGF
SSSFGLDDLF VIPADWPRGL PSPFLLEHLV ETFFNHVPQT PRMLHRPTLL TRIKLSPTSG
NFPFPGLLHA ICATASSHTA WVNNLSPHQI EAAVQRHVIT GMDLTSIEDF GLAQAEMANR
SVDLVASACV MGGGDLIFQV TQTCILLSDI YFCKGFPLKG WLLGGQPARL INTLQLSDRN
PRKSYKEPLL PRPRNSRERE ERLATLWMAF INDSGLACNS TWVPSMSLAD IKCNLPTTGQ
EWSKLDDMLE NPQSPESGDL FTSHPMEDAF VLVIKSTILL SEVAQWLRNW SQRTQVPGDE
LAGPETESFK TVVRHIEDFI STIPNALKNV FKLVDSANSG GLNVNLLSLH IFPNVALALM
FEPFIEWKPS NQCLKATQQA YEAILGVLHL IPSNLDVTMV FTPLIACSLY TVGRIIADYV
KYTMRSHQYS LAVRYRADLT TIQNLLERYG QRHSLGSAMS HFLENYVQYL GNECMDPAAM
CSKLERQLAY PTNNGTYVMG AANDGYAHLN DPGSSCSGPN DPPSTKSFSD SWSASGPSPS
VSTPAISKTN EPSPAQVYST AEGKSQDPIS NWDWGREAIK MMGVDASRSV SGLAALDGMP
MYMGERSSLN MDGRLPVGPF SDVAEIGGFE GLHWKTGNSD TI