Gene CNM00420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM00420 
Symbol 
ID3255284 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp104373 
End bp106098 
Gene Length1726 bp 
Protein Length492 aa 
Translation table 
GC content46% 
IMG OID638254201 
Productexpressed protein 
Protein accessionXP_568361 
Protein GI58261902 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAAGAGTAAT ACAAGGCCAA GTGAATCTCG TCGCACCATA CCATCATGCC TCACCTCAGC 
AAGAAACAGG AAGCTTTGCA TGAAATCGAC AATCGTATCT CTAGTTTGTT ACCTTACGTA
TCCAACTCCC TCATTCGACA CCAAACCGGA CTTCTTCTTT CCTTACGCAA AATACACGAA
AACAGCCGTT ACTATCAACG GATTCCTCAT AGTTCTCATT ATGAGAACCA CGATTATCTT
TTTCTGAGAT CCTCCCTTCT ACCATCTACC GATGACAAGA TAATCAAGAC CCTACGTATG
AATCGCGCAG AGTTCACGAG CCTTGTCGCT TTGTTCGGAG GTCACGAAAT TTTCAAGTCG
CAGGGGAGGA AGCCTCAGGC ACCGCCGGAG GTGCAGCTGG CGACATGCAT CTACAGAATG
GCAGGAGGCG AGAGAATGAG CACCGTGGAG AATCATTTTA ACCTGTCTCG TAAGTTGAGG
GGGCCTTCAA TCGTTATTCT TCGAGTTCAC ATTCAGGATT GATACGCATA CTCGGATGAC
AGATGGCAGC GTATCCCTCT ATACGGACCG ATCGTTAATA GCCATCGTAT CCTCTTTGAA
GCAGTACGTT TTTTGGCCCT CTGAGGCAGA ACGCGGTGTC CTTGCCCGTG AATTGTACGC
GCAATATGGA ATACCGTCAT GTATCGGCTT CATCGACGGA ACCGATATTG TCTTGCACCA
AGCACCATCA ATTGGACGCG AAAAGGCACA TACTATGCAC AGCTACAAGG AAAGATATGG
TTATAAAATG ATCGCTGTGG TCGATCATCT GAAGAGATTT AGATACGCCT GGTTCGGCTT
CTCTGCAGCT ACCAACGATC AGATGGCCCA AGATCTCTCA GATCTACACA GGAATCCCCA
TCGCTTCTTC TCTCCAAAGG AATATGTGTT GGGTGATGCT GGTATGAAAT CATCAGACAC
AGTTATACCT CTATTCAAGC GAGAGAGAAG GATGCAGGTC ACAGTCGGAC CCAAGGTGCG
TCTTTTCCCA CCCAAATTTC TACTGACTCA TCTCATGCCA GGCCTACTTT AACCACAAAT
GTGCCAAAGC TCGAGTGATG ATTGAACAGG CATTCGGTAT CCTAAAAAAC CGATGGCAAA
TTTTACAAGA TTGTCGTCTT ACTTGCCGGA CAGTAACGGA TGAGGCTCGC CTTTACCTAG
TCATCCAAGC CTGCATGGTT CTTCATAACC TGTTGGTTGA GACTTGGAAG GATTCGCTGA
CTACATGTGA AGTGGAAGGG GTGATGAATA TCAATGAAAG TGTCATTAAC ATTGGTGATG
AAGCAGGAAA TAGGAGGAGG GATGAAATTG TTGCGGAGAT GATCAGAGAG GAAGTTCGTA
GAGATCCTGC TTTTGACGTT AATGCATATG AAATGTGAAC TGCCGAAGAC GAGACATGCA
TGCACGAAGA CACAAGATAT CATTCGTCTT CGCCGAAGGC CCGACTTCGT CGAAGCACGG
TAAGCGCTTC TTGCAAGGAG ATCTTCATTT GAGATGCGAT CCTTGCAGCG GCTTCCACTT
CCTCAATTTG AATTTTTCGC TCATAATACT CTTTCTTAGA CTGGCTAAGG TCTTTTTGTT
CTTCAAAATA CCTAGCCACC CTTTCCCAGC GATCATCCTC CTTCTCCGCT CGCGCAATTT
CCCTCATTTC TTTATTTATG CTGCATTGCG CTGCCTCGCG ACTTAA
 
Protein sequence
MPHLSKKQEA LHEIDNRISS LLPYVSNSLI RHQTGLLLSL RKIHENSRYY QRIPHSSHYE 
NHDYLFLRSS LLPSTDDKII KTLRMNRAEF TSLVALFGGH EIFKSQGRKP QAPPEVQLAT
CIYRMAGGER MSTVENHFNL SHGSVSLYTD RSLIAIVSSL KQYVFWPSEA ERGVLARELY
AQYGIPSCIG FIDGTDIVLH QAPSIGREKA HTMHSYKERY GYKMIAVVDH LKRFRYAWFG
FSAATNDQMA QDLSDLHRNP HRFFSPKEYV LGDAGMKSSD TVIPLFKRER RMQVTVGPKA
YFNHKCAKAR VMIEQAFGIL KNRWQILQDC RLTCRTVTDE ARLYLVIQAC MVLHNLLVET
WKDSLTTCEV EGVMNINESV INIGDEAGNR RRDEIVAEMI REEISFVFAE GPTSSKHGKR
FLQGDLHLRC DPCSGFHFLN LNFSLIILFL RLAKVFLFFK IPSHPFPAII LLLRSRNFPH
FFIYAALRCL AT