Gene CND02010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCND02010 
Symbol 
ID3257103 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006686 
Strand
Start bp540446 
End bp543535 
Gene Length3090 bp 
Protein Length785 aa 
Translation table 
GC content49% 
IMG OID638256135 
Productexpressed protein 
Protein accessionXP_570230 
Protein GI58266148 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGTCA CTTCAACCCA TTCACAAAAT TCAATCAGTC CTGGCGCAGT CCCAGACGCC 
CCAGGAGCAC ATTCGGCGCA ACCGCAAACA GAAGCTGCTG GGGTTGACCT CCGCGTCCGA
TTTGCCCGCG CTTGTGATCG TTGCCGGTGG GTGATATCTA TCGTACACTA TTGCAAACTA
ACGGGGCCGA CAGACACAGG AAAATAAAGG TTAGTAATGT GATATTTAGC AGGCATTAAT
CCCCGCAGTG CGATAATGAA CACCCATGTG GAAAATGTGC GGCAGCTGGT GTTACCTGTA
CTTCGGGAAT GAGCAGAGCG ACTGGTGTAA GCAGACGGCG AGCTTCTCGA ACAAGCTTTT
CGGACACGCG GTGGGTATCG AACATAATTT AGACGTGATC TGACCTTGGA GTCTCAGCAA
CCATCATGAA GCTTTCATTG AGAAAGAAGG GTCACCTTCC AACAGAGAAC ATCGATCTAA
CAACAAAGGA CGCAAACGGA GAAGAGACTC TGAGGAAACA GATAAGATAG GAGCAACTGG
CAACTCCAGT GAGTAAAGAG TCCGAAATTT CCATATTCGC CTGATCTTTC CCTATTACAG
AAACGTCAAA TGTCCTTGAT CACCTCGATA CGGCGCACTT CTTCTTGTCG CAAGAAGGAA
TGACCCGTTT CGCCGGCTCT ACTTCGGGCC TTCCAGTCCT GTGAGGAGGA TCTACCAGTT
AGCTAGCTAT AACAGTCAAT AACATCACCT GTAGCGAAGC CACCCGACGT ATGCTGCAGA
AAATACCTGA TCCCGTCAAC GACGCGCAAA GTCCCATGGG TGAACCAAAT TGGTCATGGC
TCTCTTCATT ACTAGAGGAC GGAGAGGGAT CCTCAAAGAG CGGTAGGGAG TCTTTCCTTC
GTCGTCAAGA ACGAGACGAG CCGGAATACT TTCCTGGAAG GGATCACGCC TCGGAACAAG
GCGGAGAAGA CATTTTCGCA AGGATTTCGG AGATCAGTGA GCCACCATCT GCCTCTGGGA
CTTACAAAGC TAAATTTGGA TAGTACCACC TGATTTGATG GCCTCTCTTG TTCAGACTTA
TGTAGGTAAG AAGCGGTCAA CGTCCCTGAC ACGAATTAGT TCGCAGTTGT TCATCCTGTT
TGGCCTATTA TCCACATCCA GTCATTCTTT GCAGTAAGTA CACGTAGCCA CCTCTTTTCC
ATCAAAGGCC CAAGAATCTG ATAGCCGTGC CTAGGATTTC TATAAGTGGA CCAGCTATTC
CTTTGCGGCG TTAGTGGTAT CTATGTGCAT GTTGGCGACC CGCTACACGA ATGACCCAAG
AGTGCTAGCT GAACCGGGTG AGCTACAGAT GAATTTATAA CAACTAATTT GGCTGACTAA
ACTGGCTCTT GATTTTAGAT ATCTCAGCCA GTGCAGGGTT CCAATATTTT GAGCTTTTCA
GGCAATTGCG TGCACAAGCT TCGACAGAGG ATAACGTGAT TCAAGCAATC CAGAGCCTGT
TTTTCGCGGC ACAATACCAT TGTGTTGATA ATGTACCCCA CCAGGTTGCA CAAGGTCTTT
TTGCAGAGGC AGCAGCTCGA CTGCTCGATG GTGGACTACA TCGGTATGCA TTTGCCCGTT
CGCGAGAAAG AAACGTCAGC TGAATTACTC GATAAGAGAA ATCTCAGATA TTGCGCTTGG
CGATTCGCTG GAGAAGGAAA CGCGAACAGT GGGTCCAGTA GTATGCTCGC AGTTCCAAGA
ACTAATAGAA AGCAGCGTAC TGCATGGGCA TGCTATAGCT GGGATAAACA ACTGGCCGCC
ATTTGTGGTA AACCACCACT CCTACGCATT TGGGACTACG ACGTCTCATT GCCCGAAGTG
TTTGAAGAGA CGCAATCGGC TTTTGCTGAG CTCCCCGAAA ATCAAGATGA GAAGCTGAGC
GCAATCTTCA TCCAGCAAAT CCTGTTGTCA GTGGTACTTG AAAAAACGTT GACCTCGTGC
ACCCATCATC CCGAATTTGA TAACTGCGAA ATGCTCAACA GATGGGCAAG GAGCATGCGC
CCAGAAGTCG AAGATATGAA GGCTCTAGAT GATTCAATGC GGCTGCTGAA TGAATGGCGA
GAGTGCGTTT AACAACTGTA TTTGATGAAG CACACAAGGA TGCTAATGTA ACGCAGGGCC
CTCCCGCCAG CAATGTCAGA CCGCTCGATC GCTGGTCGCC TCGCTTCGCC GATGTACAGC
GTCGAATACG AACAGATTGC TGTAATCGAG CAGACCATAG AAATGCTTAT AGCGGGCCGC
AAACTGCAGC TTGCGACACT TGCCAAGACC CGGGAAAACA CGTCAACGAC TCGTCTGGAG
TCTGCACGTG ATGCTATTCT CGAAGCAGGA AAACACACGC TAGCATCAGC AGTAAAGATG
GGCTCTGCCA AAATGCTCGG CAAATGTGAT ATTCGTGAGT ACATATCATT TCAAATACCA
TGCTGACATT TGTTTAGTCC TGGCATATCG TATATTGATG GCTGGTCGGT TTCTACTAGC
CAGTCTGTTA TCAGCCCGCG CAGATTGCAA AGCCGAGCAG GAAGAAGAGG CGACCCGGGC
AGTGAGAGCA GCGATTGTAC TTCTCCGTCA TTTCTCGGAC GTATTCCCCA TCTCACTTGG
CTCTGCTGAA GTCTTGGAGG AAACATGCCG AGGTTGGTCA CTGCATACCA CTTTACTTGT
CTGACTCTTT TACTGACTCG ATACAATCGC AGTTTGCCGG GTTGACATCT CTTTGCCTAC
AGCGGCAACT CCCGGCCACC CTCGGCACAA CCTGTACGCC TGGCATCGGC CCCTCAGGCT
TCGTGACAAG CATTCCAATG AAAAAGATGG CTGTCGTTCT CAAGTCAGGT CACCAACGAA
TGTCATGTCT GGAGACGCCG CAGCGGACGC CATAGCGTCC ATGTTTTCCC CGGTGGACGC
AGCATTTGGG TTAGGTTCTT TACCTTTACC TTTCCCTGAG ATTGGAACAG AAGGAGAAAG
GCAGCCAGAC TTCTCATGGC TTCCTCCAGG TGGTAGTGTC CCTTACTACT TTCCCTCACA
TCCATCAGAG TAACGTAGCA GTATATACCT
 
Protein sequence
MSVTSTHSQN SISPGAVPDA PGAHSAQPQT EAAGVDLRVR FARACDRCRH RKIKCDNEHP 
CGKCAAAGVT CTSGMSRATG VSRRRASRTS FSDTRNHHEA FIEKEGSPSN REHRSNNKGR
KRRRDSEETD KIGATGNSKT SNVLDHLDTA HFFLSQEGMT RFAGSTSGLP VLEATRRMLQ
KIPDPVNDAQ SPMGEPNWSW LSSLLEDGEG SSKSGRESFL RRQERDEPEY FPGRDHASEQ
GGEDIFARIS EIIPPDLMAS LVQTYFAVVH PVWPIIHIQS FFADFYKWTS YSFAALVVSM
CMLATRYTND PRVLAEPGNC VHKLRQRITL HKVFLQRQQL DCSMVDYIGM HLPVREKETS
AELLDKRNLR YCAWRFAGEG NANSGSSSML AVPRTNRKQR TAWACYSWDK QLAAICGKPP
LLRIWDYDVS LPEVFEETQS AFAELPENQD EKLSAIFIQQ ILLSVVLEKT LTSCTHHPEF
DNCEMLNRWA RSMRPEVEDM KALDDSMRLL NEWREALPPA MSDRSIAGRL ASPMYSVEYE
QIAVIEQTIE MLIAGRKLQL ATLAKTRENT STTRLESARD AILEAGKHTL ASAVKMGSAK
MLGKCDILLA YRILMAGRFL LASLLSARAD CKAEQEEEAT RAVRAAIVLL RHFSDVFPIS
LGSAEVLEET CRVCRVDISL PTAATPGHPR HNLYAWHRPL RLRDKHSNEK DGCRSQVRSP
TNVMSGDAAA DAIASMFSPV DAAFGLGSLP LPFPEIGTEG ERQPDFSWLP PGGSVPYYFP
SHPSE