Gene CNA02420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA02420 
Symbol 
ID3253529 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp632814 
End bp635958 
Gene Length3145 bp 
Protein Length786 aa 
Translation table 
GC content48% 
IMG OID638252574 
Productrsec15, putative 
Protein accessionXP_566633 
Protein GI58258441 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000851251 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTACTCCCCC AACACCACAC AAGGCGCAAA ATGATACGCA GACAGAGGCC CACATTCACA 
ACGTCAGAGC TCGAACTGCA GCTCCAGCAG GTACGTGTCT TCCCGCTCTC AGTCAACTCA
CGCCCCAAAA GATCAACCTC GATCCCACAT CATCCACAAC AGAAAATCTC GAAGCTCTGG
CCCCTCTTAT CAAGTCAATA CAAGATACGG ACTCGGAGCA GCTGTACCTC AGAAGCCTGG
ACAATTTTGT GGAAGAGAAA GAAAGGGAGA TTGAGGAAAT ATGTCAAGAG AATTATGAAG
TCTGTCGCGT GAACAGTGCA TTCGCCAAGT GCTGATGATT GGCAATCAAC TCTAGGACTT
TGTCTCTTCC GTATCTACCC TTCTCACAAT CAGGCAAGGG ACTGTGCATC TGAGGCGGCG
AATTGGTGAG CTTGATGGCC AAATGGGCGA TGTAGGAAGG GCGTTGGGCG AAAAGGTAGG
TCGTCACGCC GCTGGAATCT ACTGACATAT TTTCCAAACA GAAACGCGCT CTATTGGAAC
AGAAAAAGGT AGCGAGAAAT ATGGACGATG CGATTGAAAC TCTCCAGACA TGCCTACGTC
TCCTGGACCT TGTTCACCGC ATCGACGAAA TGGTTCGAGA GGGCAAGTAT TGGGGGGCGT
TACGGGTACG TTTATATCTG GGCTTTGATA AATACGTACT CAAAACCAGC ATACTAGTCA
CTTGAAGATC TACTTCATCT CCCTCCCCCA TCCATCTCAC AAACGCCCTT TTACGCACAC
ATTCTTTCCT CGCTTCCCTC TCTCCGCCTT TCTATCAAAG ATGCTGTCAC GGCCTCGACA
AAAACGTGGT TATTCGATGT GCGTGAAAGT AGTGCAAAAG TGGGCAAGCT TGCATTGGAG
CAAATGGCAC TGAGGACGAG GAAATGGAGA ATGAAGCGAG AAAGGGAAGG TGGGGTTCGG
TTAGCTCGGG TTGGGGGGCC GTTGGAACTG GTGCATAACG AACGAGTCGA ATGTAAGTGT
GCCCCGATGT GCGCGCGCTC TGGCTAACTG TTTTAGTCGA TGCGCTGGAT AATGATGAGA
TCAAAGTGGA TTTCAAGCCC CTTTATCATT GTATACACAT CTACGAAGCC CTGGGCCAGA
AGCCTGAGCT GCAGCGTAGC TATCAAGAAG ATCGAAAGGT ATAATCTATT ACAAGAACAT
TTTATGTTCG CAGCTTACCT TTCATTCAGA CGCAAGCGAC TCTCATTCTG ACTTCTCGAT
TATCCACGAC CCCTTCAACT CTTGTCAACA CCCTTCCTCT TTTGATGCAG GAGCTAGTGG
GCTTTTTTAT CATAGAAGCC CATGTGCTCG ACACGATGCC TGATTTCCGT ACTCAGAGAG
ATGTGGACGA GCTCTGGGAT GAAATGTGCA GGAGGATTGT GGAAGTTATG GGTCAGGGCT
TGAAGGGATG CAGCGAGCCT GGAGTGTTCC TGAGTAGTAA GACTGAAGTG TTGCTATTCG
TGCAAACTTT GGAGGTGAGT TGGTCTGCGT GATGAAATAA AGCTAATGTT TGACAGGTGT
ATGGGTATAA TATAACAGAG CTAAACGGCT TACTCATAAC TCTCTTTGAA CGGTACTCGG
AGCTGCTATT GCGCAAATTC GGCGCGGACT TTGACCAGGT CAGTCGCAGC ATTGCAAGGT
TTCATTTGTC TCATCTGACT CTTGGTGGTG GGCACAGATT GTGTCGGACG ATGACAATCA
GCCGATGATG GTTAATGACC ACGAAGAGTT TGACCAAGTT GCAGGAGTCT GCTGGCTTGC
AAAGGGAGAA GCCGAGTCTC TGGCGATGTA AGTCTTTTCA GAGCGTCATT GTTCTTGCCC
TTTGGGCTGA TTCCGTGGGT CAGGCAAGGA TTCCCTCAAC CGATGCCCTT CTCCCAAACG
TATCCCATGT GCTGTATCAA CATTCGAAAT TTTGTGGATC AGTTCTACCA GTTTACAGAT
GGAGTGGCTC AACAGCATCT CGATGTCGAT GAAGTTCTGC GAAGGGTTAG TTTTTTTTTT
TTCTATAGTA CGCATGACAG ATTTCTAATC CAAATACAGT CTTTGGATGG TTTACTATCC
GACCATGTCA GCAAACAAAT TGCAAAGAAA CTACAGACTA TGCTCAACCT GTCGCAAATT
GCGCAAGTAG TCATCAATCT GGAGCATTTT TGCACTGCTT GCAATGAGCT TGAAAGCGTG
CTGATGAATC TTCGGTGAGT CACGTTTTTC CCAAAACGGT CCAAGTTGCG AGTCTGAATA
GTAGTATCAG CGCATCACAG CGTGGAGGGC CGATCAAGCT TGCATCCGGT GCCTCGTTCA
ATTCCACCTT ATCAGCAGCA CAATCACGTA TCGACAGTAT CATCAACTCT AAACTCGAAT
CCTTCTTTGA GCTTGCAGAG TACAATTGGA TGCCTGTTCG CCCCCAATCG ACGGTGGAAG
AGCCAAGCAC TTATGTGTTT GAAATGATCA CGTTCTTAAC GGCATATGTG GATTCGGTGT
TAATCGGGCT GAAGGAAGAA TTCAAGACGG GAGCGTATCG AAACGCCTTG ATGAGGATCA
ATCGGTGGTT GATGGTATGT TTCCCATTGC CTTGAGATAA TGCGGAGTTA ACAAGAGATA
GGATACATTG ACGGGTCAAG AGGTTGTCAG ATTGAACGAA AGCGCATTGG CGAGTGTACT
GGCCGACGTG TCGTTCATCG AGACTGAGAT CAAACGGCTT GGTAAATCTG AGCTTGATCA
CGTCTTTGAC GAAGTAAAGC ATGTGTGTCA ATCCTTTTCT ACGTCTATAG CTTTCCAGAG
ACTGATTCTT TACCCTCAGA CGATCAACAT TATCCTTTCG GATGCCGTTC AGGCGTATAT
GGAACCTTCG ATTCGTTCCA TGTCGTATCC CTCTGTCAAA CCCCTCAGCC TGGCGATGAT
CTTGGCCAAA CTGAGCAAAG CCGCCTCGTC CCAGGGAGGT CAGGCAAACA TGTTCAAGGC
GGCAAGGAGA AGGGGAGAAG CAGATGAAGT GGCTAGATTG GCGGGTAAGT AGAAGGGCGA
GCAGGGATGT TGTATAGTTT ATTTCTTCAT GACGCGATCT TGATATGTAT CGATTATTAA
AAGTCAAGGA GGTCGATAGG GATGC
 
Protein sequence
MIRRQRPTFT TSELELQLQQ INLDPTSSTT ENLEALAPLI KSIQDTDSEQ LYLRSLDNFV 
EEKEREIEEI CQENYEDFVS SVSTLLTIRQ GTVHLRRRIG ELDGQMGDVG RALGEKKRAL
LEQKKVARNM DDAIETLQTC LRLLDLVHRI DEMVREGKYW GALRSLEDLL HLPPPSISQT
PFYAHILSSL PSLRLSIKDA VTASTKTWLF DVRESSAKVG KLALEQMALR TRKWRMKRER
EGGVRLARVG GPLELVHNER VEFDALDNDE IKVDFKPLYH CIHIYEALGQ KPELQRSYQE
DRKTQATLIL TSRLSTTPST LVNTLPLLMQ ELVGFFIIEA HVLDTMPDFR TQRDVDELWD
EMCRRIVEVM GQGLKGCSEP GVFLSSKTEV LLFVQTLEVY GYNITELNGL LITLFERYSE
LLLRKFGADF DQIVSDDDNQ PMMVNDHEEF DQVAGVCWLA KGEAESLAMQ GFPQPMPFSQ
TYPMCCINIR NFVDQFYQFT DGVAQQHLDV DEVLRRSLDG LLSDHVSKQI AKKLQTMLNL
SQIAQVVINL EHFCTACNEL ESVLMNLRAS QRGGPIKLAS GASFNSTLSA AQSRIDSIIN
SKLESFFELA EYNWMPVRPQ STVEEPSTYV FEMITFLTAY VDSVLIGLKE EFKTGAYRNA
LMRINRWLMD TLTGQEVVRL NESALASVLA DVSFIETEIK RLGKSELDHV FDEVKHTINI
ILSDAVQAYM EPSIRSMSYP SVKPLSLAMI LAKLSKAASS QGGQANMFKA ARRRGEADEV
ARLAGK