Gene CNA04430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA04430 
Symbol 
ID3253342 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1192469 
End bp1197268 
Gene Length4800 bp 
Protein Length1217 aa 
Translation table 
GC content47% 
IMG OID638252763 
ProductU2 snRNA binding protein, putative 
Protein accessionXP_566804 
Protein GI58258783 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AACATTCTTC ACGTTCTTCA TATTCCTCAC AACCTCCCCA ATGCACCTCC TCAACCTCAC 
CCTCTCTTCC CCGACAAATG TCTCCACGGC TGTCGTAGGC AGCTTTTCCG GGTCAAAGAG
CCAGGAGATC CTGTGCGTCA GAGGAGGCAC GAAGTTGGAG ATTTTCAAGT TGAACGCCAC
GACCGGGCAG TGTAAGCAGA TAATCTCACG CCCATGGCCA AGCTGATGTG CATGTAGTGG
ATACTATTGT CTCTACAGAG GTATGGCTTG GAAATCTGGA CGGCTGTGAG AAGTTATGCT
AACGACTGAT GATGTTACAG GCGTTTGGAA CGATTAGAAA TATTGCAGGA TTCAGATTAG
CCGGTATGAC AAAAGGTAAG CTGCTTTATG AGGCTCTAAT AGACGTACGG ATTCATGCTA
ACTATCTTGG CGTAAATTAG ACTACATCTT GGCGACATCG GACTCCGGCA GACTATCAAT
CCTCGAGTTT GTCATCTCGC CCACACCACA TTTTGAGAGC CTGTATCAGG AGGTTTTTGG
GAAGAGTGGT AGCAGGTGAG TGAGGCATGA ATTGTACAGG TCTGGAGGAA ATCGAAATGT
GTTGGGGACG AATGGAAATG GAGAAAATAT CTGCGAGTAG GCCAGAGAGA AGGATGCTAA
GTTGGAATAT TTAGGCGTAT CGTCCCTGGC CAATTCTTGG CCGTTGACCC CAAAGGTAGA
AGTTGTCTTG TTGGATCGTG AGTTTCGTTT CACTTTATGT CGTTTTTACA CGGCTAACTC
CTTTTTCTAG CTTGGAAAAG TAAGTGTAGA GACAAATTTG ATGGAGAAAA ATTGCCATTA
ACAGCGCAAT TGCTGTTAGG ACAAAGCTGG TGTATGTGTT GAACAGAAAT ACCGAGGGAA
AGCTCTATCC ATCTTCTCCT CTTGAGGCCC ATAAGAATCA TACTCTCGTA ACCCACATAG
TCGGCGTTGA CCAGGTGGGT TTTGGTCGGG GTATGCGAAA GCTCGAAAAA GCTAAAATTA
TCTAGGGATA TGACAACCCT TTGTATGCTG CGTTAGAAAC TGACTACTCT GAATCGGATC
AAGACTCTAC CGGAGAGGCG TATGAAAACA CTCAGAAGGT AAGTTAAAAT TAGAAGTTGT
CCAAGGTAGT ACTAAATCTA TACGCATAGC ACTTGACGTT CTACGAGTTG GATCTTGGAT
TGAACCACGT TGTGCGAAAA TGGAGTGAAC CCACAGACAG ACGGGCAAAT CTCCTGGTGC
AAGGTGAGAC CATACCTGTT TGGCTGAGCA TTGTCGTTGC TGACAGCCCT CAAAGTTCCC
GGCGGCCAAA ACGCCAACTC TGACAGATTT GAAGGCCCTT CAGGTGTTCT TGTCTGCACA
GAGGACCACA TCATCTGGAA GCACATGGAT GTGGAGGCCC ACAGAATACC TATTCCCAGA
CGGCGAAACC CTCTTGTGCA AAGAGGTGAC AAGAGTCGAG GTTTAATCAT CGTTTCAGCG
GTCATGCACA AAATAAAGGT ATACTTCTTT GAAGCCTAAA TGGAAACAAA AACTGGGACT
AACAAAGAAT AGGGTGCCTT TTTCTTCTTG CTCCAGTCTG AAGATGGTGA CCTGTACAAG
GTTTGGATTG AACACAACGG TGAAGACGTT GTTGCACTCA AGATCAAGTA CTTTGACACT
GTCCCTGTGG CAAACAGTCT TTGTATCTTG AAGAGAGGTT ACATCTACGT GGCCAGCGAG
TTCAGTGACC AGTAAGTACG CCCGAGTCTA TCCAGTTATC CTGACGATGC CGCAGGAATT
TGTACCAATT CCAAAGTCTT GCGGAAGATG ATGGCGAGCA AGAGTGGTCA TCTACCGATT
ATCCAGAGAA TGGTAACATT GATGGACCAC TTCCCTTTGC CTTCTTTGAC CCGCAACCTC
TTCGTAATCT TCTCCTTGTT GATACTGTTC CTTCTCTGGA CCCCATAACC GATGCTCATG
TCGTCAACCT TCTCGGTGCC AGTTCTGACA CTCCCCAAAT ATACGCAGCT TGTGGACGTG
GTGCCAGAAG TACTTTTAGA ACGTTGAAGC ACGGATTGGA TGTTGCGGAG ATGGTTAGCT
CTCCATTGCC CGGTGTGCCT ACCAATGTCT GGACATTGAA ATTGACAGAA GATGGTAAAT
TGGCCTACAA AAAGTAGTCA TGACATACTG ACATCAGTCC AGATGAGTAC GATTCCTATA
TAGTCCTGTC ATTCCCCAAC GGTACTTTAG TTCTTTCTAT CGGTGAAACG ATTGAAGAAG
TCAACGACAC TGGGTTCCTT TCTTCAGGCC CTACTCTTGC TGTTCAGCAA CTCGGTAACG
CCGGTCTTCT ACAAGTTCAC CCGTACGGTC TTCGACACAT CCGAGCCGCC GATCGAGTAG
ATGAATGGCC CGCTCCTCCC GGACAAACCA TTGTTGCTGC TACCACCAAC CGGCGGCAGG
TCGTCATTGC GTTGAGTACG GCCGAGTTAG TTTACTTTGA GCTTGACCCT GAAGGAAGCT
TGAGCGAGTA CCAAGAGAAA AAGGCGTTGC CCGGTAATGC CACTTGCGTG ACTATTGCTG
AGGTGCCTGA GGGGAGGAGA AGGACATCAT TCTTAGCCGT TGGTTGCGAC AATCAAACAG
TGTCCATCAT CTCTTTGGAA CCCGATAGCA CTCTAGATAC TTTGAGTCTT CAGGTAGGTC
TAGTCAGATC ATCTCATAGA CATGACTGAA CGTTTTATAT TCAGGCCCTC ACTGCTCCGC
CCACTTCGAT CTGTCTCGCG GAGATCTTTG ACACCAGTAT TGACAAGAAC CGTGCTACTA
TGTTTTTGAA CATTGGTCTC ATGAACGGCG TTCTCCTTCG TACCGTTGTC GACCCTGTTG
ACGGATCTCT CTCTGACACT CGACTTCGAT TTCTCGGTGC CAAGCCGCCC AAACTTGTCC
GTGCGAATGT TCAGGGCCAG CCTAGTGTCA TGGCGTTCTC CAGCAGAACT TGGTTGCTCT
ACACGTATCA AGATATGCTA CAGACCCAGC CACTTATCTA CGATACTTTG GAATACGCTT
GGTCACTCTC AGCGGCTATG TGTCCTGATG GATTAATCGG TATCTCGGGT AATACCTTGA
GGTACGTTCT TCAGTTTGAG TCACTTAAAG TCGAGGTTGA CATGTAGTAC AGAATTTTCA
ACATCCCCAA GCTAGGTGAA AAGCTCAAGC AAGACTCCAC CGCCTTGACG TATACACCTC
GCAAGTTCAT TAGCCATCCC TTTAATTCTG TCTTTTACAT GATCGAAGCG GATCACCGAA
CATACTCAAA GAGTGCCATT GAAAGGATTG TCAAGCAGAA GGAGTCGGAA GGTAGAAGGG
TTGATACCTT ATTGTTAGAC CTCCCCGCCA ATGAGTTTGG CCGGCCTAGA GCCCCTGCTG
GTCACTGGGC GTCTTGTGTA CGAGTTTTGG ATCCCCTTGC TGTAAGTATT TTTGATGGCT
TATGAACTAT ACTGATAAAA TGTTGTTAGA ACGAAACCAT TATGACTCTT GACCTCGACG
AAGATGAAGC TGCATTTTCG ATTGCTATTG CCTATTTTGA ACGTGGTGGC GGCGAGCCGT
TCCTCGTGGT TGGTACTGGT GTAAAGACGA CGTTGCAGCC CAAAGGATGT AAAGAGGGAT
ATTTGAGAGT ATATGCGATT AAGGAACAAG GCAGAATCCT TGAGTTTTTG CACAAGGTCA
GTGGAATCAG GCTGCATAGC ATAGTCATGG CTGATCCCTT TGCCTTTCTC TAGACCAAGA
CCGATGACAT ACCGCTTTGC TTGGCTGGCT TTCAAGGCTT CCTATTGGCA GGTATCGGCA
AGTCTCTGAG ATTGTATGAA ATGGGTAAAA AGGCGTTGCT GAGAAAATGC GAAAACAATG
TAAGATTACG ATCCTACTCC AGCGGACGAC AACTGATATC TGGCAGGGAT TCCCCACGGC
TGTTGTTACC ATCAACGTCC AAGGAGCCCG AATAATCGTC GGTGACATGC AAGAATCAAC
TTTCTACTGT GTTTATCGCT CCATTCCCAC CCGACAGCTC CTCATTTTCG CCGACGATTC
CCAACCTCGC TGGATCACTT GTGTCACGAG CGTTGATTAT GAGACCGTTG CATGTGGGGA
CAAATTCGGA AATATCTTCA TCAATAGACT GGACCCTAGT ATATCAGAGA AGGTGGATGA
CGACCCTACG GGTGCTACAA TCTTGCACGA GAAGAGCTTC TTGATGGGTG CGGCACATAA
GACAGAGATG ATAGGGCATT ATAATATTGG AAGTGTCGTC ACTTCGTGAG TATGACTTTA
CACTATGTAA TTCATGAGCT AATTTATGTT GTAGTATAAC AAAAATCCCA CTGGTAGCTG
GTGGACGAGA TGTGTTGGTT TATACCACCA TCTCAGGCGC TGTGGGTGCC CTTGTTCCCT
TTGTGTCTTC GGATGATATC GAATTCATGT CCACTCTGGA AATGGTAAGC TAACCTGTCA
AAGTGTTGTT TGCATCATTG CTGACTGATG TACGATAGCA CATGCGAACA CAAGACATTT
CTCTTGTAGG CCGAGACCAC ATTGCTTACA GGGGTTACTA CGTTCCCATC AAAGGTGTTG
TTGATGGGGA CCTGTGTGAG AGCTTTAGTC TCTTGCCGTA TCCTAAGCAA CAAGCGATCG
CTTTAGATTT GGATAGGAGC GTGGGTGATG TTTTGAAGAA GCTTGAGCAA ATGAGGACGA
GCAGCGCATT CTAAAGGGAT ATATGGAAAG TATTTGCTGT AGGTGTTATG GTAGAATCAA
 
Protein sequence
MHLLNLTLSS PTNVSTAVVG SFSGSKSQEI LCVRGGTKLE IFKLNATTGQ LDTIVSTEAF 
GTIRNIAGFR LAGMTKDYIL ATSDSGRLSI LEFVISPTPH FESLYQEVFG KSGSRRIVPG
QFLAVDPKGR SCLVGSLEKT KLVYVLNRNT EGKLYPSSPL EAHKNHTLVT HIVGVDQGYD
NPLYAALETD YSESDQDSTG EAYENTQKHL TFYELDLGLN HVVRKWSEPT DRRANLLVQV
PGGQNANSDR FEGPSGVLVC TEDHIIWKHM DVEAHRIPIP RRRNPLVQRG DKSRGLIIVS
AVMHKIKGAF FFLLQSEDGD LYKVWIEHNG EDVVALKIKY FDTVPVANSL CILKRGYIYV
ASEFSDQNLY QFQSLAEDDG EQEWSSTDYP ENGNIDGPLP FAFFDPQPLR NLLLVDTVPS
LDPITDAHVV NLLGASSDTP QIYAACGRGA RSTFRTLKHG LDVAEMVSSP LPGVPTNVWT
LKLTEDDEYD SYIVLSFPNG TLVLSIGETI EEVNDTGFLS SGPTLAVQQL GNAGLLQVHP
YGLRHIRAAD RVDEWPAPPG QTIVAATTNR RQVVIALSTA ELVYFELDPE GSLSEYQEKK
ALPGNATCVT IAEVPEGRRR TSFLAVGCDN QTVSIISLEP DSTLDTLSLQ ALTAPPTSIC
LAEIFDTSID KNRATMFLNI GLMNGVLLRT VVDPVDGSLS DTRLRFLGAK PPKLVRANVQ
GQPSVMAFSS RTWLLYTYQD MLQTQPLIYD TLEYAWSLSA AMCPDGLIGI SGNTLRIFNI
PKLGEKLKQD STALTYTPRK FISHPFNSVF YMIEADHRTY SKSAIERIVK QKESEGRRVD
TLLLDLPANE FGRPRAPAGH WASCVRVLDP LANETIMTLD LDEDEAAFSI AIAYFERGGG
EPFLVVGTGV KTTLQPKGCK EGYLRVYAIK EQGRILEFLH KTKTDDIPLC LAGFQGFLLA
GIGKSLRLYE MGKKALLRKC ENNGFPTAVV TINVQGARII VGDMQESTFY CVYRSIPTRQ
LLIFADDSQP RWITCVTSVD YETVACGDKF GNIFINRLDP SISEKVDDDP TGATILHEKS
FLMGAAHKTE MIGHYNIGSV VTSITKIPLV AGGRDVLVYT TISGAVGALV PFVSSDDIEF
MSTLEMHMRT QDISLVGRDH IAYRGYYVPI KGVVDGDLCE SFSLLPYPKQ QAIALDLDRS
VGDVLKKLEQ MRTSSAF