Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA04430 |
Symbol | |
ID | 3253342 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1192469 |
End bp | 1197268 |
Gene Length | 4800 bp |
Protein Length | 1217 aa |
Translation table | |
GC content | 47% |
IMG OID | 638252763 |
Product | U2 snRNA binding protein, putative |
Protein accession | XP_566804 |
Protein GI | 58258783 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACATTCTTC ACGTTCTTCA TATTCCTCAC AACCTCCCCA ATGCACCTCC TCAACCTCAC CCTCTCTTCC CCGACAAATG TCTCCACGGC TGTCGTAGGC AGCTTTTCCG GGTCAAAGAG CCAGGAGATC CTGTGCGTCA GAGGAGGCAC GAAGTTGGAG ATTTTCAAGT TGAACGCCAC GACCGGGCAG TGTAAGCAGA TAATCTCACG CCCATGGCCA AGCTGATGTG CATGTAGTGG ATACTATTGT CTCTACAGAG GTATGGCTTG GAAATCTGGA CGGCTGTGAG AAGTTATGCT AACGACTGAT GATGTTACAG GCGTTTGGAA CGATTAGAAA TATTGCAGGA TTCAGATTAG CCGGTATGAC AAAAGGTAAG CTGCTTTATG AGGCTCTAAT AGACGTACGG ATTCATGCTA ACTATCTTGG CGTAAATTAG ACTACATCTT GGCGACATCG GACTCCGGCA GACTATCAAT CCTCGAGTTT GTCATCTCGC CCACACCACA TTTTGAGAGC CTGTATCAGG AGGTTTTTGG GAAGAGTGGT AGCAGGTGAG TGAGGCATGA ATTGTACAGG TCTGGAGGAA ATCGAAATGT GTTGGGGACG AATGGAAATG GAGAAAATAT CTGCGAGTAG GCCAGAGAGA AGGATGCTAA GTTGGAATAT TTAGGCGTAT CGTCCCTGGC CAATTCTTGG CCGTTGACCC CAAAGGTAGA AGTTGTCTTG TTGGATCGTG AGTTTCGTTT CACTTTATGT CGTTTTTACA CGGCTAACTC CTTTTTCTAG CTTGGAAAAG TAAGTGTAGA GACAAATTTG ATGGAGAAAA ATTGCCATTA ACAGCGCAAT TGCTGTTAGG ACAAAGCTGG TGTATGTGTT GAACAGAAAT ACCGAGGGAA AGCTCTATCC ATCTTCTCCT CTTGAGGCCC ATAAGAATCA TACTCTCGTA ACCCACATAG TCGGCGTTGA CCAGGTGGGT TTTGGTCGGG GTATGCGAAA GCTCGAAAAA GCTAAAATTA TCTAGGGATA TGACAACCCT TTGTATGCTG CGTTAGAAAC TGACTACTCT GAATCGGATC AAGACTCTAC CGGAGAGGCG TATGAAAACA CTCAGAAGGT AAGTTAAAAT TAGAAGTTGT CCAAGGTAGT ACTAAATCTA TACGCATAGC ACTTGACGTT CTACGAGTTG GATCTTGGAT TGAACCACGT TGTGCGAAAA TGGAGTGAAC CCACAGACAG ACGGGCAAAT CTCCTGGTGC AAGGTGAGAC CATACCTGTT TGGCTGAGCA TTGTCGTTGC TGACAGCCCT CAAAGTTCCC GGCGGCCAAA ACGCCAACTC TGACAGATTT GAAGGCCCTT CAGGTGTTCT TGTCTGCACA GAGGACCACA TCATCTGGAA GCACATGGAT GTGGAGGCCC ACAGAATACC TATTCCCAGA CGGCGAAACC CTCTTGTGCA AAGAGGTGAC AAGAGTCGAG GTTTAATCAT CGTTTCAGCG GTCATGCACA AAATAAAGGT ATACTTCTTT GAAGCCTAAA TGGAAACAAA AACTGGGACT AACAAAGAAT AGGGTGCCTT TTTCTTCTTG CTCCAGTCTG AAGATGGTGA CCTGTACAAG GTTTGGATTG AACACAACGG TGAAGACGTT GTTGCACTCA AGATCAAGTA CTTTGACACT GTCCCTGTGG CAAACAGTCT TTGTATCTTG AAGAGAGGTT ACATCTACGT GGCCAGCGAG TTCAGTGACC AGTAAGTACG CCCGAGTCTA TCCAGTTATC CTGACGATGC CGCAGGAATT TGTACCAATT CCAAAGTCTT GCGGAAGATG ATGGCGAGCA AGAGTGGTCA TCTACCGATT ATCCAGAGAA TGGTAACATT GATGGACCAC TTCCCTTTGC CTTCTTTGAC CCGCAACCTC TTCGTAATCT TCTCCTTGTT GATACTGTTC CTTCTCTGGA CCCCATAACC GATGCTCATG TCGTCAACCT TCTCGGTGCC AGTTCTGACA CTCCCCAAAT ATACGCAGCT TGTGGACGTG GTGCCAGAAG TACTTTTAGA ACGTTGAAGC ACGGATTGGA TGTTGCGGAG ATGGTTAGCT CTCCATTGCC CGGTGTGCCT ACCAATGTCT GGACATTGAA ATTGACAGAA GATGGTAAAT TGGCCTACAA AAAGTAGTCA TGACATACTG ACATCAGTCC AGATGAGTAC GATTCCTATA TAGTCCTGTC ATTCCCCAAC GGTACTTTAG TTCTTTCTAT CGGTGAAACG ATTGAAGAAG TCAACGACAC TGGGTTCCTT TCTTCAGGCC CTACTCTTGC TGTTCAGCAA CTCGGTAACG CCGGTCTTCT ACAAGTTCAC CCGTACGGTC TTCGACACAT CCGAGCCGCC GATCGAGTAG ATGAATGGCC CGCTCCTCCC GGACAAACCA TTGTTGCTGC TACCACCAAC CGGCGGCAGG TCGTCATTGC GTTGAGTACG GCCGAGTTAG TTTACTTTGA GCTTGACCCT GAAGGAAGCT TGAGCGAGTA CCAAGAGAAA AAGGCGTTGC CCGGTAATGC CACTTGCGTG ACTATTGCTG AGGTGCCTGA GGGGAGGAGA AGGACATCAT TCTTAGCCGT TGGTTGCGAC AATCAAACAG TGTCCATCAT CTCTTTGGAA CCCGATAGCA CTCTAGATAC TTTGAGTCTT CAGGTAGGTC TAGTCAGATC ATCTCATAGA CATGACTGAA CGTTTTATAT TCAGGCCCTC ACTGCTCCGC CCACTTCGAT CTGTCTCGCG GAGATCTTTG ACACCAGTAT TGACAAGAAC CGTGCTACTA TGTTTTTGAA CATTGGTCTC ATGAACGGCG TTCTCCTTCG TACCGTTGTC GACCCTGTTG ACGGATCTCT CTCTGACACT CGACTTCGAT TTCTCGGTGC CAAGCCGCCC AAACTTGTCC GTGCGAATGT TCAGGGCCAG CCTAGTGTCA TGGCGTTCTC CAGCAGAACT TGGTTGCTCT ACACGTATCA AGATATGCTA CAGACCCAGC CACTTATCTA CGATACTTTG GAATACGCTT GGTCACTCTC AGCGGCTATG TGTCCTGATG GATTAATCGG TATCTCGGGT AATACCTTGA GGTACGTTCT TCAGTTTGAG TCACTTAAAG TCGAGGTTGA CATGTAGTAC AGAATTTTCA ACATCCCCAA GCTAGGTGAA AAGCTCAAGC AAGACTCCAC CGCCTTGACG TATACACCTC GCAAGTTCAT TAGCCATCCC TTTAATTCTG TCTTTTACAT GATCGAAGCG GATCACCGAA CATACTCAAA GAGTGCCATT GAAAGGATTG TCAAGCAGAA GGAGTCGGAA GGTAGAAGGG TTGATACCTT ATTGTTAGAC CTCCCCGCCA ATGAGTTTGG CCGGCCTAGA GCCCCTGCTG GTCACTGGGC GTCTTGTGTA CGAGTTTTGG ATCCCCTTGC TGTAAGTATT TTTGATGGCT TATGAACTAT ACTGATAAAA TGTTGTTAGA ACGAAACCAT TATGACTCTT GACCTCGACG AAGATGAAGC TGCATTTTCG ATTGCTATTG CCTATTTTGA ACGTGGTGGC GGCGAGCCGT TCCTCGTGGT TGGTACTGGT GTAAAGACGA CGTTGCAGCC CAAAGGATGT AAAGAGGGAT ATTTGAGAGT ATATGCGATT AAGGAACAAG GCAGAATCCT TGAGTTTTTG CACAAGGTCA GTGGAATCAG GCTGCATAGC ATAGTCATGG CTGATCCCTT TGCCTTTCTC TAGACCAAGA CCGATGACAT ACCGCTTTGC TTGGCTGGCT TTCAAGGCTT CCTATTGGCA GGTATCGGCA AGTCTCTGAG ATTGTATGAA ATGGGTAAAA AGGCGTTGCT GAGAAAATGC GAAAACAATG TAAGATTACG ATCCTACTCC AGCGGACGAC AACTGATATC TGGCAGGGAT TCCCCACGGC TGTTGTTACC ATCAACGTCC AAGGAGCCCG AATAATCGTC GGTGACATGC AAGAATCAAC TTTCTACTGT GTTTATCGCT CCATTCCCAC CCGACAGCTC CTCATTTTCG CCGACGATTC CCAACCTCGC TGGATCACTT GTGTCACGAG CGTTGATTAT GAGACCGTTG CATGTGGGGA CAAATTCGGA AATATCTTCA TCAATAGACT GGACCCTAGT ATATCAGAGA AGGTGGATGA CGACCCTACG GGTGCTACAA TCTTGCACGA GAAGAGCTTC TTGATGGGTG CGGCACATAA GACAGAGATG ATAGGGCATT ATAATATTGG AAGTGTCGTC ACTTCGTGAG TATGACTTTA CACTATGTAA TTCATGAGCT AATTTATGTT GTAGTATAAC AAAAATCCCA CTGGTAGCTG GTGGACGAGA TGTGTTGGTT TATACCACCA TCTCAGGCGC TGTGGGTGCC CTTGTTCCCT TTGTGTCTTC GGATGATATC GAATTCATGT CCACTCTGGA AATGGTAAGC TAACCTGTCA AAGTGTTGTT TGCATCATTG CTGACTGATG TACGATAGCA CATGCGAACA CAAGACATTT CTCTTGTAGG CCGAGACCAC ATTGCTTACA GGGGTTACTA CGTTCCCATC AAAGGTGTTG TTGATGGGGA CCTGTGTGAG AGCTTTAGTC TCTTGCCGTA TCCTAAGCAA CAAGCGATCG CTTTAGATTT GGATAGGAGC GTGGGTGATG TTTTGAAGAA GCTTGAGCAA ATGAGGACGA GCAGCGCATT CTAAAGGGAT ATATGGAAAG TATTTGCTGT AGGTGTTATG GTAGAATCAA
|
Protein sequence | MHLLNLTLSS PTNVSTAVVG SFSGSKSQEI LCVRGGTKLE IFKLNATTGQ LDTIVSTEAF GTIRNIAGFR LAGMTKDYIL ATSDSGRLSI LEFVISPTPH FESLYQEVFG KSGSRRIVPG QFLAVDPKGR SCLVGSLEKT KLVYVLNRNT EGKLYPSSPL EAHKNHTLVT HIVGVDQGYD NPLYAALETD YSESDQDSTG EAYENTQKHL TFYELDLGLN HVVRKWSEPT DRRANLLVQV PGGQNANSDR FEGPSGVLVC TEDHIIWKHM DVEAHRIPIP RRRNPLVQRG DKSRGLIIVS AVMHKIKGAF FFLLQSEDGD LYKVWIEHNG EDVVALKIKY FDTVPVANSL CILKRGYIYV ASEFSDQNLY QFQSLAEDDG EQEWSSTDYP ENGNIDGPLP FAFFDPQPLR NLLLVDTVPS LDPITDAHVV NLLGASSDTP QIYAACGRGA RSTFRTLKHG LDVAEMVSSP LPGVPTNVWT LKLTEDDEYD SYIVLSFPNG TLVLSIGETI EEVNDTGFLS SGPTLAVQQL GNAGLLQVHP YGLRHIRAAD RVDEWPAPPG QTIVAATTNR RQVVIALSTA ELVYFELDPE GSLSEYQEKK ALPGNATCVT IAEVPEGRRR TSFLAVGCDN QTVSIISLEP DSTLDTLSLQ ALTAPPTSIC LAEIFDTSID KNRATMFLNI GLMNGVLLRT VVDPVDGSLS DTRLRFLGAK PPKLVRANVQ GQPSVMAFSS RTWLLYTYQD MLQTQPLIYD TLEYAWSLSA AMCPDGLIGI SGNTLRIFNI PKLGEKLKQD STALTYTPRK FISHPFNSVF YMIEADHRTY SKSAIERIVK QKESEGRRVD TLLLDLPANE FGRPRAPAGH WASCVRVLDP LANETIMTLD LDEDEAAFSI AIAYFERGGG EPFLVVGTGV KTTLQPKGCK EGYLRVYAIK EQGRILEFLH KTKTDDIPLC LAGFQGFLLA GIGKSLRLYE MGKKALLRKC ENNGFPTAVV TINVQGARII VGDMQESTFY CVYRSIPTRQ LLIFADDSQP RWITCVTSVD YETVACGDKF GNIFINRLDP SISEKVDDDP TGATILHEKS FLMGAAHKTE MIGHYNIGSV VTSITKIPLV AGGRDVLVYT TISGAVGALV PFVSSDDIEF MSTLEMHMRT QDISLVGRDH IAYRGYYVPI KGVVDGDLCE SFSLLPYPKQ QAIALDLDRS VGDVLKKLEQ MRTSSAF
|
| |