Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA00520 |
Symbol | |
ID | 3253520 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 146891 |
End bp | 149713 |
Gene Length | 2823 bp |
Protein Length | 847 aa |
Translation table | |
GC content | 50% |
IMG OID | 638252385 |
Product | peptide-binding protein, putative |
Protein accession | XP_566479 |
Protein GI | 58258133 |
COG category | [A] RNA processing and modification |
COG ID | [COG5104] Splicing factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCGTTCCAC AACTGAAAAC AAGAATGTCC GGTGTTCCTC CTGGACCTCC TCCGGGAAGA CCTCCAGTCG CCAATGTGCC ATACATCCCT CAACCAGGTG CACCTGCTGC CCCGCCGTTC CGTCCACCTG CTGCTCCGGG ATTCGGCTTT CCTCCAGGTC CACCTCCGGG GATGCCCTTC AGTGTCGGAG GGTTCCCACA TCCCCTTCCT CCTGGATGGT CTGAACATAG AGGTAGGTCC TATTGATCTT TATTCCGAGT AAGCGTGGAG GGTGAAGGAA AGTGCTGACG CATGTAAGCG TAGCCCCGGA TGGCATAACG CCTTATTACT ATAACGCTCA AACTCGCGAG TCGACATATA TCCGCCCCAC TTTCCCTCCC TTCCCTCCCG GAACCACTCC ACCCACTGGG TCTCCAGCTC CTGGCGCAGT TACCCCAGGC GGAACTGGAG CCGCTGAGGA AAAGACGAAG AAGAAGAAGA AGGAAAAGCC CAAGGACAAG GTGCCGATCC CCGGTACTGG CTGGATGAGA ATCACGACAA CTGAAGGGAA TGTGTTTTAC TTTGAAAAGG AGAACAAAAG ATCGGAATGG ACTGTTCCAG ATGAGATCAA GGACGCTGTA GCTCAACTGG AGGAGGAGGA GAGGAAGAAA AAGGAGGAAA AAGATAAAGA GGAGCAAGAG AAGATTGAGC AGGAGAGGAT TGAAAAGCTG AAAGAACTGG AGAGGATCAG GGCGGAGATA GATGCTGAGA GGAAGAAGAA GGAAGAGGCA GAAAAGGAAA GAAAGAGGAA ACAGAGGGAG GATGGCGAAG ATGATGCACC GGAGGAGAAG AAGGCCAAGG TGGGAGGCCA AGAGCAAGAG GAGGAGGACG TGGTTGGCCC AGAAGGCGAA GAGGACGAAG AGGCCTGGAT GAAGGCTGTT GCCGCCGAGT TCGCCGAGAA GGATGCTCAA ACCAAAAAGG ATTTGGAAGA GCAAGATGAA AAGACTAAAA AGGAAGAGGC CGAAGCCGCA AAAAAGGTCT TTGCCGTCCC CGAAAAGGTC AACGTTTCTG TCGAAGAAGG ACGTGCATTG TTCAAGGCTT TGCTCATTGA AAAGGACATC TCGCCCTTTG CCCCCTGGGA TCAATCCCTG CCCTTGTTCA TCAACGATCC GCGCTACGTG CTTCTCTCAT CTATGAAAGA CCGACGTGAG GTGTATGAAG AGTACTGCCG TGAAGTTGGT CGAGCCAAAC GCCTCAAGAA GGGTTCTGCG GCGGAAGAGA AGAAGGCAGA GCCAGAGAAG GAATACAAAG CCTTGCTGGA TAAGGAAGTG ACAAGTACTA GGACGAGATG GGATGACTTT AGGAAAAAAT GGAAGAAGGA TAGGAGGTTT TATGCTTTTG GCAGAGATGA TCACCAGAGA GAAAAGGCGT TTAAGCAGCA TTTGAGAGAC CTTGGCGAGC GTGAGTACGG TAATTGTTTG ATCGTTGATA GAAATAACTT TTAGGTCTGC AGGCAAACGA GCTGCAGCTC AGAAAGCCGA AGAAGACTTC AACACTCTTC TCAAAGAATC AACGAACATC ACCTCATCCT CTCAATGGTC TTCTGTCAAA CGCTCAATCT CTTCCGATCC GCGATATGAT GCTGTTGGTT CTTCTTCGCT GCGTGAAGAT CTATTCAACA ATTATATCCG TGCACTCTCT TCAACTTCGA ATCCTGCAAA ATCAAATCCC GAGGAGCAGA GCATAAAAGA AAAGGAAGAG GCGGCCGCTC GCCGGCTTGC AGAGCGCAAA GCTGGTCAAT CGGCTCAGCC TTCTGAATCG AAAGAAGATG CGGCTGCACG TAAACTTGCA GAGCGCAAAG CCAAGTCAGA AGCGAGTCTG CGCGAGCGGG AGGCTAAAGT CAGGGAAGAG AAGGAAAGGG TGGAACGTGA GATGCACAAG AGTAAAATGG GAGCTGGGAG GGAAGAGGCT GAGGCATTGT TTAGGAGTTT ATTAGTGGAT TCTGTGAAGG AACCCAACGT AAGGTTTTTT TTGTCTCCTT TCGCCATTTG TAGATACTGA TAGCGTATAG GTTACGTTGG ACGAGGCGCA ATCTTACCTT TCTTCTGACC CTCGATGGAA CCACCCTTCG TTGAGTGCCC GCGATAAACA ACGCCTCTTC GCTGCCCATT CCGAACGGCT CTTTTCTAAA CGTTCCAATG CACTCCACTC GCTCTTCGAA TCCAAGACGC CCGCTCTCAA CACGTCTTAC GACGACGTTT ACCCACAAAT CATCGATGAT CCGCTTGTCA AGCGTCTCGG CTTGCAAGGG GAAGCGCTCG AAGACCGCTG GAGGTCTTGG CGACGAAAGA AGGAGCATGA TGCGAGGATC GAGTTTGATC AGATGTTGCG TGAGAACAGC TTTGTAGAGT TTTGGTCCAA GATGAGGAAG AAGACGATGG ATGAGAAGGC TTTGGAAGTG CAGGAGCAGG ATGAGTATGA TGAAGGCGAA GGAATGGGAG AAGGTGGAGC GGCGGACTTG ACGCAGTTGG CAAAGCAAAT TGATTTGGGC GAAATCAAGG CTGTATTGAG GGTGCGTTCC TTTGTTTTGG TTCATCTTTG CGTAGTCAAA AACGCTAATG GTTGTGGCTC TCAGAGAGAT AAGAGGTATA CTGTTTTCGA TCACATGCCT GACGAGCGCG AAAAGTGGTT GCGAGTGAGT CCTATTCTGC CCCTTGAAAA GAGTAAAGAG CTGAGTCTTT GGCTTTAGGA CTATCTGGAA AACGTTGAGG CAGCCTCGGG ATCAAAAACT ATCCACAACG TCGGTCTTGA CAGATAAAAA ATTAGAAGAT TGACTTTGCA GTA
|
Protein sequence | MSGVPPGPPP GRPPVANVPY IPQPGAPAAP PFRPPAAPGF GFPPGPPPGM PFSVGGFPHP LPPGWSEHRA PDGITPYYYN AQTRESTYIR PTFPPFPPGT TPPTGSPAPG AVTPGGTGAA EEKTKKKKKE KPKDKVPIPG TGWMRITTTE GNVFYFEKEN KRSEWTVPDE IKDAVAQLEE EERKKKEEKD KEEQEKIEQE RIEKLKELER IRAEIDAERK KKEEAEKERK RKQREDGEDD APEEKKAKVG GQEQEEEDVV GPEGEEDEEA WMKAVAAEFA EKDAQTKKDL EEQDEKTKKE EAEAAKKVFA VPEKVNVSVE EGRALFKALL IEKDISPFAP WDQSLPLFIN DPRYVLLSSM KDRREVYEEY CREVGRAKRL KKGSAAEEKK AEPEKEYKAL LDKEVTSTRT RWDDFRKKWK KDRRFYAFGR DDHQREKAFK QHLRDLGERK RAAAQKAEED FNTLLKESTN ITSSSQWSSV KRSISSDPRY DAVGSSSLRE DLFNNYIRAL SSTSNPAKSN PEEQSIKEKE EAAARRLAER KAGQSAQPSE SKEDAAARKL AERKAKSEAS LREREAKVRE EKERVEREMH KSKMGAGREE AEALFRSLLV DSVKEPNVTL DEAQSYLSSD PRWNHPSLSA RDKQRLFAAH SERLFSKRSN ALHSLFESKT PALNTSYDDV YPQIIDDPLV KRLGLQGEAL EDRWRSWRRK KEHDARIEFD QMLRENSFVE FWSKMRKKTM DEKALEVQEQ DEYDEGEGMG EGGAADLTQL AKQIDLGEIK AVLRVRSFVL VHLCVVKNAN GCGSQRDKRY TVFDHMPDER EKWLRDYLEN VEAASGSKTI HNVGLDR
|
| |