Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA06730 |
Symbol | |
ID | 3253209 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1833217 |
End bp | 1835947 |
Gene Length | 2731 bp |
Protein Length | 726 aa |
Translation table | |
GC content | 49% |
IMG OID | 638252995 |
Product | RNA splicing-related protein, putative |
Protein accession | XP_566905 |
Protein GI | 58258985 |
COG category | [R] General function prediction only |
COG ID | [COG5191] Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.525366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTCGTTGTG CCCGACTCTA CATCTACATA TCCGGACAGA AGGTCGCTTC CGGGGAATAT ACACCATGGC AGGAAGAGAT CCGAGAGACC GTGCTCCAAG AGTGCGCAAC AGAGCACCCG CTGCCGTACA GGTCTGTGCA TAATCCCAAG CGGGCTGCAA ACATTGGCTA ATTGTTGCTT GTAGATCACA GCGGAGCAGC TTTTGAGAGA GGCGCAAGAA CGACAAGAAC CTGCTATCCA GGCACCCAAA CAGCGTGTTC AGGATTTGGA GGAGCTTTCA GAGTTTCAGG CGAGGAAAAG AACGGAGTTC GAGTCAAGGA TCAGATACTC AAGGGACAGC ATTCTCGGTT AGTACATCGC TATTGCGTCT GTGGTCGAAA GATGCTGACA TGTGTTGAAT TAGCATGGAC CAAATATGCA CAATGGGAAG CCAGCCAGAA TGAGTATGAG CGATCAAGAT CAGTGTTTGA GCGAGCGTTG GATGTTGATC CCAGATCAGT GGACCTCTGG GCAAGTATCC TTGCTCAGTG CCGACGTCGG ACGTATGCTG ACAATAAGTG CAGATTAAGT ATACCGACAT GGAGCTGAAA GCTCGAAACA TCAACCACGC TAGGAATCTT TTTGACAGAG CTATCACCCT CCTTCCCCGT GTTGACGCAG TGAGTGTTAT TAATTGGCTC TTCCCGCCGT CAATTGACAA TGCGTAGCTT TGGTACAAAT ATGTCTATCT TGAAGAATTG CTTCTCAATG TTTCAGGTGC TCGTCAAATC TTTGAGAGGT GGATGCAATG GGAACCCAAC GACAAAGCTT GGCAGAGTTA CATCAAGCTC GAAGAACGTT ATAATGAGCT GGATCGGGCT TCCGCCATTT ACGAGCGCTG GATTGCTTGC CGCCCGATTC CCAAGAACTG GGTGACATGG GCCAAGTTTG AAGAGGACAG GGGTCAGCCG GACAAGGCTC GGGAGGTTTT CCAGACAGCT TTAGAGTTCT TCGGTGATGA AGAGGAGCAG GTGGAAAAAG CCCAATCGGT ATTTGCTGCG TTCGCAAGGA TGGAGACTAG ATTGAAGGAA TTTGAAAGGG CGAGAGTGAT TTACAAATTT GCCTTAGCAA GATTGCCTAG ATCAAAATCT GCTAGCTGTA AGTCCTCACC GTGACGTTTT CCGCAATGCT GACTAGTTCG TAGTATATGC CCAGTATACC AAATTCGAGA AGCAACGTGA GTTGGTGAAC TTGTAGCAAC AATTATACTA ATCCCGATTA AGATGGTGAC CGTGCTGGTG TTGAGCTCAC TGTTCTTGGC AAACGGCGCA TTCAGTACGA GGAAGAATTG GCCTATGATC CTACGAATTA TGATGCGTGG TTTTCTCTTG CCAGGTTAGA GGAAGATGCC TATCGCGCGG ACAGAGAAGA TGGTGAAGAT GTTGAGCCAA TGCGAGTTCG GGAAGTGTAC GAGAGAGCTG TGGCCAATGT CCCTCCTGCG CTGGAAAAAA GATATTGGAG AAGATACATT TATCGTATGT GAAGTCTCGT TTCATGCTGC TCCGCTAATG TTCGTAGTGT GGTTGCAATA TGCTGCATTT GAGGAGATTG ACACCAAGGA CTACGACAGG GCGCGGGATG TCTATAAAGC AGCTGTTAAA CTTGTGCCAC ATAAAACGTT CACTTTTGCT AAGGTAAGCT GCGTCTGCTC TGGGCAAACG AAAGAATACT TATAACATCT TCAGCTCTGG CTGGCTTACG CTTACTTTGA AATTCGTCGA CTTGACGTCT CTGCGGCCCG TAAAGTTCTT GGTGCTGGTA TCGGCATGTG CCCCAAACCG AAGCTCTTTA CTGGTTATAT CGAACTGGAG ATGCGGCTGC GAGAGTTTGA TAGGGTTCGA ACATTGTACG AGAAGTTCTT GACTGTGAGT TGTCTTCACT GTGTAAGCAT GTGACTGGAA GCTAACATCT ACGTAGTATG ACCCCTCCCT CAGTTCTGCC TGGATCCAAT GGACTCAAGT TGAATCTGCC GTCGAAGATT TCGAACGTGT TCGAGCAATT TTCGAACTCG CCGTGCAACA ATCTCTGGAT ATGCCCGAGA TCGTCTGGAA AGCGTACATT GACTTCGAAG CCGGCGAGGG CGAGCGCGAA CGTGCGCGTA ATTTGTACGA ACGCTTGCTC GAACGTACTT CTCACGTCAA AGTCTGGATT TCATACGCAC TCATGGAGAT CGCGACGCTT GGTGGGGGAG AGGATGAAGA TGGCAATGAG ATTGAAGGCG AGGCTGGGGA CGCTGATTTG GCGAGGCAAG TGTTCGAAAG AGGTTATAAG GACTTGCGAG CGAAGGGTGA AAAGGAAGAT AGAGCCGTGT TACTCGAATC TTGGAAGAGT TTCGAGCAGG AGCATGGCGA CGAGGAGACG TTGGCCAAGG TAGAGGATAT GTTGCCCACA ACTCGAAAGA GGTGGAGGAA GGCAGAGGAC GGGAGTGGAG AGTTGGAAGA ATACTGGGAC TTGGTATTCC CCGACGATGA AAGGGAAGCG AACCCGACTA GTTTCAAATT CTTCCAGGCT GCCCAAGCTT GGGCTCAACA GCGTGCTGGG CAGGGAGAAG AAGGCGGTTT ATCCTACGAT TTGCCGTCAG ATTCAGAGGA CGAAAACGAG GACGGAGACG AGGACGGGGA CGGTAGGGAA GAAGAGGGAA TGGACCAGGA TTAGACCTTG CGTTGTTCTT TATAGTCAGA TATCATAACA T
|
Protein sequence | MAGRDPRDRA PRVRNRAPAA VQITAEQLLR EAQERQEPAI QAPKQRVQDL EELSEFQARK RTEFESRIRY SRDSILAWTK YAQWEASQNE YERSRSVFER ALDVDPRSVD LWIKYTDMEL KARNINHARN LFDRAITLLP RVDALWYKYV YLEELLLNVS GARQIFERWM QWEPNDKAWQ SYIKLEERYN ELDRASAIYE RWIACRPIPK NWVTWAKFEE DRGQPDKARE VFQTALEFFG DEEEQVEKAQ SVFAAFARME TRLKEFERAR VIYKFALARL PRSKSASLYA QYTKFEKQHG DRAGVELTVL GKRRIQYEEE LAYDPTNYDA WFSLARLEED AYRADREDGE DVEPMRVREV YERAVANVPP ALEKRYWRRY IYLWLQYAAF EEIDTKDYDR ARDVYKAAVK LVPHKTFTFA KLWLAYAYFE IRRLDVSAAR KVLGAGIGMC PKPKLFTGYI ELEMRLREFD RVRTLYEKFL TYDPSLSSAW IQWTQVESAV EDFERVRAIF ELAVQQSLDM PEIVWKAYID FEAGEGERER ARNLYERLLE RTSHVKVWIS YALMEIATLG GGEDEDGNEI EGEAGDADLA RQVFERGYKD LRAKGEKEDR AVLLESWKSF EQEHGDEETL AKVEDMLPTT RKRWRKAEDG SGELEEYWDL VFPDDEREAN PTSFKFFQAA QAWAQQRAGQ GEEGGLSYDL PSDSEDENED GDEDGDGREE EGMDQD
|
| |