Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA00710 |
Symbol | |
ID | 3253470 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 206616 |
End bp | 209983 |
Gene Length | 3368 bp |
Protein Length | 946 aa |
Translation table | |
GC content | 50% |
IMG OID | 638252404 |
Product | pre-mRNA splicing factor prp1, putative |
Protein accession | XP_566496 |
Protein GI | 58258167 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.547118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTGCAGCTG GATTCTCTTC TTATTTTCTA ACGCCTAAAA CATCGCGAGT ATGTCCAACG TCGGCACAGT GAAGCATATT CCCAAGGAAG TGCGATACAA CTTCCTCAAC GTACGTATCT CCTCAGTTTG TTGAAGCGGG ATCGGGAGCT AACAGTACCC ATAGATGGCC GCTCCGGCGA GCTATGTTGC TGGTCTTGGT CGAGGGTAAG TGGCATCTGC GCCCAATTCC TGTGTGACTT TAAACTGACA GACGTATAGT GCTTCAGGTT TCACAACTCG GTCCGATATT GGTCCCGCCC GAGCTGGTCC CAGTGCAGAG GTTGTTGCGG AAGCTCAAGC TCGCCGTGGT GAAGAAGAAA TCCCTGATCC AGATGCTTTC CAAGATCCGG ACGACGAAAG GAATCTGTTT GCTGGTACAG TCTACGAAGC GGACGATGAA GAGGCCGACA GGGTGTGGGA CAGTGTTGAT GCCAGAATGG ATGCGAGAAG AAAGGCCAGA CGGTGGGTCT TCTCTATGGA TATCTTGTCT CATGTTCGAT ACTGATTAGA TATGCAGAGA CGCGGCAGAA GCGAAAGCGG CGGCTGAAGA ACGTGCTCGC AATCCCAAAC TTCAAACGCA ATTTGCAGAC TTGAAACGAT CTCTATCGAG CCTCAACGAT GCTGATTGGG ACGCCATCCC TGAAGCAGGA AACTTGACTG GAAAAAGGAG AAAGGCCAAT TTGCGATTGG AGGAAAATCA GAATGGAAGA AGCTACAATG TCAGCGACAC TGTCATTGCA GATGCCGTGA AGAGAAATGC CATGGTAGGA GAGTTGGATC CTGCAGAGGT GGGTAACCTA GTAGTAACTC GTCATCGAAA CTAATCACTT CCACAGGCTG GTATTGGTAT CGATGGTACC GAAACGGATC TTGTATCTAT CGGTAATGCC AGGGACCGAG TATTGTCGCT GCAGCTTGAC CAAGTGAGTT TAGATTTAAT CGTGGGTTAT GGTATTAGTC ATTGACAGCT TTGGTTAGGC CACAAGAGAC GCCTCAAACG GCTCTTCTAC CAGCATCGAC CCTAAAGGCT ATATGACCGC TCTTAACAGT CAGATTGTTC AAACAGACGC TCAAATTGGT GATATTAAGC AAGCTCGCCA GCTCTTGCAA AACCTCATTC AGTCTAATCC CAAACACGCC CCAGGATGGA TCGCCGCCGC TTCCTTGGAA GTACACGCGA AGAAGATGGT CGCTGCCAGG AAGATTATCG CTGAAGGATG TGAGAAGTGT CCGAAAAACG AGGATGTTTG GTTCCATGCC GCTGAACTCA ACACACCGGA GAACGCGAAA GTTATCTTGG GTCGAGCTAT ACAGCACGTT CCTCAATCTG TTAAAATTTG GCTCAAGGCT GCTTCTCTAG AAACAGACAT AAACGCCAAG AAGCGCGTTC TCCGAAAAGC CCTTGAATTC GTTCCCAACT CTGTGGGGTT GTGGAAGGAG ACTGTCAACC TGGAAGATGA TCCTGAAGAC GCCCGCGTTC TCCTTACCCG TGCTGTTGAA GTCATCCCCA ACTCTGTGGA GCTCTGGCTT ACTCTGGCCC GTCTTGAAAC TCCTGAAAAC GCCAAGCAGG TTCTCAATTC TGCGCGCAAG CGTATCCCTA CCTCTCACGA AATCTGGATC GCTGCCGGTA GGCTTGCTGA GCAGTCACCT TCCGCCGTGG CTGTCAAGCC AGAGGTCAAG ATGGAGGACG AAGCGGAATA CGAGGCTGAG CAAAGAAAGA AGCTTGCTCA GCAGGTCAAC AAACTCATGG CTGGTGCAGT CAATTCATTG CGCAAGAATC AGGTCATTCT TTCGCGAGAA CAATGGTTGC AAGAGGCCGA GAAATGTGAA CAGGACGGCT CACCTCTTAC AGCGCAAGCT ATCGTGAAGG CTACCATCGC TCAGGACGTC GAGGAAGAAG ATAGGAGATC TGTCTGGATT GAAGATGCGG AGAGGGCGAC AAAGGGTGGA TTTTACGAGG TCGCGAGAGC TTGTTACGCC GTCACTCTCG AGGCTTTTCC TAATACTCCA TCAGTCTGGA GAAAAGCCGC CGAGTTCGAA AAGGCCCATG GCACACCGTG AGTTACTTTA TGTAATTGCA AGGCCAGATG ACTGACAAAA GATTGTAGCG ATGCTGTCCA AGAAATTCTC GCCCAAGGAT CCCAACACTG TCCTCATGCG GAGGTTCTCT GGCTTATGGC TGCGAAAGAG AAGTGGGTCG GCGGCGATAT CCCCGGTGCT CAAGCCATTC TTGCCGAAGC TTTCAAACAA AACGAAGATT CCGAATCTAT CTTCCTTGCT GCCGCCAAGC TAGCAGCTGA GACCGGCGAG ATGGAGGCTG CTATCCAGAT CCTTGAGAAG GCCAAGGCAC AGGCAGACAC AGAGAGAGTC TGGATGAAGT CAGCGGTACT GTTGAGGCAA TTGGGCAAGT TGGACGAGGC TCTTTCAACC TTGGAAGTTG CAATCAAGAA ATTCGCTTCC TTTGACAAAT TGCACATGAT CCGAGGGCAG ATCTACGAGT CCCGTAATGA GGTTGCGCTT GCGCGAAATG CATATGCTCA AGGATGCCGA TCATGTCCGA AGAGTATCCC ATTATGGATC TTGTCGGCTC GTCTGGAGGA GAAGGCGGGT GTGACGATCA AGGCAAGGGC ATTGCTCGAA AAGGCGAGGT TGCATAATCC CAAGAATGAT GAATTATGGG CGGAAAGTAT CAAGATTGAA GAACGAACGG GCAGCCCACA GCAAGCGAAA TCTGTCCTTG CTCGAGGTAA GTATCACTTT ATTTCTAGCC CATCCTGCGA TTTACTAACA GCTTCATTCT ATAGCAATGC AAGAATGCCC CGCCTCTCCT CTTCTTTGGT CCATGGCCAT CTTCATGGAG ACTCCTCAAC AAAGAAAAGG TCGTTCCGTT GACGCAATCA AAAAGGCCGG CGAACATCCG GCCGTCATCT TGGCGGTCGC GAGAAACTTC TGGAGTGAAA GGAAGATTGA AAAGACGAGA CAGTGGATGG CCAATGCTAT TACCGCCGAT GAAGATTGGG GAGATGCCTG GGGTTATTGG CTGAAGTTCG AGAGGCAACA TGGAGAGAAA GTGAGCTGAT TCTTGTTTTC AGTTTCTAAA AGGATATCTG GACTGATACA TGAATATATA GGAACGTCAA GAAGCGGTCG TTGAAAAATG CATCGCGGCA TCACCACGCC ATGGTCCGGT ATGGCAGTCG GTATCAAAGG ATTTGGCCAA TGTTGGCAAG TCTACAAAAG AGATACTGGA GTTGGTCGCG GACAAACTGG AATAATGTTT TAGTATAGTG CCAGTTCTTT TGGTATGTCA TGATCACAAT ATGTCCCG
|
Protein sequence | MSNVGTVKHI PKEVRYNFLN MAAPASYVAG LGRGASGFTT RSDIGPARAG PSAEVVAEAQ ARRGEEEIPD PDAFQDPDDE RNLFAGTVYE ADDEEADRVW DSVDARMDAR RKARRDAAEA KAAAEERARN PKLQTQFADL KRSLSSLNDA DWDAIPEAGN LTGKRRKANL RLEENQNGRS YNVSDTVIAD AVKRNAMVGE LDPAEVGNLA GIGIDGTETD LVSIGNARDR VLSLQLDQAT RDASNGSSTS IDPKGYMTAL NSQIVQTDAQ IGDIKQARQL LQNLIQSNPK HAPGWIAAAS LEVHAKKMVA ARKIIAEGCE KCPKNEDVWF HAAELNTPEN AKVILGRAIQ HVPQSVKIWL KAASLETDIN AKKRVLRKAL EFVPNSVGLW KETVNLEDDP EDARVLLTRA VEVIPNSVEL WLTLARLETP ENAKQVLNSA RKRIPTSHEI WIAAGRLAEQ SPSAVAVKPE VKMEDEAEYE AEQRKKLAQQ VNKLMAGAVN SLRKNQVILS REQWLQEAEK CEQDGSPLTA QAIVKATIAQ DVEEEDRRSV WIEDAERATK GGFYEVARAC YAVTLEAFPN TPSVWRKAAE FEKAHGTPDA VQEILAQGSQ HCPHAEVLWL MAAKEKWVGG DIPGAQAILA EAFKQNEDSE SIFLAAAKLA AETGEMEAAI QILEKAKAQA DTERVWMKSA VLLRQLGKLD EALSTLEVAI KKFASFDKLH MIRGQIYESR NEVALARNAY AQGCRSCPKS IPLWILSARL EEKAGVTIKA RALLEKARLH NPKNDELWAE SIKIEERTGS PQQAKSVLAR AMQECPASPL LWSMAIFMET PQQRKGRSVD AIKKAGEHPA VILAVARNFW SERKIEKTRQ WMANAITADE DWGDAWGYWL KFERQHGEKE RQEAVVEKCI AASPRHGPVW QSVSKDLANV GKSTKEILEL VADKLE
|
| |