Gene Information |
Locus tag | CNA06230 |
Symbol | |
ID | 3254007 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1695605 |
End bp | 1698589 |
Gene Length | 2985 bp |
Protein Length | 900 aa |
Translation table | |
GC content | 50% |
IMG OID | 638252944 |
Product | hypothetical protein |
Protein accession | XP_566972 |
Protein GI | 58259119 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
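The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive; the record does not state the convention, but this one reproduces the listed gene length. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: three per residue, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(1695605, 1698589)  # Start bp, End bp from the record
cds_len = coding_bases(900)               # Protein Length from the record

print(gene_len)            # 2985, matching the Gene Length field
print(cds_len)             # 2703
print(gene_len - cds_len)  # 282 bases of the gene span that do not encode protein
```

The 282 bp remainder is consistent with the gene being marked as spliced: it would be made up of introns plus any untranslated sequence included in the gene span.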
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCTGCAAA CATCAACTTT CAAAGGTCAT GTTGAGACCC AGCATAGTCA GAAAGGTAGC CAGGATCAGG CATGCCAGAC TTGCTTGCCA AAATTTACAC ACGTAAGTTG GCCAACGCTC ATTACTCCCG CCAGCTGATG TTTGAGCAGA CAAGATACAT ATCCTCCTTC CTTCCATTCC TTGTCGGAGA AGAGTACCAC GCAAACCTCA TCTGAAAGAA ATTTCCCTCT CTATAAACCC CAGAAGCATG CCCAAGAAAT ACCTCTAACT GCAGACACTC TTGCAGCTCT TCCTTCCGAA CAAGCTCTCT CCCGACAACT CAAACGGCGT GTCACCCAAA AAGCAGTCGA AAATCTTCCG GAATTCACCG AGGATGAGTT GAGGGATTTC TACGCAGCCT TGGTTAAATC CGGATCCAAG GACAATGGCG GGATATATCA AGCGCTGGAA GCGCCTTCAG AGACAAAGAA AATCCCTATG CCGAAAAAAG AGAGGGAAGA AATCTTAACG GGCATGGCGG GAAGGCTGGT TGGGGAGGGG GGGCTTGCCC TGCCAAACGG TTCTACGCGA TATGGTAAAT CTTTGGAGCT TGTTGTAAAT GAAAACCAGT CTTCCGATGC TCCTTACGCC ATTCTTGATG TCTTGTCCCA AGCAGCCATA TTGAGTGGTC CGAGTGAGAC AAGCAAAGGG AAGGCAATGG AAGGTCAACT GCCCCTTAGA AGCCTTGAAG TACCTTTGGG ATTGGTCACG TCCAAAGAAT GGGATGCTCT TTTCCAGGAA TTTGTAAGCC TTTTCTGGAT CAAATATGAT ATACCTGCTC ACTATTCGTA GATACGGAAA GGAGATGCTG CAGCTGCCGA GTCTCTTTTG GACATTATGG ATGTAAGCGA CATTGAGATA TGTGAGAGTA ACCCACACTG ACAGTCATTC GGTAGGCCCA TGGTGTCCCA GTCGGTATCG AACGTATAAA CAGCGTCATA CATGTCTATG CTCTTCAAGG TCGTTCTGAA GATGTGGCTC GTTTAACCTT GGATGTCACT GCAAGTAAGT TATATCTTTT CCGTCTCGAT CATCACCATA ATCTTATAAT GCGACATAGT GGGCGAAACT GTCTCGTCAG ATCAGCAAGA TGATTATATT CTGTCTATCC TCCGGCAAAC ACCATCTAAT CCCGGTCCGG CGATTGCCCG CCTCCGTCAA GCTGAAATAG CTGGCACTCC TTTCCCCCAA TCGTCCTATC AAGTCGTCAT CTCACATCTC ACTTCACCGT CTTCCATCGT CCAACCAAAC TCTCATACTC GGGCAGTTGC TTGGGACTTG TTTGCCCATA TGCGTCTAAC TGCGCATCCT ACTCCGACGC GGGAGCTGTA TACGACAATG ATCAAGTCAT GTGGTGAAGC CAGTCAGCCT GAACCCGAGC GAGCGAGGGA TCTGTGGATT GAAATGACAG AGCAGGAGAA AATTGAGCCT AGCACGGAAG CTTATAACGC GATCATCAAA TCTCTAGGAA GTACGAAGAA GGACTACCTC GAAGCATTTG ACTTGCTTCG GCAGATGCTG GTCAAGCACA ATGAGGCAGT ATTCGTCCCT TTTGAAGAAG AAGACGGTAT CCCAAGGTTC AGCGAGTATG TCCCAACGCT AGAAACGTTT GCGTCGGTAC TAGAAGGGAC GAAGCGGGCA GGAGACGTCA ATCGAGCCAG ATGGGTCTTA TCAGAGGTCA TTAAACTATC AAGAGCCGGC CAGGCCCTGA ACGTGCCAAA GTGGAAGGAA GGGCCTAGTG GTGATTTGAT GGCGGCTGTG 
TTCATGACTT ATGCAGCTTG GAATCCAGTG CTGCGCAAGA GAGAGGTCAA AATGAAGCAA GCTTCGGAGA AGCCATATGG AGCAGCGTCC GAAACGGCGG TCGTCGAGAA GGAAGAGCAT AGGCTTTCGA AGGGCGAAGA GTTCCTCAAC GTGGATGTTC TGCAAGACGT GGTTGAGCCT TCTGCCGGCT CTTCGTCGAT TGAGTATACT CTTGAAGAAC CGCCTGTCAA TGATTTTTTC ACCCCGCTGT CAAGCGCTGA CGCGATCCGC GAGTGCACGA GTCTTTTCCA CCGCATCCTC TCCGATATCC AGTCTTCTTC CTCCACCTCT GATTTCCTAC CTTTTCGCCA CGTATATATT ACGCCAAAGC TCATCAATTC GTATATCTCT GTGTACAATG CTCATGGCCC GTCCCTGTCA TCTGTGAAAG ATGTGTACGA TACCGTGTGG GAAGAAGCGT CGAGCGTCAC TGGTCGTTCC GTCAGACCCA ACGCCTGGAC TTTTATCCCT CTGCTCGAGA AATGTGCTTC AGGATCTCGA GGGGGTATAT ACCCGGCCGA CCGATCCCTA GCTCTCTCCT GGGGACGTGC CCTGTGGTCG TCATACCTCG CATTCTTCAA ATCCGCTTCT TCTGAACTCG CTTCGATCCC ATCCTCTGAA CACCTCACTA TCCAGCGTCG CAAGTACCTC TTCGGTTTAG GCGACAGGCA GACTGAAAAA ATGTGGAAGG CTGTTATCAA ACTCGAGGCT TTGCACGGCG ACGTTGATGA AGCAATGAGG TTGCTTGAGG AATTCCATGC GACGTATAAG CCCAGTGCTA TCACACAATT GTACGCACCG TTGCCGGAAG TGGGCCTGCA GCTGAAAATG TTCACACCGG CCATGACGCC CGAGGCAGAT GTACCTCCTC TCTTGTTGTT TGATGATATC AAGCCTCTGC ATCAGCGGTT AGTGAGGGAA GCAAGGGCCA AGGATATTAA AAAGGTCAAG TGGATAGCGG CGGGTTACGA AAAGTCGTTG AAGACGAGGA AGGGATGGAG GATGAAGGGT GTGGGACAAC AGAGAGAAAA GTCGAGGGCA AAAGATTTAG AACGGCTTCT CGCCCAGCAA GGCGAGATGG AGAGGCTCGA GTGAGGGAAA AGACAAGCTG AATACCCTTA TTTGTATATA CCCTGCACAA GAATA
Protein sequence | MLRPSIVRKV ARIRHARLAC QNLHTQDTYP PSFHSLSEKS TTQTSSERNF PLYKPQKHAQ EIPLTADTLA ALPSEQALSR QLKRRVTQKA VENLPEFTED ELRDFYAALV KSGSKDNGGI YQALEAPSET KKIPMPKKER EEILTGMAGR LVGEGGLALP NGSTRYGKSL ELVVNENQSS DAPYAILDVL SQAAILSGPS ETSKGKAMEG QLPLRSLEVP LGLVTSKEWD ALFQEFIRKG DAAAAESLLD IMDAHGVPVG IERINSVIHV YALQGRSEDV ARLTLDVTAM GETVSSDQQD DYILSILRQT PSNPGPAIAR LRQAEIAGTP FPQSSYQVVI SHLTSPSSIV QPNSHTRAVA WDLFAHMRLT AHPTPTRELY TTMIKSCGEA SQPEPERARD LWIEMTEQEK IEPSTEAYNA IIKSLGSTKK DYLEAFDLLR QMLVKHNEAV FVPFEEEDGI PRFSEYVPTL ETFASVLEGT KRAGDVNRAR WVLSEVIKLS RAGQALNVPK WKEGPSGDLM AAVFMTYAAW NPVLRKREVK MKQASEKPYG AASETAVVEK EEHRLSKGEE FLNVDVLQDV VEPSAGSSSI EYTLEEPPVN DFFTPLSSAD AIRECTSLFH RILSDIQSSS STSDFLPFRH VYITPKLINS YISVYNAHGP SLSSVKDVYD TVWEEASSVT GRSVRPNAWT FIPLLEKCAS GSRGGIYPAD RSLALSWGRA LWSSYLAFFK SASSELASIP SSEHLTIQRR KYLFGLGDRQ TEKMWKAVIK LEALHGDVDE AMRLLEEFHA TYKPSAITQL YAPLPEVGLQ LKMFTPAMTP EADVPPLLLF DDIKPLHQRL VREARAKDIK KVKWIAAGYE KSLKTRKGWR MKGVGQQREK SRAKDLERLL AQQGEMERLE
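The GC content field in the Gene Information table can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing used in this record. `gc_percent` is a hypothetical helper, and the way the database rounds the value is not stated here.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence, as a small worked example.
print(gc_percent("CTTCTGCAAA CATCAACTTT"))  # 35.0
```

Passing the full gene sequence instead of the two-group excerpt would give the value to compare against the reported 50%.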