Gene Information |
Locus tag | CNA04980 |
Symbol | |
ID | 3253268 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1316629 |
End bp | 1317981 |
Gene Length | 1353 bp |
Protein Length | 333 aa |
Translation table | |
GC content | 48% |
IMG OID | 638252817 |
Product | peroxisome targeting signal receptor, putative |
Protein accession | XP_566938 |
Protein GI | 58259051 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0414188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATACC TCCGCCTCTC TCTCCCTCCG TTTGCACATA ACAACCTCGC CTTCTCTCCG TTTTACGATC ATCATTTGGC TCTTGCTTCC GGTTCCAACT TTGGGCTAGT CGGCAATGGC AGGGTGCACG TCGTCAAGAT GGACCAGCAG GTCGCGGGTG GATTGGGGCT GGTGAGAAGT TGGGATACAG CAGATTGCGT GTATGACGTG GCTTGGAGCG AGATACACGA GTAAGTTGCT CAGCTTGTGA GAACTGTCTG GGAGAAGCGA GATGTTCAGA CTGGGGGATG TGTTACAGTT ATAGGAGATG TTGAGAGAAG GTGTTGATGA TGGAGACGGC AGCGACCGGA ACGTATAATG ACCATCCATT ACTTATAGGA ATCAGATAGC AGCAGCTTGC GGTAACGGTG CCATCAAACT TTTTGATCTC GCTCTCGAAG TAAGTTTTGT CGCCTCCGAT GGGATTGTCA TTTGGGAGAC ATCACTCTTA TTTTTCGGTG GTAGGGACTA CCTATACAAG CTTGGCAAGA ACATACGGCT GAAGTCACGT CCATCGAATG GAACAACATT GAAAAGGAAT TGTTTGTGAC TGGTTCATGG GACCAATCTG TCAAAATCGT TCGTTTCCAT TATTATTCGT TTCATGCTCG AAAAAGTTTG GGGTTAACAA AGACTTGCTG TGACCATAGT GGAACCCCAA TCGACAGTCA TCAATCCTAA CCATACCTGC CCATGCAGGC CAAATATACT CCTCCACTTG GTCTCCTCAT TCACCAACCA TTATCGCGAC TTGTGCTTCT GATGGGTTTA TCCGGATATG GGATACACGT ATTCTCCCCT CCCCCATCCA AGAAATCTTC CCTCCCTCCG CCGCCCCTAA TCCAATGTCA TCACGTTCTG CTGGAGAAAT ACTTAGCTGT GACTGGAATA AATATACTCC ACAGCTGCTA GCGTTCTCTT CTCAAGATGG AGGGGTCAGT ACGGTGGATT TAAGACACGT ACCTCGCAAC GCAGAGAAGA TGGCGGTAAG GCTAGTGGGA AAACATGGTT TACCGGCGAG GAAAGTGAAA TGGGACCCGC ATAATGGAAC CAGATTACTG AGTGCAGGCT ACGATATGAC TTGTAGAGTG TACGTTCATT TGTCTTATTT CGACACATCC TGTCGACTTC GATGCTGATT TTGATGTAGC TGGCAAACTG ATCTGCCACC AGCCGCACCT TTAAGAGAAC TATTTAGTCA TCAAAACCAT ACGGAGTTTG TAATGGCTGC AGATTGGGCC TTGTTTGACC CCGGATTAAT AGCTAGTGCG GGGTGGGACG GGGATTTGCA TATGTATCGT ATCTAGCTGT TGT
Protein sequence | MQYLRLSLPP FAHNNLAFSP FYDHHLALAS GSNFGLVGNG RVHVVKMDQQ VAGGLGLVRS WDTADCVYDV AWSEIHENQI AAACGNGAIK LFDLALEGLP IQAWQEHTAE VTSIEWNNIE KELFVTGSWD QSVKIWNPNR QSSILTIPAH AGQIYSSTWS PHSPTIIATC ASDGFIRIWD TRILPSPIQE IFPPSAAPNP MSSRSAGEIL SCDWNKYTPQ LLAFSSQDGG VSTVDLRHVP RNAEKMAVRL VGKHGLPARK VKWDPHNGTR LLSAGYDMTC RVWQTDLPPA APLRELFSHQ NHTEFVMAAD WALFDPGLIA SAGWDGDLHM YRI
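The length and composition fields above can be cross-checked from the record itself. The following is a minimal sketch, not part of the database's own tooling: the function names (`gene_length`, `coding_length`, `gc_percent`) are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the coding length assumes one stop codon after the last amino acid. Because the gene is marked as spliced, the gap between the genomic span and the coding length is only an estimate of the non-coding bases (mostly introns) inside the span.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per codon).

    Assumption: one stop codon follows the last amino acid.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; spaces and other non-ACGT characters are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    span = gene_length(1316629, 1317981)  # Start bp / End bp from the record
    cds = coding_length(333)              # Protein Length from the record
    print(span)        # 1353, matching the Gene Length field
    print(cds)         # 1002
    print(span - cds)  # 351 bases in the span not accounted for by codons
```

To compare against the reported GC content, the full value of the Gene sequence field can be passed to `gc_percent` unchanged, since the 10-base grouping spaces are ignored.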