Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA01900 |
Symbol | |
ID | 3253759 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 510433 |
End bp | 512806 |
Gene Length | 2374 bp |
Protein Length | 724 aa |
Translation table | |
GC content | 47% |
IMG OID | 638252523 |
Product | conserved expressed protein |
Protein accession | XP_566565 |
Protein GI | 58258305 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.550521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACTC TGGATAGGGC GCAGGAAAAA TTGAAGAACG GATGGGGTAA AGAAGAAAGG GAAGCTCCTC TACCAATGAG ACCGTGGCCA AACCACATAG GAGGCTACTG TGCTCCAGCT TCTGACCCTA GATGGAAGGG AAAAAGGCGC AAACCAACAC CCGCAGTGAA GGATGATGGA TTTTGGGAAG GTGTTCTCGG TCGAACGTCG ACTCCTGTCT ATCAAGGGAT AACATCAAAA GCATATGAAA TTCCCATTTT TCCTCATATC GCTAAACACC CATCTTTTTT ATCCATTCCT TCCAAGTGCT CTTCATCATC TTCTAATCGC GCTCTCTGGT CTGATCGCTA TCGTCCTCTT CGAGCATCCG AAGTGATAGG TAACGAAGTT GAAGCGACCT ACCTCCGAGA TTGGTTATCT ACTCTCGCCG TAGGTGGGCA ACATGCCAAA GGTTCAAAGA TTGTCAGACA GGTTGTAAAG AAACCGAGGT CGGCCTTGGT TGATGGCTTT ATTGTGGATG ATCTGGGACT CTATGGGGAC ACACCTAATT CTGAAGAGGA CGGTGAAGAT GAATTTCCGC ACCTTGAAGA TCTCCCGGAT CCTCCCATCT CTCATGACCT CAACGCCCGT CCCGATAAAT ACCCTTCTTT GGCCTCTCAC CTTGCCAACA CCATTCTCCT TACTGGTCCA ACGGGTTCGG GCAAGACAGC TGCAGTGTAT GCTGCGGCTC ATGAGCTAGG TTGGGAGGTA TTTGAAGTTT ATGCGGGAAT GGGCAGACGG ACCGCTGCGA ATTTGATGAA GTGGGTAGGA GAATTAGGCA AGAATCATAC TGTCCTCCCG CAGGATGGCA AGTCGCAAGG CACGATAAAC GACAATGAGA AGAAGGGGAA GAGCAGGGGG AGAGGGAAAG GCCTCTCGTC ATTTTTTGAT AAGGGATCAT TCCAGTCTAG CAAGGTTTCC TTAAGCCGGG GGATTGCCAG TGATCCGATA GACATTGAGT CTAACGGCGA GAGCGACAAG ATACCAGTGA CTGAAGCTGC TAATGTTTCT GGAGGAGAAC CAGGGATCAA ATTCAAAGAA TCATTGCTCT TAATTGATGA AGCGGACATC TTATTTGAAG AGGAAGGGTC GTTCTGGCCA GCAGTGATCG CTTTGGCATC CGAGTCAAGG AGACCGATAG TATTGACTTG TAATGGTGCG TACTTCATCT AGCCCGATTT CCTCGCCTCT CAATTAATCG CTGACTTTAT TAAACGGCAG ACCATCAGCG AATACCAAGG ATTCAACTGC CACTCCAGGC AATTCTGCAA TTCCATCCCA TTCCGTCATT CATTGCCCTC CCGTATCTCC AGGCTATCTC TTCACAAGAA TCGCAATTGC GCGGAAAACC TTGTAACCCT TGTGTAGAGA CTATTTTTCG AGGAGCTATC CACCAAACGC CCGAAAAGGA TGTGCTTAGC GACCAATGCT TGCCGCCTAA TGGACACGAG CGGATACCAT TCTTTGACCT TCGACAAGCG ATGATGCAAC TGCAATTTGG GTTGACAGAT CAAATACTCC AGAGAGGCTG CGCAAAGAAA TATGGAACCC CAGATGAGGA TGAGAAAAAG GACGATTTAC AACTAATGAC GGAGAGGATG GAAGTTATCT CGTTCTCTGA TGCCTTCATC GATATCCGAC CTAGGGTATT AATGGAAGTG AGCAGCTCTT CTATCTCAAA TAAATATATT ACATCCGCTG ACGCTTCACT TTATAAGCTT TACGACGTCG ACAAGCTGCA GCCAACTTCA GATGAGGAAC TGGATGTCGC TGCCCTGTTG AAACCAGAAA TGTACGAGAC GTATCCCATC CTAGCAATGA TCGATAGATC TTCGGACATT GCAGGTTCAC TCGTCCAGGC CGTTGGTGGA CACCTGCCAC CTTTCGGTGA TCTTGGCCTT GCAAGGTTAG TTTCTTCTAA TCGCAAAGAT TATGAATTTA TGTTGACCCC AAATGAAGGG CCAAGTACAT CCGCTCCATG CTCCCCTTAC TTGACCCCCT CATCCCTCTA TCGGAACCTT TATTGCCCCA TTCAACTCTT TTCCTTTATA CCCTCCCTAC AATGCAAAGT ATTATTTCCG CTGACGACAT CTTTGAGGCG CTTGAACAGC AAGCTGTCGA TAGAGGCGAC GAGAAAATTA ATCCGAGAAC GGGGAAGCCT ATGCGCCGGG TAGCAGGATA TACATATACT AGGTATTGGG ATTTGGAGGG AGCGGAAAAT GAGGCAAGGA GGATTAGCAG ATTAATTTGG TAGAAGTAAC AGAGGTAACA CGTCTGTAGC GATAATCAGT CAGATTCGCA GTGCTTATGT GTAGTATCCG ACGACAAATG ATAT
|
Protein sequence | MSTLDRAQEK LKNGWGKEER EAPLPMRPWP NHIGGYCAPA SDPRWKGKRR KPTPAVKDDG FWEGVLGRTS TPVYQGITSK AYEIPIFPHI AKHPSFLSIP SKCSSSSSNR ALWSDRYRPL RASEVIGNEV EATYLRDWLS TLAVGGQHAK GSKIVRQVVK KPRSALVDGF IVDDLGLYGD TPNSEEDGED EFPHLEDLPD PPISHDLNAR PDKYPSLASH LANTILLTGP TGSGKTAAVY AAAHELGWEV FEVYAGMGRR TAANLMKWVG ELGKNHTVLP QDGKSQGTIN DNEKKGKSRG RGKGLSSFFD KGSFQSSKVS LSRGIASDPI DIESNGESDK IPVTEAANVS GGEPGIKFKE SLLLIDEADI LFEEEGSFWP AVIALASESR RPIVLTCNDH QRIPRIQLPL QAILQFHPIP SFIALPYLQA ISSQESQLRG KPCNPCVETI FRGAIHQTPE KDVLSDQCLP PNGHERIPFF DLRQAMMQLQ FGLTDQILQR GCAKKYGTPD EDEKKDDLQL MTERMEVISF SDAFIDIRPR VLMEVSSSSI SNKYITSADA SLYKLYDVDK LQPTSDEELD VAALLKPEMY ETYPILAMID RSSDIAGSLV QAVGGHLPPF GDLGLARAKY IRSMLPLLDP LIPLSEPLLP HSTLFLYTLP TMQSIISADD IFEALEQQAV DRGDEKINPR TGKPMRRVAG YTYTRYWDLE GAENEARRIS RLIW
|
| |