Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA02020 |
Symbol | |
ID | 3253827 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 537861 |
End bp | 540734 |
Gene Length | 2874 bp |
Protein Length | 702 aa |
Translation table | |
GC content | 48% |
IMG OID | 638252534 |
Product | conserved hypothetical protein |
Protein accession | XP_566582 |
Protein GI | 58258339 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.354821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCACCATCCA TAACCTATGT CTACCCAGTC TGGTAGATCA ACAACTTCTA AATCCTCTAG CAGGAAGAAG CCGCTCTCTT GTATGTACTT TACAAGTAGT CCCATCTCCA CCGCCGCACA CCAGTTTACC GGCAACGTAG AATGGATGCA TGCTAAATTC CAACAACATA GGTGCAGAAT GCAGACGCCT GAAGCTCAAG TGCGAGTTGG TGTTCCCATG TAGACACTGC GTCAAAAGGG GCTTGGCCAG GTAAGTGGCG GTATACTGTA GTTGGTCCTT TTGAGATCAA CATCACCGTT TACTCGCTCC GTCGCCACCT TTTGAATGAA GGACATTGAT CTTACACTTG TCTGTGACTT GCTGTTGACC CGTCATCACA GTATCTGTCC AGAGGGAGAA CTCGTAAACG GATCTCGCAG GACGTGAGTT CTCTCAGCCT GTATATAGCT CATATAATAC GCAGCAAGAT TCTAGCCAGT ACGGAGGTAT GCATGATTAT CTATCCGTGT TCTATTCAAG ACAACAAATT GACACCACAT AGGACCTTCA CAGCCGCATT GCAGCCCTTG AGGAAGCTCT GAAAACTGCG ACTTCATCCC AACATCCGCT TCTGCAAGAC AGTCTATATG CCAACAGGAA AGAAATTCGG CAAAGGCGAA GCTCCTCCAG CTCTCCTGAA TCAAGCATTC AACTGCCTTC GGATTGTCTC ACCAGCTTCA GCCATTTGAC GCTAGGTGAC GACCCCCATT TCTCAAGGTA TTACGGCGCA GCAAGCAGCG TATATCTTTC AGTGAGCAAA AAAGTCATCT TGCCTTGTAC CCTATAATGA CAACAAAGGT TCTCAGAAGC AGCAGTCGAT ACCGGTGTCC TGTGAACGGT TCGTCCCCTC GGATGAATCT GTTGCACCTC GCCAACCTCA TGCCATTTCA AATGATTACA CCGATCTGTT CCCTCCTTTT TCTTCCGTCC ATAGTCAGCT TGACATAGAT GGCATTTTTC ATCAATATCT CCCGCCACCA GAAATTGGCC TAACGCTGGC GGAAGTGTAC TTTCACGCAA GTACCCATAC TTCTTCTCCA CGATGCTCCC GCTGACACCA CCTCGCAGAC CTTTGGATGG TTCACCAACA TTGTGCAGCG CCCGTCATGG AATGATGTAT TCTTTCATCA TGTCTACCGC GAGTCGGCAT CTATCAGAAG CAACCCCGTT AAGCCGCAAA GGCTAGCGGT CGTACTGCTC GTTTTTGCTC TGGGAGCCCT CATGGATCTC AGTAAACCAC CACATAATGA AGAGGCAAGA CATTATTTCA ATGGTGCACG AGCTTGCTTG TCCCTCGACC CCTCCCATTC TGTCACCTAT GTGCAGTGTA TCTATCTCTA TGGTTCATAT CTAATGAATC GCGGGGAAGA CACTTCTGGC GGAGACACCT TTTGGCCTCT GTTGAGGATG GGTATGGCCG TATGCGAGGC TATTGGTCTT CATCGTGATG GCAGTAACTG GGGCCTGGAG GAAGACCAGG CGATGGTGCG GCGTATAGTT TTCTGGGAGA TTCATGGATA CGATGTCCTG CAAAGTACCG CGCTGGGGCG CGGTATGTGC ATATCAGATG CTTCGATTGA TTGCCAGGTG CCACTCGCTG AAACTGACTC TGGCTTTCAC GCCAGGGCGT ATGCTTTGAC CAAGATTTGG TCCCAGATCA ACGAAAAGCA AGTGCGCATC AAGCCATTCG TCTACTCTGA GGTCGCCGAA ATCGACAAGG CTTTGTGTGC ATTTCAAGAT GATCTACCAT ACCACCTGTC ACCATCTTTG CCACCCTCTC CAGATGACCT CACCAATCTC GCCAGACACA AGGAGGCTTT CCAGCGAAAT ACCCTCCTGT TATATATCAA TGAGGCGAAA CTTACATTGC ATCGAGGTTG GTTCATTCGA GCTCTTAGAG AATATCCAGA CGAGCCCCTT TCAAGCCCGT TTAAACATTC ATATCTATCC TGCTTGGAAG CCTGTAGAGG CGTCGTATTC CTTGTGCGGA ATATGCTCAT CCTGCACGGT CAACTAATTC ACCGACGATG GCACTTTTTC TTTCATCTCT TTGCGGCTTG TGTCTGCCTT GCGGCGGCGG TGATCCGAGC GCCGGCTTCT AGTTTGGTGC GGTCTATCCT TACTGAACTC GAATCTGGGA TCAATCTTTT TAAAATAGCT GATAGAGAAG AGCTTGTACG TCTTTGTGTT GCTTCATTTT TTTTTCCTCT TGACTCAATT GCTGATATTT GCGATAGATC ACCGTCAACA GACTTCGAGA AAAAGCAGTG CGGGCTATGC AGAACATTTC TTCTGCGTCT TTTAACGAAT TGGAGACGGA AACGGAGGAT TTGGACCTTC TGGGGGCGAG AACAACATTG AGGCGAGCTG CACCGAAAGT CTCAACGCAC GAGCAGTCAG GGGTTATACC CCTTACTGTC CCCCTATTTT ACGAAAATAC GTCTATACCA CCGAATGCCG AGAGAACAGA AACGATGACC GCCATGGTAA GTGATCTGTC AATAGGTAGA TTGTTAACTA AACAAATCAT TCAGCTTGAT GTCGATGATA TGAACCAGGG CAATCTCGAA TGGAATGATT TCGATATGGA TACTGTACGG CCCTTCTCTG CTATACCTTC ATTAGTCGAC TGCTAACGAC ATTGCAGTTT CTCAGGGAAA TTGGAGCAAT TTGACGAATT CCGGAGCTTG CATATAAAAA CAGGCTGGTC GTTATGTTTG AATATCATCC CGTGTTGAAT GTAATAAGTA TAAGCACTGC TGTAACCGCT GACAGTGGAC AGAACTTGAA CTTGCTGTAA CATCTTGACG CGCTGTTATT GGTGCCGGTG TATT
|
Protein sequence | MSTQSGRSTT SKSSSRKKPL SCAECRRLKL KCELVFPCRH CVKRGLASIC PEGELVNGSR RTKILASTED LHSRIAALEE ALKTATSSQH PLLQDSLYAN RKEIRQRRSS SSSPESSIQL PSDCLTSFSH LTLGDDPHFS RYYGAASSVY LSKQQSIPVS CERFVPSDES VAPRQPHAIS NDYTDLFPPF SSVHSQLDID GIFHQYLPPP EIGLTLAEVY FHTFGWFTNI VQRPSWNDVF FHHVYRESAS IRSNPVKPQR LAVVLLVFAL GALMDLSKPP HNEEARHYFN GARACLSLDP SHSVTYVQCI YLYGSYLMNR GEDTSGGDTF WPLLRMGMAV CEAIGLHRDG SNWGLEEDQA MVRRIVFWEI HGYDVLQSTA LGRGMCISDA SIDCQVPLAE TDSGFHARAY ALTKIWSQIN EKQVRIKPFV YSEVAEIDKA LCAFQDDLPY HLSPSLPPSP DDLTNLARHK EAFQRNTLLL YINEAKLTLH RGWFIRALRE YPDEPLSSPF KHSYLSCLEA CRGVVFLVRN MLILHGQLIH RRWHFFFHLF AACVCLAAAV IRAPASSLVR SILTELESGI NLFKIADREE LITVNRLREK AVRAMQNISS ASFNELETET EDLDLLGART TLRRAAPKVS THEQSGVIPL TVPLFYENTS IPPNAERTET MTAMLDVDDM NQGNLEWNDF DMDTFLREIG AI
|
| |