Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA00290 |
Symbol | |
ID | 3253946 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 94248 |
End bp | 98982 |
Gene Length | 4735 bp |
Protein Length | 815 aa |
Translation table | |
GC content | 50% |
IMG OID | 638252363 |
Product | conserved hypothetical protein |
Protein accession | XP_566463 |
Protein GI | 58258101 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTTATCTC AGGCGTTGTC TACCTGCAGG CTCAAAACCT TTCGAACCAG CGCAACAAGC AATAAAGCAA GCTTGATTGC AGATTGCAGC CCACTCCGAT CATCAACGAA AAAAGAAGTG GAAAAGTCCA CCTGCTTACG ACATATTACC CTTTCCAAAA ACGCGATCGA CCGCAAATTA ACGAGCGACA CCAACGCTTT GACCTTTCCA TCTCTCTTTC CCGTGTCGAT CCCAGCCCCC ATGATTGTAA CAGTGTGCGT CCACTAAAAA TTTCTAGTTG GCTCAACAGC CATACAACAG ACTATTTGCT AACTCAACAT CACCGCAGTT TGTGATCGAT TTTTCATATC GACGGTGATT TGACTCCTTT CAATCGCGTT GCCACCACCT CTCGTTATCC TCCCATCATT TTTATCTGCG GACGATCCAT TTGATCGTAC CTATCATCCA TCTTTTATCT CTTAATTCGG CCTTTTTTCC GGATTTTACA CCACCGCCGC CATATCAAGT CTCACATCAA CGGGCCGACT CTGGAGGCAG CAGAGCAGAG ACAGAGGGTC AAAAGGCAGA AAATCCAAGT AGTTCGCCAC CATAAGAAGG GAGAATCAGA TCTTATTGAT ATTTGCTATT GGTGCTCGGG ACTCTGTTCA GAGCGATCAA GGTGAGTCTC CCTTCTGTCG TCTGTCGTTT ATCGGTTGTT CGGCTCTGGT TGGATGGTGG GTGCGCGTGT TGTGTTTGTT TATGTGTGTA TGTGCTGGAA GCATGCTCAA TTTTATTGAT CATCACTCTG GGTGCATCAG TGATACTTGC TGTCCTTTTA CACTCCGAAA GGCGAGCACC CATGGCTTGA AGAAGTCATC GGGTGTTAGA AAGTTGGGAC AGCCTTTGTG CTTGTCCGAA GGGCGATAGG AAGCCGACTT CAGTATCTGA TATCGAAAAC GGAAGTCGAC TCCACCCCAA TGGAAGGCCC GTGTATTTAA CGATCAATGA CTTTCATCAT CCTGCGACGT CCTAGTCCTG TCTTAGTCTG ACTAAAAGCG GGGGGACCAT ATGAGCGACA CTAAACCGTG ACACGGTCAT GGGTCCTTGA CAGCCTGGAG CAGGTACCAC CGATTGACTA GCTGGGTATT TCCGGCCGTA TCATCGGCTG TGCTTGTCTC TCACGACTCG TATCCGCCTG CGATTTCGAT GTGCCATCGT CGCGCAGAAC ACCCAAAGCG GAGCGCAAGG ATCGGCAGCA TTAGTGGCTG TGTGGAGTGA TGTGAGGAAG ACAAATGGCG ATTGTTCTCC ACAGCCTATG ACGACCGCAG CACATCTCAA CATCAGTTCG TTTGGGCTCT GTCGTTTGTC TTGGTCTACT GTTCTGTATT GCTGGGTGTG ACGAAAATTA AAGCACGGTC GCTGACTTAG AGTATCCATA TTTACTCGAA CCCTCTAATA TTTTATCTGC CTCGCTACTA CGCCTTTCAT TTTCACTACT GTCCACTGCA ATTTTTTTGC TACGATTTTT TGATAATCGC CAACATCGCC CATTGCAATC ACCACTGCCT GTCTCAACAA AACTTCCTCG TGCCCTTGCC TGGTTATTGG CCAATCCCAT CATCTGGTCT TATGTACGCG CCGTCGATCC TCTGGGCTGT CGTACGGATC TTGACCATCT TCGCTATCGG TACAAACTCG CGGTTGCCAA TCCCTCCTTC AACGATCTTT CTCCGGTGGA AACCCACTTC CCGCGTACCT CAACGGCTGC TTCTGTTTCA CCCACTCCAA TCATCCTCGA TCCCCTTACC GCATTGAACC TCCTTTGCGT CTTTCATTCC CCCCGGCGCG CCTCGCCCAT TAACGACTCC TTTTGCATTG TAAATCGTAC ACCTTCGAAA TATCATGTCT CCTTCGCCGC ACCTCCATAA TCTCTCCCCT CGGTTTTGTC CACTGCCGAC TCTATTTTCT GACTATTCAG TAAGCCCAGC GTTGAAGTTC GAGCACAGAG TGCCTAGCAG CTCCCTCGGT ATGGTCTCTG TGGAACAGTA CAGATCTCAA CGAACATCAT ATTACCCTCC GCCTCATTCG CCCACATACG TATCACCCAC CAGTACGGCA TCGCCTGTCT CAGGTGCTCC CATTTTTACT TTCCCATTCC AACAACCAGA CATCTCGTCC CCAGTAGGCA AGATGGCTCG ACGTCCATCC CGCGCTGATC CTATGGTAAG AACGGAGGAT GACATGTATG AAGAGGATGA TGGCTACGGG GAGCTTGGAC AGAGACATCC TTTGAGACGA GGGAAGGAAG ATATGAGGGA GGAAATAGAG GCGAGACCTG GACACGAAGT GAATCTGACT CGGATTCGAG AACGCGGAGA AGGTATGAGC CTGCCTGGAA TCAAGACCTT GCTTGGCGTA AAGGGTAAGC GATGGCGATT GAGATAAGAT TGGGTTCCTG CGCTAAGGTC TTGTATAGAA CATCCGTCCG GATCATCGTC TCTCTATCAT TCACCATCGC TCCCGTCATT GGAAACAAAT TCTCCTACAA CTTCTCCTTC TTCCGCACGC ACTTCTAGGT TTTCTTCTTT TACATCGTCC ACTGTACCCG AGCTGTCAGC TCCCGGGTGG TGGGCACCTG AATTCGAGAG AAGCCCCTTC CACGCCGTCC CCTCCCGCTC CGACTCCTTT TCCAGCGCAC AACTTCACAT CGTTGACGAA CACGATCAAA AACGTCGTCG TTCGGACGTT CCTCCTCTTC GTGATGTTGA GGAGTCTGCC AGATTAAGAT GGCAGGCTCA AAGCAGAAAC GCATCCTTCC CTTCCTCATC ACCACATTCC TCTGGTCGTA CTACACCCAC CAGCTGGTCA TCAATGAGAA ACCGCCTGCA TCCTCCAATT TCTTCGTCCC CTAGTGTGGC GACCGTCATG GGTCGGGGAT CAATATCAAG TACATCTGGC GCAATGTCAC CGCCCATCAC TCGTCGAGCT AGTCCAAGTT CGAGAAATCC AAGTTTGGTG GGTGGACAAC TATCAAGACA TTTTGCAGAC CTTTCAGCTA CAGACAGCCA GCGGGGTTCA ATATCTGGTG CAGCTGGCCC GCCAGAAAGA CGAATATCTG TCCAAGCACC TTCAAGTACC ACCCCCATCG ACCTAGATCG TGCACCTATC TTGCCCCCAT TAATATCTCC TGAAAATGAA AGGCCCTCTA TGCCATCTGC GTCTTTTTCT CTCCCATCTA TACGGCGGTC TTCGTCTACA TCTTCGGACT GCCTACGGCG GCATTCCAAC ACTCAACCTA CAACGCCTGA TACGACAGGG CAGCCTGAAG TTCGTCGCTC ATCACTGACG GAGATCATCA TGGCGAGGAG TGGGGATCAT ATTGCGATGA AAGAAGGGCG GTATGCTTTC TCAACTGAAG AGCGGCATGG GAGCTTGGGA ATGGAGAAAC GTGCGGAAAC TGCACTTGGT CTTGCGCCTA TCCCGCTTCA ATCCAAAACT TCAGTTCAGT CACTTGCTGG AACGAGCAGC GATACGCCCG CCTGGAACCC TCAAGGAAGA CGAGAGTCTA CCGAATCAAT ATCGTCCGCC ACAGCTCATC TCGCTATCAA CTCTGAAGCT GAACGCGAGC GTACTGCATC TCTTCGGGGT CGAAAGCGAT CGGCAGACAT GCGCGATGAT AGTGAGCCCA GGGCAGATCC GTCTCTTGTT GGAGTCGGTG TCAGTGTCAA TGTTGGTGCT GGGGATCCAG GTTTACGAGG GATGGAAGTA TTGGCAGAAT CTGCAAGGAG AATTGCTGCT GCAGAAGAGG AACAAAAGGC GAAGAATGTG GAGGAAGAAG AGGTTGAAGA GGCTCAGGAG AAGGATGAGG TGCCAGAGAA GACTGGAGGG CCCAAGTATA CTTGTACTTA TTGTGTCAAG ACGTTCTCAA GGCCTAGCTC CCTGAAGATA CATACCTACA GCCGTGAGTT CCTTTCGTTT GGTAGGCCAG AATTAGTGAC ATTGGCGCTG ACGACTTGTA GACACTGGTG AAAGGCCGTA TATCTGTAAC GAAGCTGGGT GCGGGCGCCG TTTCTCTGTG CAATCCAACC TCAAGCGTCA CGCCAAGGTG CATCAAGTTG GACCATTAGG TGTTAGCGCT CCTGAACCCT CAGTCCCTCC TACAAAAGCT TCCAAACAAC CTACACATCC TTCCAAACCT AATGAACATT CATCTCCACA CCCGCCCCCA TTACACCATC ATCAATCGCA TCATCGAATG GCTCAGCCTC TCCCGCCCCC TGTGATGTCC GGCCCTATGC CTCCTCCAGG GGGCTATCCT TTCTTCCCTC CGAGTTATCC GCCCTTATCA ATGAATGCCA TGCCCGGCCC GCCTCCACCG GGGAGCACTC CCGGAATGGC GCCACCGCCG GGGTATTATA TAGATGGAAG GTATGCGGTC GCTCCACCTC CGCCAACTAA TGGGATGCCT CATGGAGGAT ATGTGCAGTA TGATATGCCA GTGCCAACGC ACGATGGGAA GGATGAAAGT GGAAGAGATA ATGGGAAGGG AAAGGAGAAG AGCAGAAAGA GTAATTCGAA GGAAGAGTAG AAAAATTTCA TCAAATAGAG GAAATCCGTG TTTTAGATCC TCTGGAGCTC CAAGGTGTAG ACGATCATTG TATAACATTT AGTGGTTTCG TTATTCATTT GTCTGTTTGG GCCATAGTAT ATATCCCCCC GTATAGAATG ACAAGTTTAC GCCAAGCCTT CTGTA
|
Protein sequence | MVSVEQYRSQ RTSYYPPPHS PTYVSPTSTA SPVSGAPIFT FPFQQPDISS PVGKMARRPS RADPMVRTED DMYEEDDGYG ELGQRHPLRR GKEDMREEIE ARPGHEVNLT RIRERGEGMS LPGIKTLLGV KEHPSGSSSL YHSPSLPSLE TNSPTTSPSS ARTSRFSSFT SSTVPELSAP GWWAPEFERS PFHAVPSRSD SFSSAQLHIV DEHDQKRRRS DVPPLRDVEE SARLRWQAQS RNASFPSSSP HSSGRTTPTS WSSMRNRLHP PISSSPSVAT VMGRGSISST SGAMSPPITR RASPSSRNPS LVGGQLSRHF ADLSATDSQR GSISGAAGPP ERRISVQAPS STTPIDLDRA PILPPLISPE NERPSMPSAS FSLPSIRRSS STSSDCLRRH SNTQPTTPDT TGQPEVRRSS LTEIIMARSG DHIAMKEGRY AFSTEERHGS LGMEKRAETA LGLAPIPLQS KTSVQSLAGT SSDTPAWNPQ GRRESTESIS SATAHLAINS EAERERTASL RGRKRSADMR DDSEPRADPS LVGVGVSVNV GAGDPGLRGM EVLAESARRI AAAEEEQKAK NVEEEEVEEA QEKDEVPEKT GGPKYTCTYC VKTFSRPSSL KIHTYSHTGE RPYICNEAGC GRRFSVQSNL KRHAKVHQVG PLGVSAPEPS VPPTKASKQP THPSKPNEHS SPHPPPLHHH QSHHRMAQPL PPPVMSGPMP PPGGYPFFPP SYPPLSMNAM PGPPPPGSTP GMAPPPGYYI DGRYAVAPPP PTNGMPHGGY VQYDMPVPTH DGKDESGRDN GKGKEKSRKS NSKEE
|
| |