Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA07420 |
Symbol | |
ID | 3253944 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 2032686 |
End bp | 2035631 |
Gene Length | 2946 bp |
Protein Length | 859 aa |
Translation table | |
GC content | 49% |
IMG OID | 638253066 |
Product | hypothetical protein |
Protein accession | XP_567058 |
Protein GI | 58259291 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.592915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTTG GAGATAAGAA CCAGGGATCA TCCAAATCTC AAGCCAAACA AAAGGGCACG AAGGGCAAAA ATGCTCAACC TAGACTCAAA TCAAATCAGC TCAAGCGGTT AAAGATTAAT GAGGAGCTCA AGGAGTTGCA GAGCCGTGTG GATAACTTTG TGAGCGATAG ACATCTGGCT GCGACGTATG AGATGTATCG CTGACCTTGG ACATCAATTA CAGGTGCCGC CATCCGAGAT AACATTGTTC TCAGAGCTTC CCATGTCTTC AAAGACACAA AAAGGTTGGT AATCTGATGT TTATATTCAG TGACTGACTT GACTCTCAAT ATAGGCTTGA AGAGTAGCCA TTTCCTTAAT CCTACACCTA TCCAGTCACT TGCCATCCCC CCTGCCCTCC AAGCCCGTGA CATTCTTGGC TCAGCCAAGA CCGGTTCCGG TAAAACCCTC GCATTCCTTA TCCCACTACT CGAACGTCTT TATCTGGAAA AGTGGGGACC TATGGACGGT CTCGGAGCGG TGGTTATCTC GCCTACAAGA GAGCTGGCTG TTCAGACGTT TATGCAGCTT AGGGATATAG GAAAGTACCA CAACTTCTCT GCGGGTCTGG TCATCGGTGG TAAGCCCTTG AAAGAAGAGC AGGAAAGATT AGGGAGGATG AACATTTTAA TTGCAACACC TGGGAGGCTG TTGCAGCATT TGGATAGTAC AGTCGGGTTT GACTCATCAG CAGTCAAAGT CCTTGGTTGG TATAAATTAT ATTTCCCTTT CTCTAATGCT AACAACGTAC CTAGTCTTGG ATGAAGCTGA TCGACTTCTC GACCTCGGTT TCCTTCCCGC ACTCAAGGCT ATCGTTTCCC ACTTCTCTCC TGTTCAAACG GCTCCTGGTT CCCGTCCTTC TCGCCAAACC CTATTGTTCT CCGCTACCCA ATCCAAAGAC CTCGCCGCCC TCGCCAAACT CTCATTGTAC GAGCCTCTTT ATATCAGCTG TAATAAACCT GGAGAAGAAG GTGTGATGCC CGCCAATTTG GAGCAATACT ATGCTGTAGT ACCCCTGGAG CGGAAGTTGG ACGCCCTTTG GGGATTTGTG AAGAGTCATT TGAAGATGAA GGGTATTGTG TTTGTCACTA GTGGTAAACA GGCACGTCGA GTTCTTGTTA TGATAAAATA TCCTTTGCTG ATTGAATTGT AGGTTCGATT CATCTTCGAA ACTTTCAGGC GTCTCCACCC CGGTCTCCCA CTCATGCACC TCCACGGCAA GCAAAAGCAA CCTACTCGTC TCGACATTTT CCAACGATTT TCCTCCTCCA AATCAGCTCT TCTCATCTGT ACCGACGTTG CAGCCCGTGG TCTCGACTTC CCTGCCGTAG ACTGGGTTAT CCAACTTGAC TGTCCTGACG ATGTCGACAC CTACATCCAT CGAGTTGGAC GTACGGCGAG GTACCAGAGT GCAGGTACAG CTTTGACTAT TCTCTGCCCG AGCGAAGAGG AGGGAATGAA GACCAGGTGG GGAGAGAAAG CGATCGAGGT CAAGAGGATA AAGATCAAGG AGGGTAAAAT GGGCAACTTG AAGCAGTCTA TGCAGAACTT TGCGTTCAAG GAACCCGAAA TCAAATATCT TGGACAACGT GTAAGTCATA CCCCTTACGG TCTCTTTGCT TTGCTAAAAT CCTTTGGTAC TAGGCTTTCA TCTCATACAT GAAATCTGTC CACATCCAGA AAGATAAATC CATCTTCAAG ATTGACGCAC TTCCCGCTGA AGCCTTTGCC GAGTCTATGG GTCTTCCTGG TGCCCCTCAA ATTAAATTGG GCAACCAGAA GGCTGCCAAG GTGCGAGGAC CGAGCAAGGA GGAATTGGCC AGGAAAGCAG AGAAGGAAGA AGAAGAGGAG GAAGAGAGAG CGGTTGTGGG CAGCGACGAG GAAAGTGAGG AGGATGAAAG CGAAGGGTTA GGGTCTGAGG ACGAGGAAGA AGAGATCGAC GATGAGGCAG AAGAGATCGA CGATGAGGAA GAGAGCGGTG AGGAAAGCGG TAGCGACGAG GAAACTGAAG AGGAGAAAGA CGCTTCCAAG GTAAGTCATC TGAATACCCG CTTTCCATAA TCAGTACGTA AAATTAACAT TCTATTCAGC CTAAAGCGCC TGCCGTCCGT ACCAAATACG ACCGCATGTT TGAACGCAAG AACCAGTCTA TCCTCACACC CCATTACACT GCCCTCATCG CTCACGATGC TGACAACGCC GCTGGAGCCG GAGAAGCCGA TGATGACGAT GACGTCTTCA CCCTTGCCCG TCGAGATCAC AACCTGTCCG ATGACGAGGA GGCCGATACC GACGCTATTC TCGGCGCTGA GGCTCTTGCT GCGGAATTGA AGAAACCTCT CATCACATCT GAGGACTTGT CTAAGCGAAA GCTCAAGGCA GCCGCGTCTA AGAAGGGTCT GTTGAAGAGC AGGCCTGGTC CTGAAAAGGT GCTGTTCGAT GAAGAGACTG GTGAGGCTCG AGAGTTCTAC AAGAGCGGCG TGGATGTGGA GAAAGAAATG AGCGCGGCGG ATAAGAGGAG GGAGTACTTG GAGAAAGAGA GAGAAATTAT GAAGATCCAG GACAAGATCG ACAAGGAGGT CGCAAGGGAG AAGAAAAAAG AGTTGAAAAG GAAGAGAAAG GAGAGGGAGA GAGAAGTGAG TGGTCTGTTT CCATCTGCTA AAACTCCCAC TGATGTCTTG GTAGCTACGA CAGATGGAAA TGGGTGATGA GCCCGTCGCT TACCTTGGTG GAGATGATGA CTACGCCTCT GCTGATGAGG GTCGATCACT TTCACCTTCT CCTGCGCCCT CTTTAGAACC CGAGCGACAT GCTAAGAAGC AACGACGAGG AGGAGTGCAG GAGTCTGGGG CTGGCGATTT GGAGGATGAA GAAGCCCTCG CACTCAGGTT ATTGCAAGGA TCATAG
|
Protein sequence | MALGDKNQGS SKSQAKQKGT KGKNAQPRLK SNQLKRLKIN EELKELQSRV DNFVPPSEIT LFSELPMSSK TQKGLKSSHF LNPTPIQSLA IPPALQARDI LGSAKTGSGK TLAFLIPLLE RLYLEKWGPM DGLGAVVISP TRELAVQTFM QLRDIGKYHN FSAGLVIGGK PLKEEQERLG RMNILIATPG RLLQHLDSTV GFDSSAVKVL VLDEADRLLD LGFLPALKAI VSHFSPVQTA PGSRPSRQTL LFSATQSKDL AALAKLSLYE PLYISCNKPG EEGVMPANLE QYYAVVPLER KLDALWGFVK SHLKMKGIVF VTSGKQARRV RFIFETFRRL HPGLPLMHLH GKQKQPTRLD IFQRFSSSKS ALLICTDVAA RGLDFPAVDW VIQLDCPDDV DTYIHRVGRT ARYQSAGTAL TILCPSEEEG MKTRWGEKAI EVKRIKIKEG KMGNLKQSMQ NFAFKEPEIK YLGQRAFISY MKSVHIQKDK SIFKIDALPA EAFAESMGLP GAPQIKLGNQ KAAKVRGPSK EELARKAEKE EEEEEERAVV GSDEESEEDE SEGLGSEDEE EEIDDEAEEI DDEEESGEES GSDEETEEEK DASKPKAPAV RTKYDRMFER KNQSILTPHY TALIAHDADN AAGAGEADDD DDVFTLARRD HNLSDDEEAD TDAILGAEAL AAELKKPLIT SEDLSKRKLK AAASKKGLLK SRPGPEKVLF DEETGEAREF YKSGVDVEKE MSAADKRREY LEKEREIMKI QDKIDKEVAR EKKKELKRKR KERERELRQM EMGDEPVAYL GGDDDYASAD EGRSLSPSPA PSLEPERHAK KQRRGGVQES GAGDLEDEEA LALRLLQGS
|
| |