Gene CNA07420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA07420 
Symbol 
ID3253944 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp2032686 
End bp2035631 
Gene Length2946 bp 
Protein Length859 aa 
Translation table 
GC content49% 
IMG OID638253066 
Producthypothetical protein 
Protein accessionXP_567058 
Protein GI58259291 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.592915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTG GAGATAAGAA CCAGGGATCA TCCAAATCTC AAGCCAAACA AAAGGGCACG 
AAGGGCAAAA ATGCTCAACC TAGACTCAAA TCAAATCAGC TCAAGCGGTT AAAGATTAAT
GAGGAGCTCA AGGAGTTGCA GAGCCGTGTG GATAACTTTG TGAGCGATAG ACATCTGGCT
GCGACGTATG AGATGTATCG CTGACCTTGG ACATCAATTA CAGGTGCCGC CATCCGAGAT
AACATTGTTC TCAGAGCTTC CCATGTCTTC AAAGACACAA AAAGGTTGGT AATCTGATGT
TTATATTCAG TGACTGACTT GACTCTCAAT ATAGGCTTGA AGAGTAGCCA TTTCCTTAAT
CCTACACCTA TCCAGTCACT TGCCATCCCC CCTGCCCTCC AAGCCCGTGA CATTCTTGGC
TCAGCCAAGA CCGGTTCCGG TAAAACCCTC GCATTCCTTA TCCCACTACT CGAACGTCTT
TATCTGGAAA AGTGGGGACC TATGGACGGT CTCGGAGCGG TGGTTATCTC GCCTACAAGA
GAGCTGGCTG TTCAGACGTT TATGCAGCTT AGGGATATAG GAAAGTACCA CAACTTCTCT
GCGGGTCTGG TCATCGGTGG TAAGCCCTTG AAAGAAGAGC AGGAAAGATT AGGGAGGATG
AACATTTTAA TTGCAACACC TGGGAGGCTG TTGCAGCATT TGGATAGTAC AGTCGGGTTT
GACTCATCAG CAGTCAAAGT CCTTGGTTGG TATAAATTAT ATTTCCCTTT CTCTAATGCT
AACAACGTAC CTAGTCTTGG ATGAAGCTGA TCGACTTCTC GACCTCGGTT TCCTTCCCGC
ACTCAAGGCT ATCGTTTCCC ACTTCTCTCC TGTTCAAACG GCTCCTGGTT CCCGTCCTTC
TCGCCAAACC CTATTGTTCT CCGCTACCCA ATCCAAAGAC CTCGCCGCCC TCGCCAAACT
CTCATTGTAC GAGCCTCTTT ATATCAGCTG TAATAAACCT GGAGAAGAAG GTGTGATGCC
CGCCAATTTG GAGCAATACT ATGCTGTAGT ACCCCTGGAG CGGAAGTTGG ACGCCCTTTG
GGGATTTGTG AAGAGTCATT TGAAGATGAA GGGTATTGTG TTTGTCACTA GTGGTAAACA
GGCACGTCGA GTTCTTGTTA TGATAAAATA TCCTTTGCTG ATTGAATTGT AGGTTCGATT
CATCTTCGAA ACTTTCAGGC GTCTCCACCC CGGTCTCCCA CTCATGCACC TCCACGGCAA
GCAAAAGCAA CCTACTCGTC TCGACATTTT CCAACGATTT TCCTCCTCCA AATCAGCTCT
TCTCATCTGT ACCGACGTTG CAGCCCGTGG TCTCGACTTC CCTGCCGTAG ACTGGGTTAT
CCAACTTGAC TGTCCTGACG ATGTCGACAC CTACATCCAT CGAGTTGGAC GTACGGCGAG
GTACCAGAGT GCAGGTACAG CTTTGACTAT TCTCTGCCCG AGCGAAGAGG AGGGAATGAA
GACCAGGTGG GGAGAGAAAG CGATCGAGGT CAAGAGGATA AAGATCAAGG AGGGTAAAAT
GGGCAACTTG AAGCAGTCTA TGCAGAACTT TGCGTTCAAG GAACCCGAAA TCAAATATCT
TGGACAACGT GTAAGTCATA CCCCTTACGG TCTCTTTGCT TTGCTAAAAT CCTTTGGTAC
TAGGCTTTCA TCTCATACAT GAAATCTGTC CACATCCAGA AAGATAAATC CATCTTCAAG
ATTGACGCAC TTCCCGCTGA AGCCTTTGCC GAGTCTATGG GTCTTCCTGG TGCCCCTCAA
ATTAAATTGG GCAACCAGAA GGCTGCCAAG GTGCGAGGAC CGAGCAAGGA GGAATTGGCC
AGGAAAGCAG AGAAGGAAGA AGAAGAGGAG GAAGAGAGAG CGGTTGTGGG CAGCGACGAG
GAAAGTGAGG AGGATGAAAG CGAAGGGTTA GGGTCTGAGG ACGAGGAAGA AGAGATCGAC
GATGAGGCAG AAGAGATCGA CGATGAGGAA GAGAGCGGTG AGGAAAGCGG TAGCGACGAG
GAAACTGAAG AGGAGAAAGA CGCTTCCAAG GTAAGTCATC TGAATACCCG CTTTCCATAA
TCAGTACGTA AAATTAACAT TCTATTCAGC CTAAAGCGCC TGCCGTCCGT ACCAAATACG
ACCGCATGTT TGAACGCAAG AACCAGTCTA TCCTCACACC CCATTACACT GCCCTCATCG
CTCACGATGC TGACAACGCC GCTGGAGCCG GAGAAGCCGA TGATGACGAT GACGTCTTCA
CCCTTGCCCG TCGAGATCAC AACCTGTCCG ATGACGAGGA GGCCGATACC GACGCTATTC
TCGGCGCTGA GGCTCTTGCT GCGGAATTGA AGAAACCTCT CATCACATCT GAGGACTTGT
CTAAGCGAAA GCTCAAGGCA GCCGCGTCTA AGAAGGGTCT GTTGAAGAGC AGGCCTGGTC
CTGAAAAGGT GCTGTTCGAT GAAGAGACTG GTGAGGCTCG AGAGTTCTAC AAGAGCGGCG
TGGATGTGGA GAAAGAAATG AGCGCGGCGG ATAAGAGGAG GGAGTACTTG GAGAAAGAGA
GAGAAATTAT GAAGATCCAG GACAAGATCG ACAAGGAGGT CGCAAGGGAG AAGAAAAAAG
AGTTGAAAAG GAAGAGAAAG GAGAGGGAGA GAGAAGTGAG TGGTCTGTTT CCATCTGCTA
AAACTCCCAC TGATGTCTTG GTAGCTACGA CAGATGGAAA TGGGTGATGA GCCCGTCGCT
TACCTTGGTG GAGATGATGA CTACGCCTCT GCTGATGAGG GTCGATCACT TTCACCTTCT
CCTGCGCCCT CTTTAGAACC CGAGCGACAT GCTAAGAAGC AACGACGAGG AGGAGTGCAG
GAGTCTGGGG CTGGCGATTT GGAGGATGAA GAAGCCCTCG CACTCAGGTT ATTGCAAGGA
TCATAG
 
Protein sequence
MALGDKNQGS SKSQAKQKGT KGKNAQPRLK SNQLKRLKIN EELKELQSRV DNFVPPSEIT 
LFSELPMSSK TQKGLKSSHF LNPTPIQSLA IPPALQARDI LGSAKTGSGK TLAFLIPLLE
RLYLEKWGPM DGLGAVVISP TRELAVQTFM QLRDIGKYHN FSAGLVIGGK PLKEEQERLG
RMNILIATPG RLLQHLDSTV GFDSSAVKVL VLDEADRLLD LGFLPALKAI VSHFSPVQTA
PGSRPSRQTL LFSATQSKDL AALAKLSLYE PLYISCNKPG EEGVMPANLE QYYAVVPLER
KLDALWGFVK SHLKMKGIVF VTSGKQARRV RFIFETFRRL HPGLPLMHLH GKQKQPTRLD
IFQRFSSSKS ALLICTDVAA RGLDFPAVDW VIQLDCPDDV DTYIHRVGRT ARYQSAGTAL
TILCPSEEEG MKTRWGEKAI EVKRIKIKEG KMGNLKQSMQ NFAFKEPEIK YLGQRAFISY
MKSVHIQKDK SIFKIDALPA EAFAESMGLP GAPQIKLGNQ KAAKVRGPSK EELARKAEKE
EEEEEERAVV GSDEESEEDE SEGLGSEDEE EEIDDEAEEI DDEEESGEES GSDEETEEEK
DASKPKAPAV RTKYDRMFER KNQSILTPHY TALIAHDADN AAGAGEADDD DDVFTLARRD
HNLSDDEEAD TDAILGAEAL AAELKKPLIT SEDLSKRKLK AAASKKGLLK SRPGPEKVLF
DEETGEAREF YKSGVDVEKE MSAADKRREY LEKEREIMKI QDKIDKEVAR EKKKELKRKR
KERERELRQM EMGDEPVAYL GGDDDYASAD EGRSLSPSPA PSLEPERHAK KQRRGGVQES
GAGDLEDEEA LALRLLQGS