Gene CNA06230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA06230 
Symbol 
ID3254007 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1695605 
End bp1698589 
Gene Length2985 bp 
Protein Length900 aa 
Translation table 
GC content50% 
IMG OID638252944 
Producthypothetical protein 
Protein accessionXP_566972 
Protein GI58259119 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTCTGCAAA CATCAACTTT CAAAGGTCAT GTTGAGACCC AGCATAGTCA GAAAGGTAGC 
CAGGATCAGG CATGCCAGAC TTGCTTGCCA AAATTTACAC ACGTAAGTTG GCCAACGCTC
ATTACTCCCG CCAGCTGATG TTTGAGCAGA CAAGATACAT ATCCTCCTTC CTTCCATTCC
TTGTCGGAGA AGAGTACCAC GCAAACCTCA TCTGAAAGAA ATTTCCCTCT CTATAAACCC
CAGAAGCATG CCCAAGAAAT ACCTCTAACT GCAGACACTC TTGCAGCTCT TCCTTCCGAA
CAAGCTCTCT CCCGACAACT CAAACGGCGT GTCACCCAAA AAGCAGTCGA AAATCTTCCG
GAATTCACCG AGGATGAGTT GAGGGATTTC TACGCAGCCT TGGTTAAATC CGGATCCAAG
GACAATGGCG GGATATATCA AGCGCTGGAA GCGCCTTCAG AGACAAAGAA AATCCCTATG
CCGAAAAAAG AGAGGGAAGA AATCTTAACG GGCATGGCGG GAAGGCTGGT TGGGGAGGGG
GGGCTTGCCC TGCCAAACGG TTCTACGCGA TATGGTAAAT CTTTGGAGCT TGTTGTAAAT
GAAAACCAGT CTTCCGATGC TCCTTACGCC ATTCTTGATG TCTTGTCCCA AGCAGCCATA
TTGAGTGGTC CGAGTGAGAC AAGCAAAGGG AAGGCAATGG AAGGTCAACT GCCCCTTAGA
AGCCTTGAAG TACCTTTGGG ATTGGTCACG TCCAAAGAAT GGGATGCTCT TTTCCAGGAA
TTTGTAAGCC TTTTCTGGAT CAAATATGAT ATACCTGCTC ACTATTCGTA GATACGGAAA
GGAGATGCTG CAGCTGCCGA GTCTCTTTTG GACATTATGG ATGTAAGCGA CATTGAGATA
TGTGAGAGTA ACCCACACTG ACAGTCATTC GGTAGGCCCA TGGTGTCCCA GTCGGTATCG
AACGTATAAA CAGCGTCATA CATGTCTATG CTCTTCAAGG TCGTTCTGAA GATGTGGCTC
GTTTAACCTT GGATGTCACT GCAAGTAAGT TATATCTTTT CCGTCTCGAT CATCACCATA
ATCTTATAAT GCGACATAGT GGGCGAAACT GTCTCGTCAG ATCAGCAAGA TGATTATATT
CTGTCTATCC TCCGGCAAAC ACCATCTAAT CCCGGTCCGG CGATTGCCCG CCTCCGTCAA
GCTGAAATAG CTGGCACTCC TTTCCCCCAA TCGTCCTATC AAGTCGTCAT CTCACATCTC
ACTTCACCGT CTTCCATCGT CCAACCAAAC TCTCATACTC GGGCAGTTGC TTGGGACTTG
TTTGCCCATA TGCGTCTAAC TGCGCATCCT ACTCCGACGC GGGAGCTGTA TACGACAATG
ATCAAGTCAT GTGGTGAAGC CAGTCAGCCT GAACCCGAGC GAGCGAGGGA TCTGTGGATT
GAAATGACAG AGCAGGAGAA AATTGAGCCT AGCACGGAAG CTTATAACGC GATCATCAAA
TCTCTAGGAA GTACGAAGAA GGACTACCTC GAAGCATTTG ACTTGCTTCG GCAGATGCTG
GTCAAGCACA ATGAGGCAGT ATTCGTCCCT TTTGAAGAAG AAGACGGTAT CCCAAGGTTC
AGCGAGTATG TCCCAACGCT AGAAACGTTT GCGTCGGTAC TAGAAGGGAC GAAGCGGGCA
GGAGACGTCA ATCGAGCCAG ATGGGTCTTA TCAGAGGTCA TTAAACTATC AAGAGCCGGC
CAGGCCCTGA ACGTGCCAAA GTGGAAGGAA GGGCCTAGTG GTGATTTGAT GGCGGCTGTG
TTCATGACTT ATGCAGCTTG GAATCCAGTG CTGCGCAAGA GAGAGGTCAA AATGAAGCAA
GCTTCGGAGA AGCCATATGG AGCAGCGTCC GAAACGGCGG TCGTCGAGAA GGAAGAGCAT
AGGCTTTCGA AGGGCGAAGA GTTCCTCAAC GTGGATGTTC TGCAAGACGT GGTTGAGCCT
TCTGCCGGCT CTTCGTCGAT TGAGTATACT CTTGAAGAAC CGCCTGTCAA TGATTTTTTC
ACCCCGCTGT CAAGCGCTGA CGCGATCCGC GAGTGCACGA GTCTTTTCCA CCGCATCCTC
TCCGATATCC AGTCTTCTTC CTCCACCTCT GATTTCCTAC CTTTTCGCCA CGTATATATT
ACGCCAAAGC TCATCAATTC GTATATCTCT GTGTACAATG CTCATGGCCC GTCCCTGTCA
TCTGTGAAAG ATGTGTACGA TACCGTGTGG GAAGAAGCGT CGAGCGTCAC TGGTCGTTCC
GTCAGACCCA ACGCCTGGAC TTTTATCCCT CTGCTCGAGA AATGTGCTTC AGGATCTCGA
GGGGGTATAT ACCCGGCCGA CCGATCCCTA GCTCTCTCCT GGGGACGTGC CCTGTGGTCG
TCATACCTCG CATTCTTCAA ATCCGCTTCT TCTGAACTCG CTTCGATCCC ATCCTCTGAA
CACCTCACTA TCCAGCGTCG CAAGTACCTC TTCGGTTTAG GCGACAGGCA GACTGAAAAA
ATGTGGAAGG CTGTTATCAA ACTCGAGGCT TTGCACGGCG ACGTTGATGA AGCAATGAGG
TTGCTTGAGG AATTCCATGC GACGTATAAG CCCAGTGCTA TCACACAATT GTACGCACCG
TTGCCGGAAG TGGGCCTGCA GCTGAAAATG TTCACACCGG CCATGACGCC CGAGGCAGAT
GTACCTCCTC TCTTGTTGTT TGATGATATC AAGCCTCTGC ATCAGCGGTT AGTGAGGGAA
GCAAGGGCCA AGGATATTAA AAAGGTCAAG TGGATAGCGG CGGGTTACGA AAAGTCGTTG
AAGACGAGGA AGGGATGGAG GATGAAGGGT GTGGGACAAC AGAGAGAAAA GTCGAGGGCA
AAAGATTTAG AACGGCTTCT CGCCCAGCAA GGCGAGATGG AGAGGCTCGA GTGAGGGAAA
AGACAAGCTG AATACCCTTA TTTGTATATA CCCTGCACAA GAATA
 
Protein sequence
MLRPSIVRKV ARIRHARLAC QNLHTQDTYP PSFHSLSEKS TTQTSSERNF PLYKPQKHAQ 
EIPLTADTLA ALPSEQALSR QLKRRVTQKA VENLPEFTED ELRDFYAALV KSGSKDNGGI
YQALEAPSET KKIPMPKKER EEILTGMAGR LVGEGGLALP NGSTRYGKSL ELVVNENQSS
DAPYAILDVL SQAAILSGPS ETSKGKAMEG QLPLRSLEVP LGLVTSKEWD ALFQEFIRKG
DAAAAESLLD IMDAHGVPVG IERINSVIHV YALQGRSEDV ARLTLDVTAM GETVSSDQQD
DYILSILRQT PSNPGPAIAR LRQAEIAGTP FPQSSYQVVI SHLTSPSSIV QPNSHTRAVA
WDLFAHMRLT AHPTPTRELY TTMIKSCGEA SQPEPERARD LWIEMTEQEK IEPSTEAYNA
IIKSLGSTKK DYLEAFDLLR QMLVKHNEAV FVPFEEEDGI PRFSEYVPTL ETFASVLEGT
KRAGDVNRAR WVLSEVIKLS RAGQALNVPK WKEGPSGDLM AAVFMTYAAW NPVLRKREVK
MKQASEKPYG AASETAVVEK EEHRLSKGEE FLNVDVLQDV VEPSAGSSSI EYTLEEPPVN
DFFTPLSSAD AIRECTSLFH RILSDIQSSS STSDFLPFRH VYITPKLINS YISVYNAHGP
SLSSVKDVYD TVWEEASSVT GRSVRPNAWT FIPLLEKCAS GSRGGIYPAD RSLALSWGRA
LWSSYLAFFK SASSELASIP SSEHLTIQRR KYLFGLGDRQ TEKMWKAVIK LEALHGDVDE
AMRLLEEFHA TYKPSAITQL YAPLPEVGLQ LKMFTPAMTP EADVPPLLLF DDIKPLHQRL
VREARAKDIK KVKWIAAGYE KSLKTRKGWR MKGVGQQREK SRAKDLERLL AQQGEMERLE