Gene CNL05340 details


Gene Information

Locus tag: CNL05340
Symbol:
ID: 3254797
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 486848
End bp: 488038
Gene Length: 1191 bp
Protein Length: 361 aa
Translation table:
GC content: 54%
IMG OID: 638254008
Product: riken protein, putative
Protein accession: XP_568072
Protein GI: 58261324
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
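
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the stop codon is counted in the gene sequence (the listed sequence ends in TAG). The function names gene_length and noncoding_bp are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def noncoding_bp(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not accounted for by codons.

    Assumes the coding part is three bases per amino acid plus one
    stop codon; for a spliced gene the remainder would be intronic.
    """
    coding_bp = 3 * (protein_len + 1)
    return gene_len - coding_bp


if __name__ == "__main__":
    length = gene_length(486848, 488038)
    print("gene length (bp):", length)
    print("non-coding bases (bp):", noncoding_bp(length, 361))
```

Under these assumptions the computed span matches the reported Gene Length of 1191 bp, and the remainder beyond the coding bases is consistent with the record marking the gene as spliced.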


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGTCC AGGTATCTCC AGAGCAAATG GCCACATTGA AGGCTACTCT GCTCAACACT 
CCCGGAAATG TCCCCTTGCA TGAGCGATTC AGGGCCTTGT TTATGCTCAA GGCTGTGGGC
GGTGATGAGG TTGTCGACAT CGTTTCCGAA GGTGGGTTCA TATATGAACG ATCAATGAGA
ATAAGCTCAA CATTCAATTT ATACAGGTCT TAAGGACCCT TCACCTCTCT TAAAGCACGA
GCTTGCCTAC GTCCTCGGTC AGCTTCTCAA CACCCGTGCT CTTCCCACCT TGTCTCGAGT
CCTTGAAAAC CCCACCGGCG AGCATTGCTC TATGGTTCGT CATGAGGCTG CTGAGGCTCT
TGGCGCTATT GGTGCTGAAG AGTCCCTCCC AATTTTAAGA AAGTACATGC AGGACGAAAA
TAGGGAGGTC CGAGAGACCT GCGAGATTGC AGTCGGCAAG ATTGAGTTCG ATTTGAGCGA
GGAGGGAAAG AAGACCAATG CCAAGTGAGC TCCTTTTATC TTCTTTTTAT TGCTCTTCCG
CTGACCTTTT AAAGCCCCGA CTTCCCCACT ATTGACCCCG CTCCTTCTGC TGCCCCTTCC
GACATTCCTT CCCTCCGCGC CGACCTTCTT AACACTTCCC TTCCCCTCTT CCAGCGATAC
CGTGCCATGT TTGCGCTCCG TGACTTTGGT GCCGGCTCCA AAGAGGCTGT TGAAGCCCTT
GCCGACGGTT TCCGAGACGG CAGTGCCCTT TTCCGTCACG AGATTGCCTA CATCTTTGGT
CAGCTTTCCA GCCCGTACTC CATCCCTTCG CTTCTATCCA GGTTGAGGGA CGCCAAGGAA
GACGACATGG TCAGGCACGA GGCTGCCGAG GCTCTTGGGG GTATTGCGTC TGACGGCGTG
GAATCTGAGA ACCCCGAGGT CGTGCTTCCT GAAGACGAAC GCCTTCCCGA AGGTGGTGTT
CTTGCCGTTC TGCGTGAATG GGCCGTCAAG GCCGATGCCC CTACCGTTGT TCGTGAGTCT
TGTCAGGTCG CCATTGACAT GTGGGAGTAT GAGAACTCTG CCGATCAATT TAACCCGCTT
GACTCTCTCT CGGCCAAGCA AGAGGAGAGA GAGAAGACTG AGAAGGTCAA CACCACGGGT
ATGGAGAGGT CTGCACACGC TGCTGTTGCC GCCATGGGTA TTGCTGCCTA G
 
Protein sequence
MSVQVSPEQM ATLKATLLNT PGNVPLHERF RALFMLKAVG GDEVVDIVSE GLKDPSPLLK 
HELAYVLGQL LNTRALPTLS RVLENPTGEH CSMVRHEAAE ALGAIGAEES LPILRKYMQD
ENREVRETCE IAVGKIEFDL SEEGKKTNAN PDFPTIDPAP SAAPSDIPSL RADLLNTSLP
LFQRYRAMFA LRDFGAGSKE AVEALADGFR DGSALFRHEI AYIFGQLSSP YSIPSLLSRL
RDAKEDDMVR HEAAEALGGI ASDGVESENP EVVLPEDERL PEGGVLAVLR EWAVKADAPT
VVRESCQVAI DMWEYENSAD QFNPLDSLSA KQEEREKTEK VNTTGMERSA HAAVAAMGIA
A