Gene CNA04110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA04110 
Symbol 
ID3253375 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1104273 
End bp1107017 
Gene Length2745 bp 
Protein Length836 aa 
Translation table 
GC content59% 
IMG OID638252731 
ProductER organization and biogenesis-related protein, putative 
Protein accessionXP_566769 
Protein GI58258713 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0969771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACC TGCTCGACCT CAGCTGGTCC GCCAAGCCGC CGCCCCCCGC GCGCAGCCCC 
CAGCCCGCCT TTGAGCTGCT CGCCCGCCCC CCCCCGCAGC CCGCCGCGAA GCCGCCAACG
CCAGCGCCGC CGCCAGCGAG ACCGCCAGCC ACAGCGCCAC CGCCAGCGCC AGCCACAGCG
CACAGCGACG CCTTCTCCAG CCTCCTCGCC ATCCCCGCAG CACGCCCCGC CGAGCCCGCC
AAGGACATGA CCATGGCGCA GAAGCAGGCG GCGATCGCAG AGGAGCGGCG GCGAAGGGAC
GAGGCGCAGA GGAAGCAGTT TGAGGCCGAC GGTGCCTTCT GGGAGAACCT CGGCAGTCGA
AGCAGCGCCG CCACGTCCGC GTCGTCCCGC TCCAACCACG GCACGTTCTG GGACCAGTAT
GGCGACGACG GCGACCTCTT GTCCGCAGGC TCAAACCCCG TATCGCAGCC GCGATCCCAC
GCCCACTCGC CTGCGCCGCT TCCGCCCGCA CACACCGACC CGTTCGACTT TGACGCCTTT
GAGAACACGC TGAATGGTTC CGCCAAAGCG GCCCTGGCAT CCGAGGCGAG CAACAGCGGA
GAGACATCAG CGATACGGAC CCCAGTGTCA AACGTCGACT TGTGGGATGG TGACGGGAAT
GATGACGACT TGCTTGGAGA GCTCGGTAGA CCTGCTGCAT CCAAGCCCGC TCAGACCAAG
GTATGCACAA ACAGCTATGC ACAAACAGAC GACAGGTGCT TTCCCGCTAA CATCCAATCA
GTCCACGCTT CCCCGCACCC CTCAAGAACC TCCACCCGCG TCTCCTCCAC CCCATATTGT
CGGCCAGATT GTCGAAATGG GCTTTGCCCC TACGCAAGCT CGGCAGGCGC TCGCAAAGAC
GTCAACCGGT ACAGACGTCC AGGAAGCTCT CGAGCTGCTT CTCGGTAGTG GCCAAGGCCA
AGGCCACGCG CCTGGTAATG CCGGACCGAG CGGATCATGG GATGATCATC TACCCGAAGC
TGATGACGAT AGGATCGAGT ACGAGCGACG TAAACGCCAA GAAGCCGAAA AAGAACGACG
GCGTAGACGG CGTGCTGGTC CGTCCCGCGA TTCTGTAAAG GCACGTACGA CCGAGGAACA
AGAGCGAGAT GCCGCCGCTG CCGCTGCCAA CGCGCAAGAA CAGGCCGAAC GTATTCTTGC
GCAAGCATCC GAGATTGGAC AGAGTATGTT TAACAAGGCT ACATCGTTCT GGAACACCAG
TAAAGAACGG GCTATAAAGG TGTATGAAGA GCAGAGAAAG GCTCTGGAGG CGGCGGAAAG
GGGCGAGGAT CAGAAGAAGA AGGATACGAG GCCAAAGTGG ATGCAGGAAA ATGAAGGTTG
GAAGGATGAG AGATCGCGCC AGGTCAGGGG AGGCTTCAAG GATAGCGACG ATGAGCAAGA
GATCGGACCG TCAATCCCTG CACCCAGGCG AGAGCAGAGA CCTGCACGAC AAGAGGGCAT
CGCCGATCTC TTGTCTGGCA ATGACCACCA CCCAGCACAT CGCCCTACAT CTCGCCCTAC
ATCTCGCCCC GCATCTCGAG CTGCACCGTC ACCCAAACCC GCGGCGCCGC CTTTACCTCG
TCGACCCCTC ATCTCTGCCA CCCCCTTACA ACTCGAGTCG TCAGCCGCTC ACAAGACAAA
GGGAAACGAC CATTTCAAAC TCGGTCGTTT TACCGAAGCC GAGTCGTGCT ATTCATCGGC
GATCGCCGCT CTCCCAAAAG ACAACCTCTT CCTCATCCCT CTCCTCAACA ACCGCGCGAC
CACCCGGCTC AAGCTGGGCG ACGCTGTCAA CGCAGCGACA GATTGCACCG CCGTCATCGA
CCTCGTCGGC ATCTCCTACC ACCCGGGCAA GGAAGCGCCC CTACCTGCAG AGTACGCCGA
CATCAAGCTC GGCGACGGTC TTTCAAAAGC ACTGGTCAAG AGAGCGCAAG CGTGGGAAAT
GGGTGAAAAG TGGAAAGCGG CGTTGGAGGA TTGGGAACGG GTGATGGGAT TGGATCCTGT
GCTCTTGGGC GGCACGAATG CGGCTGCCAG TACGAGGAAT ATGGCAGCGC AAGGTGCGAG
GCGGGCGAAA AAGATGATGC AGAGTGGTGG AAATGTTGCT GCCGCCGCTC CTGCCGTAGT
CAGCAGCCCC GCACCGTCTA CACCCGCCGC AGACGTCAAC CGCTCGGCTG CCGTCGCCGA
CCTCCGCAAA GCCGCACAGG CGCTCGAGGC AGAGGAAGAA GCCAGACTCT TTCACAAGGA
CAACGTCGAC GCAAAGATCG CAAACTGGAA AAGCGGCAAG GAAACCAACT TGCGCGCGCT
CATCGCTAGT CTTGATACAG TGCTCTGGGA CGATATCGTG AAAGAGGGCG GGCTGAAAGT
AGGGATGCAT GAGCTGGTTA CGGATAAGCA AGTGAAGATC AAGTATATGA AGGTGATTGC
GAGGTTGCAT CCCGACAAGG TAAGTTTAAT TTATAGATCA TAGTGGTGGC CGATGCTCAT
TTGGGGCGCG TAAAGTTGAA TACGCAAAAC ACAACAGTGG AGCAGAGGAT GCTGGCGAAT
GGCGCATTTG GAGTGCTGAG CGACGCGTAA GTCGGGTTTT TTTTTTTTTT TTTTCAAGGA
CGCAAGTATT AGTTGCTGAT GAGATCATTT GTGAGGTAGT TGGCAAGCAT TTAACCAATG
ATTTTTAAAT CTTCTCATTT ACATTGTTTT ACAACAATTG AGAAT
 
Protein sequence
MDDLLDLSWS AKPPPPARSP QPAFELLARP PPQPAAKPPT PAPPPARPPA TAPPPAPATA 
HSDAFSSLLA IPAARPAEPA KDMTMAQKQA AIAEERRRRD EAQRKQFEAD GAFWENLGSR
SSAATSASSR SNHGTFWDQY GDDGDLLSAG SNPVSQPRSH AHSPAPLPPA HTDPFDFDAF
ENTLNGSAKA ALASEASNSG ETSAIRTPVS NVDLWDGDGN DDDLLGELGR PAASKPAQTK
STLPRTPQEP PPASPPPHIV GQIVEMGFAP TQARQALAKT STGTDVQEAL ELLLGSGQGQ
GHAPGNAGPS GSWDDHLPEA DDDRIEYERR KRQEAEKERR RRRRAGPSRD SVKARTTEEQ
ERDAAAAAAN AQEQAERILA QASEIGQSMF NKATSFWNTS KERAIKVYEE QRKALEAAER
GEDQKKKDTR PKWMQENEGW KDERSRQVRG GFKDSDDEQE IGPSIPAPRR EQRPARQEGI
ADLLSGNDHH PAHRPTSRPT SRPASRAAPS PKPAAPPLPR RPLISATPLQ LESSAAHKTK
GNDHFKLGRF TEAESCYSSA IAALPKDNLF LIPLLNNRAT TRLKLGDAVN AATDCTAVID
LVGISYHPGK EAPLPAEYAD IKLGDGLSKA LVKRAQAWEM GEKWKAALED WERVMGLDPV
LLGGTNAAAS TRNMAAQGAR RAKKMMQSGG NVAAAAPAVV SSPAPSTPAA DVNRSAAVAD
LRKAAQALEA EEEARLFHKD NVDAKIANWK SGKETNLRAL IASLDTVLWD DIVKEGGLKV
GMHELVTDKQ VKIKYMKVIA RLHPDKLNTQ NTTVEQRMLA NGAFGVLSDA WQAFNQ