Gene CNL04700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04700 
Symbol 
ID3254838 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp314756 
End bp317861 
Gene Length3106 bp 
Protein Length944 aa 
Translation table 
GC content52% 
IMG OID638253941 
Productexpressed protein 
Protein accessionXP_568013 
Protein GI58261206 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCCTTCCAA CTCCGTCTAT CATGTCATGC CCACCCGCTC CCAACCGCAT TCACCAACTA 
CCAGGTACGT CATCGCGTCC CTTTTTTACC GCACTGTTCG TGTGCTAACA ACGTCAACCA
ACCACAGTCC GCGTCTCCTA CTACATCCCT GCTTCCTCAC AGACCTACTC TACAATATTT
CCGTCATTAC AACAAGTGTA CATACATCCT CAATCTGCTG GTGGAGACGA CGAGACATGG
GGATCCATAT ACCTAAAAAC TGTGGTAAAG GGGGTTCTGG CCGCCAGGTG GGTCTTGTTT
GAACACGTCG AAAAGGCGCC CATATCACTG AACCAGCTGA CCATCTCTAT TGCAGTCCTG
AATTACATCC TGCCTATCCC GGAACATCCG ACTTATCTCT CTACGTCCTG GATCCCCGCG
AAACATATTT TCGTCGTGCA CGCGTCGGCT CAGCCCACCA TGGCAACACC GACACGACTG
GTAGCGAAGT ATGGACAGGT AAAGGTCTTG TTTCTTGGAG CCTCGCGGAA CATGGCCAAG
GCAAAAACCT TATCGCCGGT CGGCTAGAAC GAGACGTAGC GTTTACTACG GCGATCAGAA
CTCAGGGCAT GTCTGCTCTT GAAGCTTTGA TGGCGGCAGA CCAGATGGGT AATGTAAGCG
AAGAAAGCTG GGGTATTGAC ATCTCATTGG GCTTGAACAT GGGATTAGGT GCGCCTGGAG
GCTTCGCACG TGCACGTGCG AGCTCGCCGG CCGTGCGGAG GAGGCGCTCT AGTTTAGAGA
AGGAAGAAAG GGGACATATA ACACCTCCTT ATCCTAGACA ACAAAAACGC GCGTCAGTCC
CGCCCGCACC GCAAACTTTT GACATCCAAG AAAAACTTCC CGCCCACCCA TCATCATCTT
CTTACTCCCA CGACCAGCCC TTTAGACCCG GTCACAATAT CAGTCACGCA AGCTCCCCCA
TCGACCCCCC TCGTGCTCGT GGTCGACCAA CCAAATCTGC TAGTACTGGC GGCCGTCCTC
GCAAGGTCAC TGCTTCGTCA CACAGCTCCT CCACTTCTAT TGATGGTGCC GACCCAAGAC
AACGCCCTGC ACATAGTCAA CGACCATACG AAATCCCGCT TCCGTCCTCT GAAGGCCCTA
ACGTCTACGA TGCCCTATCT TCCATTCCCG AAAATATCCT GGCACGTCCC GAATCTCTCA
CACGCGAGCA AGCACAAAGA CTACTGGCAA GCCCTGCATT CATCGATATG CTTGGAAAAA
TCACTGGCAC CTCGATTCCT ACGGGCAAGC CCAACCCCAA CAACAAGCGC CTGAGAGATG
GAGAGGAACC GGAGTTTCAA CCTCCGAAGC GAAAGAGAGG AAGACCGACG AAGGCTGAAA
AGGCGGCAAA GGAGGCTCAA GAAGCTGCCG CGAGATTGGC GGAAGCTGTC AAACAAAAGG
CCGAAGCTGT CGATGGTCCA CAAGAGGACT CGAGAAACCC AGTATGCTGG AATTGCGGCA
GAACGAAGTC TGCCATCTGG AGAACCAAGG TGATGGAAAA TGGACAGAGT GTGAGAGTGT
GCAATGGTAC GTGATCATTA GTAGTGCATG TGGATTTTAT ATTGACCTTA AACAGCTTGT
GGTCTTTACT GGAACAAACT TGGGACTATG AGGCCACCCA ACCTTTGGGC AGACGGCGAT
GACGATCGTC GAAGCGAACG CGCGCAAAGT ACATTCTCCG AGGCTCCATC AGAACATGGA
CCAGCACATC GAGTTGATAA ACCGCTTGCT CGGGCCCAAG AGTCATTTAA GCGGACATTA
TCTGCAGCGG TCGAGCAAGA TGCTAGGCGT ATGGCCACCC GCCACAAGGG TCGGGCGCCT
ACTTCTCCCA ACAAACATTC AAAACTTGGT CCAATGACTT CCCCGCCTCG TGGGTCTGCT
TCTGCAACAA AATCGTTAAA GCAACGCAAA TACACTGCGG CCAGTTCTCC CGGTGGCTTC
GTAGAGACGA TGCACGATTC GTTCGAGTCG GAAGTTAATG AGGCTAGTCC TGAGGATGAA
GATGAATCTC CTCATTTCCG TGCCCGCCCT GCTCCTCATC CCCATCGCCT CACTCGTAAT
CCACCTTCCA ATACTGATAC CAGGACGCTC GACCTTCCTC TCAGCGACGA CGGTTTAGGC
AGCGGCGCAT CTCGCGGCAA CAATGAATGG AACGATGAAG TCTCTGCATT TTTCGATGTG
GAGGGCTTCT CAATGTCCAC CATGGCCCAA CATGAAATTC CAGAATACAG TCGTCGTCAC
CATCGTCAAG AGGTTATTCA AGAAACGAAG CTCGGTGCCC ACCGCTCAAA TCCTATCCAC
GTGACTTCGT CGTCGATGAT GGAGAATGGG CATGTTGGCA CGGATTCTAC TCTTGAGGAA
GATACCGTAC TTTCACAGTT ATTCAACCGT ACCTCTTCCA TTGCAGGGCC GGGCTCTTCT
CCTTCAGGGT TTGACTTTTC CCAATTACCC CCTTCTTCGC CGCCCATGTC CGTTCTCAGC
TCAGATGCTC TTCCCCATTC TGCACTCCTT TTGTCGAGCC CTGTGAAGAA GAACACTCCT
AGTGTATCTG GACAGACGCC CAGTGCGTCT GGCTTGACGC CCGTAGACGG CAACCGCTTG
CCTCAGAGCA GCTCCAAGTT GAGACACTCT GTCAACGCGG GTGACAGTCA CGATAACGAT
AGACAGGCTG GTGGAAACGG TGGTGTACAG CAGCTAGACT TTGAGGGTAT TCAGAGGATG
TTTAACTTGA TGTCCCATCC CGATTACGTC TCGCAGGAAA ATACGTCTGC CGGTACGAAT
CACACCCCTC TTTCGACGTT TGAAGACCCT CAATACGGTG CTTTGAACGA GTTGATTGAT
GGTCTCGGTG GAGGTGCGTC CGGGATGAAG GTGAATGTAG GAGGGGAGGT GGTTTGTGGA
GGAGAGGGCG AAACGAGCAG TGCGAGTGCA GTCTTGGCAG ACGGGAATGG GGAAGACATC
TTTGCATCAT TCTTGGACGG AGGAGCGTTT GTGTGAGAGA GGGTAAAACT AGCATTGGAC
TTTACTTTTT TTAATACGAA TAACAACAGA CTCGGCATTA ATATGA
 
Protein sequence
MSCPPAPNRI HQLPVRVSYY IPASSQTYST IFPSLQQVYI HPQSAGGDDE TWGSIYLKTV 
VKGVLAASPE LHPAYPGTSD LSLYVLDPRE TYFRRARVGS AHHGNTDTTG SEVWTGKGLV
SWSLAEHGQG KNLIAGRLER DVAFTTAIRT QGMSALEALM AADQMGNVSE ESWGIDISLG
LNMGLGAPGG FARARASSPA VRRRRSSLEK EERGHITPPY PRQQKRASVP PAPQTFDIQE
KLPAHPSSSS YSHDQPFRPG HNISHASSPI DPPRARGRPT KSASTGGRPR KVTASSHSSS
TSIDGADPRQ RPAHSQRPYE IPLPSSEGPN VYDALSSIPE NILARPESLT REQAQRLLAS
PAFIDMLGKI TGTSIPTGKP NPNNKRLRDG EEPEFQPPKR KRGRPTKAEK AAKEAQEAAA
RLAEAVKQKA EAVDGPQEDS RNPVCWNCGR TKSAIWRTKV MENGQSVRVC NACGLYWNKL
GTMRPPNLWA DGDDDRRSER AQSTFSEAPS EHGPAHRVDK PLARAQESFK RTLSAAVEQD
ARRMATRHKG RAPTSPNKHS KLGPMTSPPR GSASATKSLK QRKYTAASSP GGFVETMHDS
FESEVNEASP EDEDESPHFR ARPAPHPHRL TRNPPSNTDT RTLDLPLSDD GLGSGASRGN
NEWNDEVSAF FDVEGFSMST MAQHEIPEYS RRHHRQEVIQ ETKLGAHRSN PIHVTSSSMM
ENGHVGTDST LEEDTVLSQL FNRTSSIAGP GSSPSGFDFS QLPPSSPPMS VLSSDALPHS
ALLLSSPVKK NTPSVSGQTP SASGLTPVDG NRLPQSSSKL RHSVNAGDSH DNDRQAGGNG
GVQQLDFEGI QRMFNLMSHP DYVSQENTSA GTNHTPLSTF EDPQYGALNE LIDGLGGGAS
GMKVNVGGEV VCGGEGETSS ASAVLADGNG EDIFASFLDG GAFV