Gene CNA03740 details

Gene Information

Locus tag: CNA03740
Symbol:
ID: 3253434
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 1006580
End bp: 1010546
Gene Length: 3967 bp
Protein Length: 1203 aa
Translation table:
GC content: 52%
IMG OID: 638252693
Product: eukaryotic initiation factor 4F subunit P130, putative
Protein accession: XP_566707
Protein GI: 58258589
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGCGGA AATGGAGAAG AAAGAAAAGC GGAAAAGGCC GCTGCGAGGA TTATAATTTG 
AAAGATGGTT TTTGTTATTG TAAGGCGGAC TTTGTTCATA TTTTCTTTTT TCCTGTCACT
CAAAAAAGTT GGGTAGTCAA CGTAGATACC GGGCAAATCT CTCGGTGTCA CAGTAACTCA
TCCACAACAA ACGATGCCCC CCCAGCCCGC CCCCGCCAGG CAGAACGGCG TCCCCCAACA
GCCCATCCCC CGCCCTCCGG TTCCCCAGCA AGGCATCCCA GCCCACTTCC CCGTCCCCAA
CCAACCCCCA GCTCAGTATC CCATCATGGC CTATCCTCCC CAATCAGGAT TCTACGTAAG
CTTCACACTA CATTCCACAT TCGTATCCTG ACTATAACCC CTTACCCTTT TTGCAGGCCG
GATACAACCC TTATGAGCAA CAGCAGAACT TTGGGATGCC CCCCCAATGG GCTCCCCAGC
ACCACCCTCA AAACCAGTTT CCGGGCGGCT ACAGTGCTCG GCCTGTCTCC CCGCCCTCTG
GAACAGCTCC TTTCCAGCCC GCGCAGCAAT CCTTCATCCA CAACTCTCCC TTCCCCGGCG
GCAGTGGGCC TAACGGTCCT CAGACACCCC TTCGGCCTTC CGGTCCTGCC TTGAACGGTC
ACCAGCAAAC TTCTTCAAAC GCATCTCAAA CATCTGCTCC TTCCACCCCC AACTTCTCTC
TCTCTGGTGC CAGTGCTACT TTTACTCCTC GACGATCTGC CGCAATCAAA ATTTCAAGAC
CCGACGGCAC GGAGTTCGAC CTCAAAAAGG AGGCCTTGAA GGTTTCGTCG ACAACTGTGT
CCCCTGCACC TACATCTCCC GCCTCTACTC CTCATACTCC TGCGACGCCC AATTCTGAAT
TGAAAAATGT CAAGGTAGAC ACTCCCAAGA AGCCTGTGTT TGGATTACCT GTGACAGTCA
AGATTGAGAG GCCAGAGGAG AAAGCTGCCA GGCTACTAGA GGAAGCGAAG AGAGAAAAGA
TCAGGGCCCA AGAGGAAAAG GAAGAAAATG AGAGGAAGGA GAGACTCGAA AAGAAGGCCA
AGGAAGAGGA AGAGAAGAAG GCCAAGGACG CTGCTGAAAA GGTAAGCATT CGAAGTCTGG
TATATGCTTT TTCCGTCGTG TGACTCACAT CCCTGTGCAG GCTGAACAGG CAAATTCCGC
CGGCGAGCCC ACCAACACAG AAGTCAACGG ATCTGTCGAA GAGGTCATCT CCGGCGCTCC
TTCCTCAGTG TCTTCTCCTG CCCTTGGCGC TGGCCTTCCT CCTAAGCCTG TGTCTGTTGT
CAACGGCGCA AACGCTCTTC CAGCCACCCT CAACCTCGCT TCTGCTTCTC CTCCTGTAGT
CAGCGAGCCC TCTAGTGCTT CTGTCTCTGC TCTCAGCTCT GCTCGCCCTA TTGAGGACAT
CCACGCCATC AGTTACCCTG GTTCACTGAA GTCTCCAAAC CCCCAGCTTA ATGTGTCTGC
AGAGCCTCGC AAGTTCCGAT ACGACCGCGA GTTTTTGATG CAGTTCATGA ATATCTGTAA
GGACAAGCCC GAGAGTCTTC CTCCTCTTGA GGAGATTGGC CTCGAGGCAG ACGGCGGTAG
TGGGTTCGGT AGCCGTGGTT CCCGCGGGGG CCGATCCTCT CAGGGACCCT CAAGCCGTAC
TGGCTCTACA GGTCTTGGTA TCGGCGGTGT CAATAGGGCA ACAATGGGTT CTTTCAGCAT
GGGCCAATTT GGTTCTGGTT CTGGTTCTGG TTCTCTACGC AACTCGACTA GTGAACAGCG
CTACCGTGCC AGTCTCCCAG GGCGCTCATC GAGCCAAGGT GGTCCCGGCG GTCTCCCTTC
GTTATCCGGT CTTCCTTCGA TGGGTCTTTC GAGTTCTCGA GGTGGCGAGA GCCGAGGAAG
TCAACGCGGT TCCAAGCGTG TTCCCCAATC TTCCCAACCT TCAACTCCCG CCGCTGCTCC
GATTCCCATC TCTGAAAATG CCTGGACCCG AAAACGACTT GGTGGTGACT CCGAGGGAAC
TCCCGCTTAC ATTGAGCGAA AGGTCAAGGC TTTGCTCAAC AAGCTTACTG AGGAAATGTT
CGATCCTATT TCCAAGCAGA TTCTTGAATG GGCGAACAAA TCGGCGAACG AGACTGATGG
TTTAACTCTC AAGCTCGTCA TCAAGCTTAT CTTTGAAAAA GCTACCGATG AGGCGCACTG
GTCTTCTATG TACGCAAAGC TCTGTCGATT GTTGCACGAC GAGATCAGTA ACGATGTTTC
CGACACCATC GATGGCCAAG TAATTTCTGG TCGATCCCTT TTCCGACGAT ATCTTCTCGG
TCGATGCCAG GTAGATTTTG AAGCGGGGTG GAAGGCTCGT GAAGACACTG CAGTGGCAGC
AGCGGCAAAG AAGGATGAGG ATGAAGCGAA GCAAGAGAAG GCTAAGGCGG AGGATGGTGG
TGAGGAGAAG GAGGCTGACT TAATGAGCGA TGAGTACTAC GCGGCTCAGA AAGCTAAGCG
ACGAGGTCTC GGTTTAATTC AACTGATTGG TGAACTCTTC AAGAGAGAGA TTGTTGCCAA
TCGAGTCATC AGCCAATGTC TATTGAAACT TCTCAGCAAT GTCAATGATC CCGACGAGGA
GGACATCGAG TCTGCTTGCA AGCTTCTCGC GACCGTCGGT GCCGCATATG ATCGAGCGGC
CCGTGAGAAT CTCAACAAGG CCTTTGACGT GCTTAATCAG ATGATGAAGA TTGAGTCGCT
TCCATCCCGT ATTAAATTCA TGATTATGGT CAGTATTTGC AATTTATCAG TTGTTCAGAT
CGATGCTAAC ATCAAAATCA GGATCTTAAC GATCTTAGGA GGGAGGGATG GAAATCTCGA
AAGAACCAGA GCGGCGTGAT GACTATTGCA GAGATTCACG AACAGAACGC AAAGGAGAAA
ACCGCCGCCG CCGCTGCTGC TCGAGAATCT CTCTCTCGAG GTGGTTCTCG TTCCGGAAAT
AGGCGCGACG GTACCCAACC CGGAGAATGG CAATCCGTCT CATCTAATCC CCGGTCTATC
AGCCGCCCCA CCGACTTTTC AAATATTGGT CGAAACATCA GTTCAAGCAG TGTCACGCCG
TCATTCGGTC CTTCAAGCGT TTTCGCGAAC CGCAAGGGCA AAGCCGCCAC GGCTAGCAAT
GCTCCTTCAC CGACATCCCG CCCCTCCTCA GCTAACATGT TTAGCGCTCT CCACGAGGCC
CAGGAGTCAG AGTCTCCCGC CGAAAGGCGT ACTAGTACCG ACGAAGGGGA GGCCCCCCAG
AGAAAGAAGC TCAACCTTGC ACCTCGAACA AAGCCTTTGG CATCCGAGGA TGACGCTAGG
GAGGAAGGAG AGGAGGCTCG GTCTGAGGAG GAAGCTGCCG GTCCTGAGTT GTCCGAAGGC
GATATCAAAT CAAAGATTGA CCTTGACATG AAGGAGTTAT GGGGTGACAA AGACCAGGGA
GGCTCAAGAA ACCCGGCAGA TGTGGCAGAG TACTTTAGCT CTTTACCCGA ATCACGTCGA
CCCTTATTGG CCGCCAGGCT GTTGGATGAC TTATTTAGGA TTTCGAAACT CAAGGATGCC
GAGGTGGTTG CAAAGGGTTG GAAGGTTTCT TTGGAACAGC AGGCTGTTTC TTCTGAAGTT
CTTAAGCAGG CGTAAGTGCA TTTATTCCCT TCATGTGTCA GTTTTGACAT CTCTGTGCCG
TAGTTTGGAA GCTCGTATGC CTACTTTGGA TGACGAAGCA ATCGACTTCC CCAATGCCTA
CGACGCTGTT GCCCTACTCA CTCGATCTCT TTCACTCTCT GATGACGACG TCAGTGCTTT
GGGAGACAAG ATCGAAGTTG ATGGTACTCC CCGAGTTACC CCCAAACAAA AACTTGAGAA
GGCTCTCGCC AAACTCAATG AGCAAGCATG AAGAAGTTTT TGATACCCGT AGATCATTTA
TTCATCT
 
Protein sequence
MRRKWRRKKS GKGRCEDYNL KDGFCYLTHP QQTMPPQPAP ARQNGVPQQP IPRPPVPQQG 
IPAHFPVPNQ PPAQYPIMAY PPQSGFYAGY NPYEQQQNFG MPPQWAPQHH PQNQFPGGYS
ARPVSPPSGT APFQPAQQSF IHNSPFPGGS GPNGPQTPLR PSGPALNGHQ QTSSNASQTS
APSTPNFSLS GASATFTPRR SAAIKISRPD GTEFDLKKEA LKVSSTTVSP APTSPASTPH
TPATPNSELK NVKVDTPKKP VFGLPVTVKI ERPEEKAARL LEEAKREKIR AQEEKEENER
KERLEKKAKE EEEKKAKDAA EKAEQANSAG EPTNTEVNGS VEEVISGAPS SVSSPALGAG
LPPKPVSVVN GANALPATLN LASASPPVVS EPSSASVSAL SSARPIEDIH AISYPGSLKS
PNPQLNVSAE PRKFRYDREF LMQFMNICKD KPESLPPLEE IGLEADGGSG FGSRGSRGGR
SSQGPSSRTG STGLGIGGVN RATMGSFSMG QFGSGSGSGS LRNSTSEQRY RASLPGRSSS
QGGPGGLPSL SGLPSMGLSS SRGGESRGSQ RGSKRVPQSS QPSTPAAAPI PISENAWTRK
RLGGDSEGTP AYIERKVKAL LNKLTEEMFD PISKQILEWA NKSANETDGL TLKLVIKLIF
EKATDEAHWS SMYAKLCRLL HDEISNDVSD TIDGQVISGR SLFRRYLLGR CQVDFEAGWK
AREDTAVAAA AKKDEDEAKQ EKAKAEDGGE EKEADLMSDE YYAAQKAKRR GLGLIQLIGE
LFKREIVANR VISQCLLKLL SNVNDPDEED IESACKLLAT VGAAYDRAAR ENLNKAFDVL
NQMMKIESLP SRIKFMIMDL NDLRREGWKS RKNQSGVMTI AEIHEQNAKE KTAAAAAARE
SLSRGGSRSG NRRDGTQPGE WQSVSSNPRS ISRPTDFSNI GRNISSSSVT PSFGPSSVFA
NRKGKAATAS NAPSPTSRPS SANMFSALHE AQESESPAER RTSTDEGEAP QRKKLNLAPR
TKPLASEDDA REEGEEARSE EEAAGPELSE GDIKSKIDLD MKELWGDKDQ GGSRNPADVA
EYFSSLPESR RPLLAARLLD DLFRISKLKD AEVVAKGWKV SLEQQAVSSE VLKQALEARM
PTLDDEAIDF PNAYDAVALL TRSLSLSDDD VSALGDKIEV DGTPRVTPKQ KLEKALAKLN
EQA
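
The length and GC content fields in this record can be cross-checked against the coordinates and the printed sequence. The following is a minimal sketch, not the database's own method: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates (which matches the reported gene length) and that sequences are printed in space-separated groups across several lines.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in space-separated groups over several lines."""
    # str.split() with no argument splits on any whitespace, including newlines.
    return "".join(block.split()).upper()


def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields of this record.
    print(feature_length(1006580, 1010546))  # 3967, matching Gene Length

    # The first three printed groups of the gene sequence, as a small demo.
    first_groups = clean_sequence("ATGCGGCGGA AATGGAGAAG\nAAAGAAAAGC ")
    print(len(first_groups))
```

Passing the full gene sequence block through clean_sequence and then gc_percent would give a value to compare with the reported GC content; the record does not say how that figure was rounded.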