Gene Information |
Locus tag | CNA03740 |
Symbol | |
ID | 3253434 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1006580 |
End bp | 1010546 |
Gene Length | 3967 bp |
Protein Length | 1203 aa |
Translation table | |
GC content | 52% |
IMG OID | 638252693 |
Product | eukaryotic initiation factor 4F subunit P130, putative |
Protein accession | XP_566707 |
Protein GI | 58258589 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
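The coordinate and length fields above are related by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported Gene Length of 3967 bp. The function names are hypothetical, and the single stop codon in `coding_nucleotides` is an assumption rather than something stated in the record.

```python
# Values copied from the Gene Information table above.
START_BP = 1006580
END_BP = 1010546
PROTEIN_LENGTH_AA = 1203


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumption)."""
    return 3 * (protein_length_aa + 1)


gene_length = span_length(START_BP, END_BP)
coding = coding_nucleotides(PROTEIN_LENGTH_AA)

print(gene_length)           # 3967, matching the Gene Length field
print(coding)                # 3612
print(gene_length - coding)  # 355 bp of the span not accounted for by codons
```

Under these assumptions the span is longer than the coding sequence alone, as expected for a gene flagged as spliced.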
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCGGA AATGGAGAAG AAAGAAAAGC GGAAAAGGCC GCTGCGAGGA TTATAATTTG AAAGATGGTT TTTGTTATTG TAAGGCGGAC TTTGTTCATA TTTTCTTTTT TCCTGTCACT CAAAAAAGTT GGGTAGTCAA CGTAGATACC GGGCAAATCT CTCGGTGTCA CAGTAACTCA TCCACAACAA ACGATGCCCC CCCAGCCCGC CCCCGCCAGG CAGAACGGCG TCCCCCAACA GCCCATCCCC CGCCCTCCGG TTCCCCAGCA AGGCATCCCA GCCCACTTCC CCGTCCCCAA CCAACCCCCA GCTCAGTATC CCATCATGGC CTATCCTCCC CAATCAGGAT TCTACGTAAG CTTCACACTA CATTCCACAT TCGTATCCTG ACTATAACCC CTTACCCTTT TTGCAGGCCG GATACAACCC TTATGAGCAA CAGCAGAACT TTGGGATGCC CCCCCAATGG GCTCCCCAGC ACCACCCTCA AAACCAGTTT CCGGGCGGCT ACAGTGCTCG GCCTGTCTCC CCGCCCTCTG GAACAGCTCC TTTCCAGCCC GCGCAGCAAT CCTTCATCCA CAACTCTCCC TTCCCCGGCG GCAGTGGGCC TAACGGTCCT CAGACACCCC TTCGGCCTTC CGGTCCTGCC TTGAACGGTC ACCAGCAAAC TTCTTCAAAC GCATCTCAAA CATCTGCTCC TTCCACCCCC AACTTCTCTC TCTCTGGTGC CAGTGCTACT TTTACTCCTC GACGATCTGC CGCAATCAAA ATTTCAAGAC CCGACGGCAC GGAGTTCGAC CTCAAAAAGG AGGCCTTGAA GGTTTCGTCG ACAACTGTGT CCCCTGCACC TACATCTCCC GCCTCTACTC CTCATACTCC TGCGACGCCC AATTCTGAAT TGAAAAATGT CAAGGTAGAC ACTCCCAAGA AGCCTGTGTT TGGATTACCT GTGACAGTCA AGATTGAGAG GCCAGAGGAG AAAGCTGCCA GGCTACTAGA GGAAGCGAAG AGAGAAAAGA TCAGGGCCCA AGAGGAAAAG GAAGAAAATG AGAGGAAGGA GAGACTCGAA AAGAAGGCCA AGGAAGAGGA AGAGAAGAAG GCCAAGGACG CTGCTGAAAA GGTAAGCATT CGAAGTCTGG TATATGCTTT TTCCGTCGTG TGACTCACAT CCCTGTGCAG GCTGAACAGG CAAATTCCGC CGGCGAGCCC ACCAACACAG AAGTCAACGG ATCTGTCGAA GAGGTCATCT CCGGCGCTCC TTCCTCAGTG TCTTCTCCTG CCCTTGGCGC TGGCCTTCCT CCTAAGCCTG TGTCTGTTGT CAACGGCGCA AACGCTCTTC CAGCCACCCT CAACCTCGCT TCTGCTTCTC CTCCTGTAGT CAGCGAGCCC TCTAGTGCTT CTGTCTCTGC TCTCAGCTCT GCTCGCCCTA TTGAGGACAT CCACGCCATC AGTTACCCTG GTTCACTGAA GTCTCCAAAC CCCCAGCTTA ATGTGTCTGC AGAGCCTCGC AAGTTCCGAT ACGACCGCGA GTTTTTGATG CAGTTCATGA ATATCTGTAA GGACAAGCCC GAGAGTCTTC CTCCTCTTGA GGAGATTGGC CTCGAGGCAG ACGGCGGTAG TGGGTTCGGT AGCCGTGGTT CCCGCGGGGG CCGATCCTCT CAGGGACCCT CAAGCCGTAC TGGCTCTACA GGTCTTGGTA TCGGCGGTGT CAATAGGGCA ACAATGGGTT CTTTCAGCAT GGGCCAATTT GGTTCTGGTT CTGGTTCTGG TTCTCTACGC AACTCGACTA GTGAACAGCG 
CTACCGTGCC AGTCTCCCAG GGCGCTCATC GAGCCAAGGT GGTCCCGGCG GTCTCCCTTC GTTATCCGGT CTTCCTTCGA TGGGTCTTTC GAGTTCTCGA GGTGGCGAGA GCCGAGGAAG TCAACGCGGT TCCAAGCGTG TTCCCCAATC TTCCCAACCT TCAACTCCCG CCGCTGCTCC GATTCCCATC TCTGAAAATG CCTGGACCCG AAAACGACTT GGTGGTGACT CCGAGGGAAC TCCCGCTTAC ATTGAGCGAA AGGTCAAGGC TTTGCTCAAC AAGCTTACTG AGGAAATGTT CGATCCTATT TCCAAGCAGA TTCTTGAATG GGCGAACAAA TCGGCGAACG AGACTGATGG TTTAACTCTC AAGCTCGTCA TCAAGCTTAT CTTTGAAAAA GCTACCGATG AGGCGCACTG GTCTTCTATG TACGCAAAGC TCTGTCGATT GTTGCACGAC GAGATCAGTA ACGATGTTTC CGACACCATC GATGGCCAAG TAATTTCTGG TCGATCCCTT TTCCGACGAT ATCTTCTCGG TCGATGCCAG GTAGATTTTG AAGCGGGGTG GAAGGCTCGT GAAGACACTG CAGTGGCAGC AGCGGCAAAG AAGGATGAGG ATGAAGCGAA GCAAGAGAAG GCTAAGGCGG AGGATGGTGG TGAGGAGAAG GAGGCTGACT TAATGAGCGA TGAGTACTAC GCGGCTCAGA AAGCTAAGCG ACGAGGTCTC GGTTTAATTC AACTGATTGG TGAACTCTTC AAGAGAGAGA TTGTTGCCAA TCGAGTCATC AGCCAATGTC TATTGAAACT TCTCAGCAAT GTCAATGATC CCGACGAGGA GGACATCGAG TCTGCTTGCA AGCTTCTCGC GACCGTCGGT GCCGCATATG ATCGAGCGGC CCGTGAGAAT CTCAACAAGG CCTTTGACGT GCTTAATCAG ATGATGAAGA TTGAGTCGCT TCCATCCCGT ATTAAATTCA TGATTATGGT CAGTATTTGC AATTTATCAG TTGTTCAGAT CGATGCTAAC ATCAAAATCA GGATCTTAAC GATCTTAGGA GGGAGGGATG GAAATCTCGA AAGAACCAGA GCGGCGTGAT GACTATTGCA GAGATTCACG AACAGAACGC AAAGGAGAAA ACCGCCGCCG CCGCTGCTGC TCGAGAATCT CTCTCTCGAG GTGGTTCTCG TTCCGGAAAT AGGCGCGACG GTACCCAACC CGGAGAATGG CAATCCGTCT CATCTAATCC CCGGTCTATC AGCCGCCCCA CCGACTTTTC AAATATTGGT CGAAACATCA GTTCAAGCAG TGTCACGCCG TCATTCGGTC CTTCAAGCGT TTTCGCGAAC CGCAAGGGCA AAGCCGCCAC GGCTAGCAAT GCTCCTTCAC CGACATCCCG CCCCTCCTCA GCTAACATGT TTAGCGCTCT CCACGAGGCC CAGGAGTCAG AGTCTCCCGC CGAAAGGCGT ACTAGTACCG ACGAAGGGGA GGCCCCCCAG AGAAAGAAGC TCAACCTTGC ACCTCGAACA AAGCCTTTGG CATCCGAGGA TGACGCTAGG GAGGAAGGAG AGGAGGCTCG GTCTGAGGAG GAAGCTGCCG GTCCTGAGTT GTCCGAAGGC GATATCAAAT CAAAGATTGA CCTTGACATG AAGGAGTTAT GGGGTGACAA AGACCAGGGA GGCTCAAGAA ACCCGGCAGA TGTGGCAGAG TACTTTAGCT CTTTACCCGA ATCACGTCGA CCCTTATTGG CCGCCAGGCT GTTGGATGAC TTATTTAGGA TTTCGAAACT CAAGGATGCC GAGGTGGTTG 
CAAAGGGTTG GAAGGTTTCT TTGGAACAGC AGGCTGTTTC TTCTGAAGTT CTTAAGCAGG CGTAAGTGCA TTTATTCCCT TCATGTGTCA GTTTTGACAT CTCTGTGCCG TAGTTTGGAA GCTCGTATGC CTACTTTGGA TGACGAAGCA ATCGACTTCC CCAATGCCTA CGACGCTGTT GCCCTACTCA CTCGATCTCT TTCACTCTCT GATGACGACG TCAGTGCTTT GGGAGACAAG ATCGAAGTTG ATGGTACTCC CCGAGTTACC CCCAAACAAA AACTTGAGAA GGCTCTCGCC AAACTCAATG AGCAAGCATG AAGAAGTTTT TGATACCCGT AGATCATTTA TTCATCT
|
Protein sequence | MRRKWRRKKS GKGRCEDYNL KDGFCYLTHP QQTMPPQPAP ARQNGVPQQP IPRPPVPQQG IPAHFPVPNQ PPAQYPIMAY PPQSGFYAGY NPYEQQQNFG MPPQWAPQHH PQNQFPGGYS ARPVSPPSGT APFQPAQQSF IHNSPFPGGS GPNGPQTPLR PSGPALNGHQ QTSSNASQTS APSTPNFSLS GASATFTPRR SAAIKISRPD GTEFDLKKEA LKVSSTTVSP APTSPASTPH TPATPNSELK NVKVDTPKKP VFGLPVTVKI ERPEEKAARL LEEAKREKIR AQEEKEENER KERLEKKAKE EEEKKAKDAA EKAEQANSAG EPTNTEVNGS VEEVISGAPS SVSSPALGAG LPPKPVSVVN GANALPATLN LASASPPVVS EPSSASVSAL SSARPIEDIH AISYPGSLKS PNPQLNVSAE PRKFRYDREF LMQFMNICKD KPESLPPLEE IGLEADGGSG FGSRGSRGGR SSQGPSSRTG STGLGIGGVN RATMGSFSMG QFGSGSGSGS LRNSTSEQRY RASLPGRSSS QGGPGGLPSL SGLPSMGLSS SRGGESRGSQ RGSKRVPQSS QPSTPAAAPI PISENAWTRK RLGGDSEGTP AYIERKVKAL LNKLTEEMFD PISKQILEWA NKSANETDGL TLKLVIKLIF EKATDEAHWS SMYAKLCRLL HDEISNDVSD TIDGQVISGR SLFRRYLLGR CQVDFEAGWK AREDTAVAAA AKKDEDEAKQ EKAKAEDGGE EKEADLMSDE YYAAQKAKRR GLGLIQLIGE LFKREIVANR VISQCLLKLL SNVNDPDEED IESACKLLAT VGAAYDRAAR ENLNKAFDVL NQMMKIESLP SRIKFMIMDL NDLRREGWKS RKNQSGVMTI AEIHEQNAKE KTAAAAAARE SLSRGGSRSG NRRDGTQPGE WQSVSSNPRS ISRPTDFSNI GRNISSSSVT PSFGPSSVFA NRKGKAATAS NAPSPTSRPS SANMFSALHE AQESESPAER RTSTDEGEAP QRKKLNLAPR TKPLASEDDA REEGEEARSE EEAAGPELSE GDIKSKIDLD MKELWGDKDQ GGSRNPADVA EYFSSLPESR RPLLAARLLD DLFRISKLKD AEVVAKGWKV SLEQQAVSSE VLKQALEARM PTLDDEAIDF PNAYDAVALL TRSLSLSDDD VSALGDKIEV DGTPRVTPKQ KLEKALAKLN EQA
|
| |
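The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, as printed in the record.
example = clean_sequence("ATGCGGCGGA AATGGAGAAG")

print(len(example))         # 20
print(gc_percent(example))  # 55.0
```

Applied to the full gene sequence, the same two functions could be used to compare the result with the Gene Length and GC content fields reported in the Gene Information table.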