Gene Information |
Locus tag | CNE00840 |
Symbol | |
ID | 3257920 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 220470 |
End bp | 224673 |
Gene Length | 4204 bp |
Protein Length | 1027 aa |
Translation table | |
GC content | 48% |
IMG OID | 638256670 |
Product | hypothetical protein |
Protein accession | XP_570812 |
Protein GI | 58267312 |
COG category | |
COG ID | |
TIGRFAM ID | |
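
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, an assumption that reproduces the listed gene length. It also computes the minimum coding length implied by the protein length, assuming one stop codon. The function names `gene_length` and `min_coding_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an optional stop codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
length = gene_length(220470, 224673)
coding = min_coding_length(1027)

print(length)           # 4204, matching the listed gene length
print(coding)           # 3084
print(length - coding)  # 1120 bp not accounted for by coding sequence
```

The 1120 bp difference is consistent with the gene being marked as spliced, since the genomic span can include introns and untranslated regions in addition to the coding sequence.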
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.515012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTTTGTCTTT GTTTCTTCCT TACCCTAATA GCTTCTCAAA AATCACCCGT GCCTTCCTTC TTCGTCCGCC AATGTCGCCA ACATCTTTTC AGAATTCGGC CGGCCCTGGC CCTCTTCTCA ACCAGCACAG TCGTAGTCCA CCTAGTGCCA GTAATGTCAG CAGCCGTGAC CGTCAAGGCA CAGCGGAATC AGATATAGAA GCCAGAGAAG AGAACGGTGC CTCTAAAGCT CCACTTCGTA AGAAGCAAAG GAAGCAAAGG CCTGTGTTCA GTTGCGCTGG TATGTACATA TCTTCATGGT CTGTCTTGAT TGCCAGATCA TCGTTTGATT TTGCTGACAT TCTTTTAGAA TGCAGGAGGC TCAAACTCAA GTGTGATCGT CAGGGTGAGT GATTTGATAC TCTCTATGCA CCATACCAAA CTCATTTATC CAGTGCCATG CGATAACTGC GTCAAGAGAC GTTGTCATTC TATTTGCCCC GATGGGGTTC GCGTTACCAG ACAGTAGTAC GTCATTTACA CAGTTACTGG TGAGCTATTT ACTGTCGCTG ACGCAAATGA ACCCAGTCTC GCAAATCCTG ATTCCGCTTT GCTTACGCGT CTTGAGCAAC TGGAGGGCAT CGTATCTAGA CATAGATTGG ATCCCGTCAT TACAGAAGCC GCATCGGCGA AAGAAACTAA CAAAACTCAA ACGCCCTCAG GAATGCAGTT ACGCTCCCAA CAGGCGACAG GGAATATAAA TGGTAGTTCC CGAAGTCGCG GACAGAATTC TCCTCATATG GCTTCGCCTT CTTCTGTGGC CGCTCAGCCT CCAACTAATC GACCAGAAGA GGCATCTTCC AGCCTGCAGC AGTCAGATGA TGATGACCAT TTGCGAGGAT CTTCTTCATC AGGCCCTTTT CGGTATGATA TGCTAGAATC CCCGCGTTAC GAGCATCAAC CTCAGCCTCA ACATCTACAA CAACATCCAC CCCAAACTCA CGCGTCAGAT CTGCAGTTCA ACACTGTTCG CCCCGGTAAT CCAGATATCT CTCACAGGGT AGAAGCAGTT CATGAGATTA ACAAGATAGT TCCATGCGAT GGATCAATAC GATTTGATTT TCCCACGCAT TCTACTTCAT CGACATCACC CCCCGCAGCC GGCCGTCACA GCGACAGTAA CGCTGAACCG CATAGTCATC ATCCCCACCA TTTTGGTCCT CCTGTCCTAG CTGAGAATGT CGATGTTGAG GAAGAGCAGA GCTATGGTAC TTTGGTGATG GGACAAGGAG GAACGAGCAA GTACTTGGGG CCGACTGCAG CGAGTGAATG GTTGAGGGAT GTAAGTCTTC ATGACTCCTA CTTGAATACG TACGCTGCTT ATTTGGACAG CAAGAGACAC GAGACCACGC TGAGTCGCGA GATCAGTCTC GATTACCATC TCCTGAGATC CCAGAGGCAT CGGTGCCCCC ACACAATCTA CGCCCCACAG CTGTTCCTCA TTCATTTCCT TTTCACACTT CAAAGACCAA AGTTGTCTCA TGGTCGAGAT ATGATCTTCT CTCCAGGCTA CCTGATAGAT ACTACGCCGA CATGCTTGTT GACAGTTACT ACCGTTCCTT CAGCTGGCAG TGAGTAAACC TACTGTCTTG GCATGAATCT CCTTCGATGC TCATACAAAT GATGAAGTTA CGACATCTGC CCTCGTTCTG AATTCCAGCC TGTCTACGAA GAGATGTATC TCTTTCGGAT AACAAGCATC ACATCTCCTC CTGAACCTCT CCAGCGACTT GGCCGAGGCA GGATGAACTA TCAGGACCTA 
GGTCTTGTTT TTATGGTCCT TGCTTTCGGC ACATTGCATT GTCTTGAACT GGCGCCCAAT GATCCTACTG CATACGAGCT GGCTTCTGTA GCTCAGGTCG CTTTGTCAAA AGGCGATCTT CTGTCGCGAC CTTCAATTTC AGGGCTTCAA GCCTTGGTAC GTTCTCATGT TTTATCTTAC GATTGAAGAC CAATCAGGCT GACTGTAGTT TCTTTTAAAA AAGCATATAT TGGCCCAATT CAGTAACGAA TCTGAAGAAG GGCGAAATGG AGATGCATCC TGGCCATTAT GGGGTCTATC GATGAGGTTG ATACAGGCGG TGAGTTTGAT GGCCCTGAAA TATATCTGAT ATTCCGCATG CTCCGAATGC TGATCGCCGC TGTATCTAGA TGGGTCTTCA TAGGGATGGT GCCAGATGGG ATCTTGATCC AGAACTTGTA GAACAGCGAC GGTCAGTCAT TTCCTGATTC GACTGCCTGC CATGTGCTGT CATTGATGCG CAAACATTCA GCCGAGTATT TTGGGAATGC CACAGTGCCG ATATCTTCCA AGCAAACTGT AACAGTCGGC CGTTAGTTAC ACAGTTCATT GCCGTTATCG ATTACTGACA TTGTTTCGCA CACTATTAGT AACACTCTCT GGCCAGGCGT GTATGACACC GCGTACCCTT CACAACCCCC AATTCATATC GAGAAGGACT ATTATACTCT GAAGTTTGAG CTCAGTCGCA TAGCTGCATC GTGAGTGGCA ACTAATAACT TTTAGAGCAC CAATCTAGTA TGTTCGAGCC TGACTATCCC GGTGCAGAAT CCTCGGCTCA GCAACTAATG TTCAAGCTCC TCCTTATTCT ACCATTATGG AGCTCTCTCG TCAAGTGTGG GTATCACTTT GCCCTCCTGG TGGATTTGGG TCGCTGATAC TAGGTTCCAA CGAACAGCAC TGATTTCGAG CGTCAGGTAC CTTTCAATCT GCGATGCCGA CAAGCCCTCC AGGCTTTGCC TAGTATCTAT CCTGATCCTC AGGCTGCAAT TGACAACAGT CCTGAAATTA GCTCGAAAGA ACTTCATAAG ACTTTTCAGG TGAGCTTTTC GGGAACAAAG CTGTATGAAG ATCACTCAGC CGGGAGCTCA CGTTGGCAAC AGAAATTTTT CTTGTCCATC AGCATCTCCG AAACCCTATT ATTCCTCCAT CGTCCCTACT TCGTTCGTGC TCTCATCAGG ACCTCTCCCG ACCCCTCACA GTCCGTTTAC GCGCCTTCAT ATCTCACAGT TGTTGAGAGG TGTAATGTGA GAATTTGTTC ATCTGCCGGA TGGCCACACT TCCAAGAAAG CTTACATTTT TGCTTGTAGG CAATCGTTCA ATGCGTTGTC ACGATCCATA AAATCCATCG TAACGTTTCA ACCCGGCACT GGCCACTATG GGCAAGTTCC TATCGATATG AGTTAAAATT GATGGCTGAT CCTTTTAGCG CAGTATCATG CTTTCAATTC TGCCGTCTCA ATGGGTACTC TGATATTTAA GTCACCTCAA AATCCTCTGT CGGATTTTGC CCTCGGTCTA ATCAACACCG TGATTGAGGC GTTCACGTCG GCTGTCCAAA CGGGCTCTTC TCCCCGCCTG TCCACCAACC TCCAATGGCT AATTAGGCTC AGGCGTAGAT GCCAAGACGC TATTGACAAA TCCAAATGTG AGAGTGCCGG CCAACAAACA ACCAGCGATA ATAGCACCTT TGCTCTCCGT GAGGGCGATA CTGGCGACAA TAGCGATCAA GATGACCCAG ATAACTTTTC TCTTCTTGGT TGGAGGACGA 
GGTTAATCCA GCGTGCGGCT TTTGGTGGAC AAGTGGCCAA GATTATTTCG CAGTCCACGC CATCGCAGTC AAATGACAGT TCTCCTCAGA TTATGTCCTC GAATATCAAC CTGCTTCCAG ACATCTTGGG TCCTACGATG GGTGGTTTAA CAGGAGCAGG TGTCCTGAAT CCAATACCGG GAGCCGAGAA TATAGCGAAT AAGCAGATGT CATCGAGTAC GAGCACTGCG GGCGATCAAA CTGAGATTAA TGAAACTTTT ACAAATCAGC TTGTGAGTCT ATACCGATCA AATAGTCCTT CCGAGAATTA ATCTCGCGAG GCTGGTATAA TTGCTTGGGA CGCAATGCTG ACGTTTAAAA CAGTTGCAAC AATTCTGGGA GCCGATGCTG CTACAAGATG CTCCTAATGT AGGGGATAGT ACCTGGGACC CATTCCAGAC GGTTAACTGT TGGGATTGGA ACATGCATCT TTATAGGGAA GAAAGGTCTC ATCAAGCGCC CAATCCAGAT ACGCATGCGT GAAGAATGTT GTTGTGGTTA TTGGCGAAAT TTGTGTAATG TTATCATTTA TATATTGGGT GCGACTGTGT TCGG
Protein sequence | MSPTSFQNSA GPGPLLNQHS RSPPSASNVS SRDRQGTAES DIEAREENGA SKAPLRKKQR KQRPVFSCAE CRRLKLKCDR QVPCDNCVKR RCHSICPDGV RVTRQYLANP DSALLTRLEQ LEGIVSRHRL DPVITEAASA KETNKTQTPS GMQLRSQQAT GNINGSSRSR GQNSPHMASP SSVAAQPPTN RPEEASSSLQ QSDDDDHLRG SSSSGPFRYD MLESPRYEHQ PQPQHLQQHP PQTHASDLQF NTVRPGNPDI SHRVEAVHEI NKIVPCDGSI RFDFPTHSTS STSPPAAGRH SDSNAEPHSH HPHHFGPPVL AENVDVEEEQ SYGTLVMGQG GTSKYLGPTA ASEWLRDQET RDHAESRDQS RLPSPEIPEA SVPPHNLRPT AVPHSFPFHT SKTKVVSWSR YDLLSRLPDR YYADMLVDSY YRSFSWHYDI CPRSEFQPVY EEMYLFRITS ITSPPEPLQR LGRGRMNYQD LGLVFMVLAF GTLHCLELAP NDPTAYELAS VAQVALSKGD LLSRPSISGL QALHILAQFS NESEEGRNGD ASWPLWGLSM RLIQAMGLHR DGARWDLDPE LVEQRRRVFW ECHSADIFQA NCNSRPNTLW PGVYDTAYPS QPPIHIEKDY YTLKFELSRI AASILGSATN VQAPPYSTIM ELSRQVYLSI CDADKPSRLC LVSILILRLQ LTTVLKLARK NFIRLFSISE TLLFLHRPYF VRALIRTSPD PSQSVYAPSY LTVVERCNAI VQCVVTIHKI HRNVSTRHWP LWYHAFNSAV SMGTLIFKSP QNPLSDFALG LINTVIEAFT SAVQTGSSPR LSTNLQWLIR LRRRCQDAID KSKCESAGQQ TTSDNSTFAL REGDTGDNSD QDDPDNFSLL GWRTRLIQRA AFGGQVAKII SQSTPSQSND SSPQIMSSNI NLLPDILGPT MGGLTGAGVL NPIPGAENIA NKQMSSSTST AGDQTEINET FTNQLLQQFW EPMLLQDAPN VGDSTWDPFQ TVNCWDWNMH LYREERSHQA PNPDTHA
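
The gene sequence in this record is displayed in space-separated blocks of ten bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch of that step, not the method the database itself uses to derive the listed GC value. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a possibly block-formatted sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
example = "GTTTGTCTTT GTTTCTTCCT"
print(clean_sequence(example))  # GTTTGTCTTTGTTTCTTCCT
print(gc_percent(example))      # 35.0
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content listed in the record.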