Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF04880 |
Symbol | |
ID | 3258266 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 1420017 |
End bp | 1423302 |
Gene Length | 3286 bp |
Protein Length | 849 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257606 |
Product | hypothetical protein |
Protein accession | XP_571456 |
Protein GI | 58268600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCCA ATCCTCCCAC CATGCGCCGC GGTTCCATCA TAGGAGGAGG AGGAAATGTT GGCGGCAGCG AGGACTGGCG ATCAAACAAA GATTCGCCGA CGTCTCCCTT TCAATCTACC AAGTCTAATC ATCGACGAAG TCGGTCGCCG TCCCGCCGAG CTCATTCGAT AGGTGAGTAC CAGCGGCCTC CGCGGTATCT GTACGACTCG CCTGCTCCAC CTCGTGACAA CCCCTTTTCC TACAACCTAT CTTCATCGAG CCTTAGCCAT AGCAGTAACC CTTCTATCCC TCCTCAAACC CAAGCCCAAG CCCAGGCCCT TCATCAACTC CCATCCCAGT CCCATCTGCA TTCTCATCCT CATTCCCAAA CGCAACTTCA TCTCCCCCCC ATCTCCTCAT TCAGCACCTC CACCTCCGCC GGCACAACCA TCCCTGTCGG ATCTCCTGTG GAACACTCTC ATGGTACCCT CGTCCTCGGC AAATCCGGTC GTTCCCGGTA CTTGGGTCCC ACAGCCGGCA CTGAATGGCT TAAAAACCAA GAAGTCGGTG GGATCGGTAT GGAGACACCT TCGAACTCTC GTTCACCTGT GGATACTACC AATCCTGGGA ATGGAAGTCC TGCAGGAGGA AGAGGGGAAA AGGGGGGAGA GGAAGGTGTT TATCATGGAA GAGAAAGTGG AGGAGCATAC GAAGACCCAC TATACTCCTT TCCATTTAAC GAATCTGGAG AAGTATCGAC GGTGGAAGCA TTGTTTGCGC GATTACCCCC AAGAGCAGAT GCAGAGACGT TGGTAGATTC TTATTACCGT TATTTTGCTT GGAAGTAGGT TGCTCGTCCT TTTCTGGCCC CGGATGGGGG TCATGTTTCG CGGAAACTGG ACTGATATTT ATTTTTCCCT CACCTCCGCG CCCACTAGCC ACGACCCCGC TCCCCGCCGG ACCTTCCAAC CAATCTTCGA CCGCGTCTAC GCCTCTCTGC TCCATCCCCG TCCCGAAAAT AGCGTCCACC TCCAACAACT TGCTCTCGTG TACATGCTTC TTGCGATGGG TACCGTCCAC AACATAGAAT TACCCCCCCA CGATGAGAGT GCGGAAGAGT ACTTGACTTT GGCGCAAGCG GCTATGACGA AAGGGAATTT TATGAACCAT GCGACAATTG CGGGGTTGCA GACTTTGGTG AGCCTTCTCT CTTTCCTCTT AACCATTTGC ATTGGATAAG ATGAAGATGA CTGAGCGATT CTAATCGTGT TTCGCCCTTC TTGTAGGTAA CGATGGCTCA CTATTACCTC GAAACGGAGA GCGGACGAAA CGGGGATTCC GCGTGGCCAT TATGGGGTCT AGCGATGAGT CTTGTCGTCG CTGTAAGTCG TTTGTTTCCT CAACTCCCAA CCCGATTTTA CTCCTTCACC CGGTCCCTCC CACCCTCATC CATCCCTGCG CTCCCTTCCT TGCCCTTTCT CCTCCTCACT TCCCCTATCC CAACTCCAAC ACCAAACGTG TCTCTTTTCG CGGACATTCA GAGTTTAAGG TATAGGCTGA TGGATTCCTT TTGATCGTAG ATGGGATTAC ATCGAGATGG AGCGAGATGG AATTTGCCAG ACGATGTTGT TCAAGAAAGG CGGTATGTTT TCCACTTGTC TTTTTCTATT CGCTTCGAAA TGTATTATAC TGAAGCGACT TCTCCAGCCA AGTATTTTGG GAATGCCACA CCATCGAAGT CTTTCAAGCC AACTGTTTTT CTCGACCCAA CACCCTCGTC CCGCGCTACA TTGACACCGC CTTCCCTTCC CCCAACTCCG CCGAAGTCGC CATGGGTGGC AAAGGTTGGC CCACTCTCAA ATTCGAGCTC TGCCAAATTT CATCTCAGGT TCTTGATGCG GGTATGACCG TTCATTTTCA ATCCTACGAT TCTATCCAGA AACTTTACGG CCAACTATGT GAATTCGAGT TGAACGTCCC TTACGACCTC CGATGTCGTT CTGCCCTCTT GGCGTTACCG TCAGTATACC CTGACCCGGA GATGGCAAGG AAAAATAGTC CAGAGATAAG CCGGCACAAT CTCCATAGGA CATTGCAACA GTTCACGCTC TCGTTGAATA TATCGGAGAA TATACTGTTC TTACAAAGAC CGTATTTTGT GATGGCGATG CATGATGAGC CAGCAGATCC GACGAGGTCA GTGTATGGCC ATTCGTATCT TGCTGTCGTA GAAAGGTGCA ACGTAAATGC TTTTCCTCCT TTTCTGATCT CCGCATTTTG CTAACTTTAC CGATAACAGG TCATCATCCA AGTCGTCTCC GACTTGTACA AACTCCACCC CACCATCATC TCCCGTCAGT GGTTCTTCTG GTACCATCTC TTCACCGCCG CCGTTTGCTT GGGCACTCTT ATTCTCAAAA ATCCCCAATC TGCTCTTGCA ACATTTGCCC TCTCCCAAAT CGAACAAGCG ATCAACGTTT ATTCGGTACT GATCAAGCAG AATAACTCAC CCTCGATGGT GCAGAATCAT GATTGGTTAC TGAGGCTTCG ACAAAGGGGT GCCAAGAAGA TTGCGCAGGC AGCTGGAATG GGAGGGACGA ATCTGCCCCT GGGCGTTGGA CCTGGAGGAG GAGGTGGAGA TGGGGACACA GGAGGCCAAG AAGAAGAAGA TCGTGAACTC CTAGGCTGGA AAACCCGACT TATCGAACGC GCTGGCTCTG GCGTCCACAC TGCCGTCAAC ATCTCTTCCT CCAACCCCGC CAGCTCAGTG CCACACCGTA CACCCTCGCC TGGATCTTTG ACGCAAAATG GCGGCGGTAA TAGCGGGATG ACCCCAGCTA TGCACTTGCT TCAGCAACAT TTTGTACCGC CTTTCCAAAC TCCGCCTGTT AGTGGAATGT TGGGTACGAC GGCGCAGACT CTGGGAATGG ATAACTCGAC GGATTTACTG GTGAGTAAGA GCTAACTGAT TTTGTAGTCA CTCGTCTCTT CTGCGTTCGG AGAACGATAA GCTGACAAAA TTGCGCGCAA CAGCTACACC AATTTTGGGA TCCAATGATG ATGGCAGATT CCACAAACAT GACCGTACGT CCTTACTTTT TTTCATTTTT TCCTTTCATC CTTGCTGCTC TCAATATCAA AGCTAACTGC TCTTTTTTTT CCCTGTTAAT ACAGCAGAAC GCAAATTGGT GGTCGTGGGA TTTTGGAGGT CTAGCGGAGA ACGGCACTCC CATAGCTGGA GGAGCTGCAG GATCGCAAAC CCAACCTCAA GCAACCCCCT AACCCTAGAT TTGAGAAAGG ACCTGT
|
Protein sequence | MDSNPPTMRR GSIIGGGGNV GGSEDWRSNK DSPTSPFQST KSNHRRSRSP SRRAHSIGEY QRPPRYLYDS PAPPRDNPFS YNLSSSSLSH SSNPSIPPQT QAQAQALHQL PSQSHLHSHP HSQTQLHLPP ISSFSTSTSA GTTIPVGSPV EHSHGTLVLG KSGRSRYLGP TAGTEWLKNQ EVGGIGMETP SNSRSPVDTT NPGNGSPAGG RGEKGGEEGV YHGRESGGAY EDPLYSFPFN ESGEVSTVEA LFARLPPRAD AETHDPAPRR TFQPIFDRVY ASLLHPRPEN SVHLQQLALV YMLLAMGTVH NIELPPHDES AEEYLTLAQA AMTKGNFMNH ATIAGLQTLV TMAHYYLETE SGRNGDSAWP LWGLAMSLVV AMGLHRDGAR WNLPDDVVQE RRQVFWECHT IEVFQANCFS RPNTLVPRYI DTAFPSPNSA EVAMGGKGWP TLKFELCQIS SQVLDAGMTV HFQSYDSIQK LYGQLCEFEL NVPYDLRCRS ALLALPSVYP DPEMARKNSP EISRHNLHRT LQQFTLSLNI SENILFLQRP YFVMAMHDEP ADPTRSVYGH SYLAVVERCN VIIQVVSDLY KLHPTIISRQ WFFWYHLFTA AVCLGTLILK NPQSALATFA LSQIEQAINV YSVLIKQNNS PSMVQNHDWL LRLRQRGAKK IAQAAGMGGT NLPLGVGPGG GGGDGDTGGQ EEEDRELLGW KTRLIERAGS GVHTAVNISS SNPASSVPHR TPSPGSLTQN GGGNSGMTPA MHLLQQHFVP PFQTPPVSGM LGTTAQTLGM DNSTDLLLHQ FWDPMMMADS TNMTQNANWW SWDFGGLAEN GTPIAGGAAG SQTQPQATP
|
| |