Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF04020 |
Symbol | |
ID | 3258081 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 1164709 |
End bp | 1167922 |
Gene Length | 3214 bp |
Protein Length | 788 aa |
Translation table | |
GC content | 46% |
IMG OID | 638257520 |
Product | conserved hypothetical protein |
Protein accession | XP_571363 |
Protein GI | 58268414 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.43338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGGTCGGTG CGTGATATAA AGGGATGAGC ATAGAGAACT GCATCCCATT TCTACGTTTT CCATTCTTAA ATGATTCATC AACAGCCAGT AATGAGAACA ATCCCATTGT TATGCTTCAA CGACGTTTAC AGAGTCAACC AAAAGTACAA CCCTCAACCC GGAGCTCCCG AAGACAACTC ACCGGACAGG ACAATCACCG TCTCCCAGTT CGCAGAGTTG CTTTTGAGCG AAAGAAGCAA ATGGGCCGAT AGACAGAGTG AAGATCAAGA TGATCAAGAA GGTCCGAGCA AAGAAGGATT GGTGCTTTTC GCTGGTGATG TTTTCAATCC CAGCGTCGAA AGTTCGGTGA CCAGAGGATC ACATATGGTA AGTTATAACT TGAAAATCCG CCAAGAAGTG CTGCTGAGTT TATCTACAGG TACCAATCAT GAATGCTCTG AAGGTGGACT ATGCATGTGT AGGTGAGTAT CACTCGCTAT AACATCAAGT GATTAACTGC TGATGCTCGG TAACCAGGAA ATCATGACTT CGACTTTGGT AAGCAAGCGG CAATAGAATT GGCCTGCAAG TCTATCGACT GGCACTAATT GAGCTTCGTG CAGGCTTTCC CCATCTTACA AAACTTGTAG AATCTACCAG CTTCCCGTGG TTACTCTCGA ACATCGTTGA CACCAACACG GGTCGTCAAC CGGAACCTCT CAAACGATTC ATCGTTACGG AGCGATGCGG AGTGAAGATT GGCCTCATTG GTTTAGTTGA GAAGTAAGTT GGAGTCGGAA CTGCAAGTGA CTTCCGTCAC TGATATGGAT TCAACATCTA CAGGGATTGG ATAGCCACAA TTCCTTCCTG GCCTCATAAT TTCAGATATC GTTCGATGAA AGATACAGCC CTGGAGTTGT CTCGAGAACT TCGTGATCCC AACGGTGAAC ACCAGGTGGA CATTATTATT GCTTTAACAC ATTGTCGAGT ACCAAATGTA AGCTATACCG CCTTTCCTTC GTTACGATCA GCGGAGTTAG CTGACCCGAT CTAGGACATT CGGTTGGCTA TTGAACTTGG AGCAGTGGCA GACAAACCCG GAGTCGAGAA TGAACATGGT ATAGATCTCA TTGTTGGAGG TCACGACCAC GTAAGCAGCT TGCCAAAACA TTGATATCGC AGAGCTGATT TTCAAATCTT TTGACAGATA TATTATGTAT GTTGCGCAGA GTTTTCTTTT GGTCTTTTTT TGTTTTTTTT TTCAGAAAAA GTGTATCTCT AACCTTTCGT AGATCGGTAA AGGTGCAACA TCTTGGGAAG GTTATGCTGG ACGAAAGGAT GTGCCCGGAA CTACGGAAGA CCATGGTGTT CGGTAAGTGT GATTTAGTTT AATGTGATCT CTGATTTACA TTTCATGTAG ACTCATCAAA TCCGGTACCG ATTTCCGTGA TCTCACCTCT GCCAACCTCA CCGTCACTCC TACCCCCTTA GGTTCTATTC GTCGCCAACT CATTACATCT TTGACAGGAA AGCACCTCTA TGTACTCCCT TCTTCACCTT CATCACCACC GTTTGAAGAG CTTGTCAAAT CCTTGTTGTC TTCCGTATCT GAAGCCCTCA CCAAACCGGT ATGCTTTACT CTCACCCCAT TCGATGCCCG ATCAGAAGTA GTCAGAACGC AGGAAAGCGG ACTAGGCAAT TGGATCGCGG ATGTACTGAT GCATGCGTAC GCCGAGAGCA TTGTACAGGC GAAAGGCAAA GAAGGAGGTC TCGGAGAAGA GTTTGAAGGA GATGCTGATG CAGCCATTCT CTGTGGGGGT ACATTGAGAG GCGATTCGCA ATATGGGCCT GGAAAGATCT CTTTGGGTGA TATTTTAGGT GAGCTTCCTG AAACCATGCC GGCCTGTATG TTGAGGACCT TATTCAGAAA TCCTTCCTTT CGAGGACCCC GTTGTCTGCA TCGAGGTACG TCAGGTGCTA TGTATGGTCA CCAACTAGCC GTTAACATTT TCTAGCTTGA CGGCAAAGGC ATATGGAATG CTTTAGAATC TGCTCTTTCG AAATGGCCGG CTCAGGAAGG CCGTTTTCCC ATTGTCTCTG GTCTCGCCGT CAAATGGGAC CACACGAGGC CTCCTGGACA GCGAATTTTA TCTATACACC AGATTGCGCA ACCGAAGAAA GACGACGATG ACTGGGAAGA CCCTGCGGAC ATGGTAGATT TCAAGGAACA AGAAGATGGG ACAACAGTGG TGGTTAAACA GAAAAAGTTG CAGTTGGGCG AGGAAGTGAA GAATGAGGAA GGCGGAAGGA TGTACAAGGT CGTGAGTGTA GCTTCTTTTT GATGAGGCCT GAATCATCAC TGATACATGT CGCAGATCAC TAGAGATTAC ATGGCTCAAG GATATGACGG ATTCGAAGCA TTGAAGAATA GAAATTTTAT TGTAGACGAT GAGAATGGAC AGATTATGTC CAGTATATTG AGAAGCTTCC TTCTCGGTAA GTCTAAAGTT ATCAGCCGTG ACCGTAGTCT GACATTACAA AAAGGATCCT CTTATATCTT CCGTCACAAG CAGCTTGAAG AAGCTGCCCA CTCCCACCTT TCTCGTCGAA CCTCGCAAGT GCTACTTCGT GCCCGTGCTG AACACACATC TCAGTCTCAA TCATCTCCTT CGGCATCACT CTCTTCTTCA CCGAAGAGGA ATCTTCTGGT TACTCAATTC AAATCATCTC AGGACCGCAT GACTTCGCCC AACCACCTCA CATCTGCTTC TTTGCCCTCT AATGCCTTGT CGCCAAACTC CGAGATATCT GAATCTTCCG CTTGGGGTAG ACTAAGAAGG CACGTCGTAC AACATGACTG GGGAACAATT AGGAACGCTT TGCATGTTGC AAAGCATGAA CATATGAGCG GATTAGACGT GCTGGTGAGT AGATTCCTCC CGCATGCGGT AAGGCAGTCG CCGATACTTC ATAGGCCGGT CAAGCTATGC GTCAAGCCCG AAATCACATG CCTGGAGCTT GGTCACCAGT TCAAACCCCT CCAACACAGG AAATGCCTGA ATACGATGGC ATCGTACTTC CTAAAGACGA GGATGGTGAG ACTAATCTAG CGGATCTGGC CATCGTAAGC CCTTTAGTGG ATGGAAGGAT GAGAGATATT TCGGCCGATA AGCGGTAGCT AGTCAATTTG GACGGAAGGG AAAGGTAAAT CTATCTGTAC TTTACAAGTT CGAA
|
Protein sequence | MIHQQPVMRT IPLLCFNDVY RVNQKYNPQP GAPEDNSPDR TITVSQFAEL LLSERSKWAD RQSEDQDDQE GPSKEGLVLF AGDVFNPSVE SSVTRGSHMV PIMNALKVDY ACVGNHDFDF GFPHLTKLVE STSFPWLLSN IVDTNTGRQP EPLKRFIVTE RCGVKIGLIG LVEKDWIATI PSWPHNFRYR SMKDTALELS RELRDPNGEH QVDIIIALTH CRVPNDIRLA IELGAVADKP GVENEHGIDL IVGGHDHIYY IGKGATSWEG YAGRKDVPGT TEDHGVRLIK SGTDFRDLTS ANLTVTPTPL GSIRRQLITS LTGKHLYVLP SSPSSPPFEE LVKSLLSSVS EALTKPVCFT LTPFDARSEV VRTQESGLGN WIADVLMHAY AESIVQAKGK EGGLGEEFEG DADAAILCGG TLRGDSQYGP GKISLGDILE ILPFEDPVVC IELDGKGIWN ALESALSKWP AQEGRFPIVS GLAVKWDHTR PPGQRILSIH QIAQPKKDDD DWEDPADMVD FKEQEDGTTV VVKQKKLQLG EEVKNEEGGR MYKVITRDYM AQGYDGFEAL KNRNFIVDDE NGQIMSSILR SFLLGSSYIF RHKQLEEAAH SHLSRRTSQV LLRARAEHTS QSQSSPSASL SSSPKRNLLV TQFKSSQDRM TSPNHLTSAS LPSNALSPNS EISESSAWGR LRRHVVQHDW GTIRNALHVA KHEHMSGLDV LAGQAMRQAR NHMPGAWSPV QTPPTQEMPE YDGIVLPKDE DGETNLADLA IVSPLVDGRM RDISADKR
|
| |