Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE00920 |
Symbol | |
ID | 3257697 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 247974 |
End bp | 252007 |
Gene Length | 4034 bp |
Protein Length | 1040 aa |
Translation table | |
GC content | 46% |
IMG OID | 638256678 |
Product | conserved hypothetical protein |
Protein accession | XP_570827 |
Protein GI | 58267342 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.491475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCAGAACAT ATCACGACAG TATACAGTAT GGACATCGAG AGACTCGTAA GGACGCACGT ATCCACGGCA GATTTGAATC CGTCTCAGGA GCTTGTTCAA GGTAAATGTG GCTCATTCCA ACCTCCGCTT GCTTAGAGCT CACAATCCGT AATTTTCAGG TGTCAACAGT GGTCAAGTAC AACTACTACA AGTGGTAAAG GCTTTGGGGA ATTACCTCAC CTCTACAGAA GATGATATCA GGCTTAAAGG TATGCTGTCG CTAGTGCGCC CACGGATGAC AGTTTCTGAT CTGTGTAGGG TTGACGTTTT TAACCAATCT CCTGGGTGTT ATCATACCTG GGAAGATCAA TCGTCAGGCG AGTATGTAGA GGTCAAAGTA TAATAGACAT AACGAAATTC TGATATGTCG CTTTAGCAAC GACCTTGACA AACTTCTATA TCTCAAAATT AGATGACTTT GAGTCATTAC CGCCAGCTCT AAGTGGATTA ACTACACTTT CGAAGTTAAC GACGTTCGAT GACACTGCAG CGGTCGATGT TTATAAAGGG TTTGTGGATG TCACATGGCA TGGCCAAATG TTGATAGCCT CCACAGGGTC GTGGAGAATG TCAATATCAA AGCATATGCG CAAGCTATAA GACACCTTGT CTACGTCTTG TTCGACAGCT TACTAGCAAC ACATCGAGAT GGTAAGTCTC TCCGTTTGTA TGTAGCCAAA TGATGATGTT GATCATGTCG CCTAGCTCTG AAGAAAATGG GAACCGCTTT CATCAATTCT TATACCAAAA TCGTCGACGG CGAGAAAGAC CCTCGGAACC TCATGCTTCT TTTCTCAATC GACCGAGTAA TCCTTTTGGA GTTCGACGTC AAGGATCATA TCGAAGACTT TTTTGACATC ACATTCTGTT ACTTTCCCAT TACCTTCCGC CCACCTCCTA ATGATCCTTA TGGCATTACT GCGGATGATT TGAAACTTGC TCTCCGAGAG TGTATGGCAT CTAATCCCTA TTTTGCAAAG ATGGCATTGC CTCTCTTTTT AGAAAAATTC GCCACTGCTA CGGGTGCTAC TATGGTAAGT GAAAACCTCC ATATTATATA GACCGTCTCT GACATCAAGA GCAGAAAGAT CTAATGCTCA CCATGGCCGC TTGTTTCCCA ACTTACGGTG CCGATGCGGT CAATGAACGT AGCAAAGAAC TCTGGGAGGG CATCAAGACC GAGATACTTT ACTCCTCTGA TTCAACTATT GAAGCCGCTG CCCTTTCTGC TCTTGAATCA CTCATGCGCG CACTCTACCC TAATGAGGAC AGTGTTCCGT CAGGTTTGGC GCAGGAGATC ATTCAGGAAT GTATGAAGTC CCTGGAGGAA CCAGAAAAGA ATCAGGCTTT AGGTTCGACC AAGATTATTG CCGCGATCTT CCGAGGTTCT CGTAAGTACA CACATTTTGT TGTTTTTTAA ACTAAAAATC GCTTGCAGCC TCGGCCGGTA AATTCGCCCT CTCGCAAGTA TTCCCTCAGC TTTTCCGAAC ATTCAACACT CCCACTGTAC CTTCTCACCG TGCTCCTCTC CTCACAGCCA TCTCTTCAAT TCTTCTCGCT TGTCAATCTA CCTACAACTC TTCCTCTCGA TCACATGAGC AAGAGCAAAA TTTGGAACCC TATCGAGGGG ACCTTTTGGA TATTCTGAGG GAAGGTTTAA GGACGGATGG CTTGAAAGGA CCGGCCATCA AAGGGTGTAT TGCTCTTGTT GGTGTTCAAG GCTATTGGAG CCGAAAGGAA GTGGAGGATG TTGTTAGGGG GATTGACGAA ATTCTGATCC ATGACGAAAA TCAGGAAATT AGGTAAATCA AGTTCTCTCC ATCATACGAG AAACATGCGC TAAGTGGAGA TGGAATAGAC CGGAAGTCAT TCAAGCACTT ATCACAATCT CCAAATCCCA TCCCACTGTC ATCGAGTCGC TTACTCTCCC GCTCTTATTC CACAACCTTC CTAGCTCGGC GCCTTCCGTA GAGGACTTCA CGGCGAGGGA GCGATATCGG TCTATCCTTG GTTCATTAGG GAGGCTCTGC ATTCAGCCAG CTCTGTGGGG CACAATGATC GTGAGGGTCA CTAGCAGACT TGACGTATTG GTCTCCGCTA CTCCTGAAAA CTCAGAGGGA GCTGATGTGG AAATGGACGA CATAGATGCT AGGGAATGTA ACATTGCCTA TGCGTGGGAC CTCCTCAACT CCTTGCTCAC GGTGATAGAG GCGAAGGTCA AACAAAAGCA TATGGATGTG GGCAAATATT ACGAAGAGTT GATGCCGAGA TTGCTGGGTT TGGTAGTAAG AGCGTCTCAG CAAAAGGTTG GCGAAAGCGG AGAACCTTTG TTCAAAGACA GGAGATTGGT GGCGATCGTA AGCAAGATTG AGGAAAAGAT GATCTGGGAA CTTGGCGCCG AGTATGTCGA TTGCTTCAAT TATTTGCGTA AATTGCAATT GACAATATGG CAGGAAACAA GAGAAACAGT TCAACCTTGT GTACAGAGCA TTCGAGCAGG GAGAGATGGT TGGTATAGTA CACGAAAAGT CAACGGTTCA ATCTTCTAGC CCTTTACGTG TAAGTATAAT GTCAGAGCAT GGTGCCGGCG GTTAACGAGC CCTTAATTCA GACTAGCGCA TCATCCGCAG AGCAAGACCT CATTGCGCTC TACTCTTCGG CCCTCCAAGG CCTTTCACCC CCTGTTTCTC TTCCATTGGC GTCATGCGGT GAGTATCTGA GAGGGAAGAT CCACTGGACT ATTCACGTCG CAAGGGATGA CTGGCAGGTT AAATGGGGAC TGCAGATGGT TTGTGCATTA GTGAATAAGA AGGACAATGG TGAGTTTCCT TCCAGTCTAC CCTTCTTTAT TAGTGCTGAG TTTATGAAGA TCTCAAAGAA GCCTTAGAAG GGGTTTTGGA GAAGATTTGG GCAGAGGTGC AGGATACTAC TCAAGACTTT GAGGTCAGAC GCAGAGGGCT ACTGGTTTAC TTCCACGTAA GCTGTCTTTT GGTCCTTCTG ACTACTCGTG CTGACATAAA TATGTGGCGT TACAGATCAT CAAAGCCCTT TCACTCCTAC GCCAGCCATT AGCATACACA GCCCTTGACA AAGTCATTGA AGTGTTGGGA TTGTTCAGCA TGGACCCTGA GTTTGTCAGT GAGGCAGCGA GAGCTTTTGG TGTGTTGGCG AAGAAGGGTG ACGGACATTT GATTGCCAAG GTGGGTCTGA CGCTTTCCTC GCTTTGTGTG TGACAGAATA GAGACTAATT GTTTGTAGCT CCTTTACGCT CAAAAATTGT GGAACTTTGT GCTTCCGAAA CTGATTGAAG GGGATAAGGA AGCTTCTGGT GAGCAGAACG ATTATACGGA AAATAGAAGA CAACTGACCA GGTAAACAGG CAAAGAAAGG ATAGTGTACC TTGTCGCTTT TGCCTCTCTC TTGCCTCTTG TTCCTCCCTC TCTTTGTCTC TCCGACCTTC CTACTGTGAG TAACTTATGG ATCAGAGGAA GTATGCTGCT AACCGTCCTG CAGATCCTCC CACTGATTCA ACGATCCTTG ACACTGTCGT CCCCTGTTCA GAGGACGAAC GTGATCCACG CCCTCATCTC CATTCTTGAG ACCCCTTCAT CCCCATCTAC TGATACTATC CTTCATTCTT CTGCCTCTTC CCTCGTTTCA GCCCTTCTAA CCTCCTCCGT TCCATCTCCG GAAACCCCTA CCTCCTCAAA AGTTAGACAA TCCGCCCTCG CTTGCCTGGC CATTATTCCC GATACGATCA GGTTTGAAGT GCTGTATAAG CAGAAAGCAG AGGTGATCAA GGAGCTGGGA AATGCGGTGG ACGACCGAAT CAGAAATGTG AGAAAAGAAG CTGTTGAATG CAGGGCAAGG TGGTATAGGT ATGGTCAGGC GACTTAGTTG TATCAATATC GCATAAAGGG TTTGTTAAAT TTGGGGCATA GATTACAAGT ACTTTTGCAT ACCAACAACC TAATTAACTA CATGCAGATG AGCT
|
Protein sequence | MDIERLVRTH VSTADLNPSQ ELVQGVNSGQ VQLLQVVKAL GNYLTSTEDD IRLKGLTFLT NLLGVIIPGK INRQATTTLT NFYISKLDDF ESLPPALSGL TTLSKLTTFD DTAAVDVYKG VVENVNIKAY AQAIRHLVYV LFDSLLATHR DALKKMGTAF INSYTKIVDG EKDPRNLMLL FSIDRVILLE FDVKDHIEDF FDITFCYFPI TFRPPPNDPY GITADDLKLA LRECMASNPY FAKMALPLFL EKFATATGAT MKDLMLTMAA CFPTYGADAV NERSKELWEG IKTEILYSSD STIEAAALSA LESLMRALYP NEDSVPSGLA QEIIQECMKS LEEPEKNQAL GSTKIIAAIF RGSPSAGKFA LSQVFPQLFR TFNTPTVPSH RAPLLTAISS ILLACQSTYN SSSRSHEQEQ NLEPYRGDLL DILREGLRTD GLKGPAIKGC IALVGVQGYW SRKEVEDVVR GIDEILIHDE NQEIRPEVIQ ALITISKSHP TVIESLTLPL LFHNLPSSAP SVEDFTARER YRSILGSLGR LCIQPALWGT MIVRVTSRLD VLVSATPENS EGADVEMDDI DARECNIAYA WDLLNSLLTV IEAKVKQKHM DVGKYYEELM PRLLGLVVRA SQQKVGESGE PLFKDRRLVA IVSKIEEKMI WELGAEKQEK QFNLVYRAFE QGEMVGIVHE KSTVQSSSPL RTSASSAEQD LIALYSSALQ GLSPPVSLPL ASCGEYLRGK IHWTIHVARD DWQVKWGLQM VCALVNKKDN DLKEALEGVL EKIWAEVQDT TQDFEVRRRG LLVYFHIIKA LSLLRQPLAY TALDKVIEVL GLFSMDPEFV SEAARAFGVL AKKGDGHLIA KLLYAQKLWN FVLPKLIEGD KEASGKERIV YLVAFASLLP LVPPSLCLSD LPTILPLIQR SLTLSSPVQR TNVIHALISI LETPSSPSTD TILHSSASSL VSALLTSSVP SPETPTSSKV RQSALACLAI IPDTIRFEVL YKQKAEVIKE LGNAVDDRIR NVRKEAVECR ARWYRYGQAT
|
| |