Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK02820 |
Symbol | |
ID | 3254679 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 830413 |
End bp | 834465 |
Gene Length | 4053 bp |
Protein Length | 858 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253773 |
Product | conserved hypothetical protein |
Protein accession | XP_567877 |
Protein GI | 58260934 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.438895 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATA TGCCGCTGAT TTTTGTGCGC CAAAAGGCTT CGAAATTCGT CTATATCCCA CATGTCTCAT CAGTTCTCCA TAAACAATCT GTTCTCGGTA AACGGTCTCG ATGTCCTCAT CACTGGCGCC GGAACAGGTC TCGGTGAGTG AAGCCTTGGT ATGGAAGGAA AATCTAATGA ACTAAAATAT ATACAGGACT CTATATGGCC AAGGGCATGG CCATGAACGG TGCAGTCACC CATATCGCAG GCCGCAGACG CGAAAAACTG GAGGAAGCCA AGGTCACGAT TCTTGAACTT AACCCAGAAG CTCACGTTCA CATGTGAGAG ACCGTTGGTC TGTAATCAGT ATCATTATTC TCATACGTAC CCAGTCATGT GGCAGATATT TCCGACAGAC CCTCCATCAC CTCACTCGTC GCTTCCCTGA GCAAGCTCGA CGTTCTAATC AACTGCGCAG GTATCGTGCT TCCCGACACG CCTTGCAATC AGTTCACCCC TTTGCCCGAG CTCCAAGCTG CCCTTTTAGC TTCGCCGCAC GAAACATGGT CTCGCACTTT TTCAACAAAT GTTGAATCCC TTTTCTTTTT ATCGGCATCC GCGCTTCATC TACTAGCTGC AGCTCCAGCG GGCGGACGAA TCATCAACAT CTCGTCCATC GGATCCACCA TGTCCGACCC TGTCATAAGC CAGCCTGCCT ATCAAGCGTC AAAGGCGGCT CTCAATCATC TCACTCTCTT GCAAGCCAGT AAATTCCGTG AGCATGGGAT CCGTGTAAAT GCAATCTCTC CTGGGTACTT CCCTAGTCAG ATGAATGATC CTAGCAATCC TAAGAGCATG TTAGCCAGGG CACATGAGCT AGTACCGCTC AAAAGAGGTG GCAAGGAGGA GGATATAGCA GGTACAGCAA TCTGGTTGGC TAGTCAAGCG GGAAGCTATG TGGATGGGCA GGTTATTGTC TTAGGAGGTG GCAGAGAATG GGCTTGAGAT GCATTATAGC ATTTACTATT TTAAACCTGA TCAAAAATGG CGATGCATCC CTCTAATTAG CCATTTCATA ATATAACCAG TAATGTGGAT TTCAGACGAC TACATACTTG CATTGTTGTT GAAATCTATA TGCAGCGCCG GTTGATTATT ACTGCAGCTC CGAGCTTGAT GCCCGCGTCC GGGGGGAAGA CCTGGTCATC AAGTTCGGAG CTGAGGAGTT ATGTACACAA TCAGCGCCTC AGCATGTCTT TCATCGTCAA ATCGCCAATA TGTCGCAAAC ACATCAAATA TGTTTTCCTT ATTCCATGGT ATACAACATG AACCCAATCA TGAACAGACT TCCTATCAAG TAACCACCCA CAGATGAGAA CATCAGCTTC TTTCCTAATC GTTTTCGCTG CTGCCATATC GCTTGCGGCT CCTTACCAAC AAAAGACCTT CAACAACGGA GCATTATCCA GCTCAGGAGC TTTTTCGGAG AACACTGGAC TGCCATGGCA AGATGTAGTC TCCATTCCCA TCATAGACAC TTGGACACTT AATCCTGCCG GTGACGCTTC TGTCATCCGC ACTGCTACCA TAGATGTTGA AACAGATATG TAAGTGCCTT TGGGGCCAGT AGCTCGTTCT GCTTATCATA ACTAGCTCAT AGTTGATCAT GATACATCGT AGACCATCGT ATGACCTCCA TCTTCTTGCT ATCAATGCCC AATACACGGT CCCCACCGCC CTTCATGGTC CGACCCATCC GGCTGCACTC TATTCGTTCA TCACCAACGA TATCTTCGTT ACCCTTCTGC CTTCACAGTC TTCCTTCGGT TCCTGGGATC TTACGCTTCA AGTTCTCAAC TATACAACTC TTCCACCAGC ATGGGCACCA ACTGCCTCTG AGCCAACTAT CCTGAAGACC GTTCGGTTGG GAAATAGGAG GCCAGAGAGT ATGCTGTATT CTGCCGGACC GAAGATACTC TCTATTGTTC TTGGTGGGGG TTCCAAAGAC GAATTTGCTG GATCTCTTGG CCCGTTGACA CTACTGTCCA TTTCTAAACT AGATGAGGTC TGGACAGTCG AGCAATTAAC CGTTTTCCTC GAAAACGACC TGGTGAGTCC GATCATCCTG GCTGACCTAT GAATAGATAC TTGATTGAGC GCCCAAAAAT GCAGATTATC GACGAACTCG CCGCAAATGA TTCTGCCATA GCTTTCACTG CTCATAAGCC TGTCATTAGG CCCAATAATG CTACCCGAAG CATCGTAAGT ATGAATTCTA GTCATACTCG TATTTGATGA AACCAAAGGC TGATTTGCAT GTAGCTCTAC TTCCTTGATT TATTCTCCCC TTCATTCGCG GTACAAATTA GCTCAGGCAG TTTTGGTGCT GTATTTTCTC CAGCCCTGAG CTCTAATGGT CAGATCGCTT GGCTTGAGCA ACGAGATAAT GGGAATTGGG GAGGACGAAA AGACCTATGG ATGTATGATG GTCATACCCC TTGGAAAGTT CCATTCAAGG ATTGGGACCT GAGTCCATCA AGGGTTATCG TAAGCCATGA AATCCGCTGA CCAGTGACAC AAGCGGCACA TCATTTGACT CACTTGGATC CACAGTTTTC GGAAAACAGC GAGGCTCTCA ATCTTCTTAC TCTCAATGAC CAAGACACAT CACTTTTCCA CATCTGGACC CCTACTCGAT CATCGCCCCC TTCTACACCT GTGCGGATCC CGTCTAATGG CACAATCCAC TCTGTATATC ACGTCGGCAT TACCCCTCTT GATCATTCAC ATTTAATAGG CGTGATGTCT TCTCTTACAT CGGCTCACGA ACTTTGGGTC ATTTCACACT CGCCTCATGA TGATCCGACC TACAATTATG AGAACATCAG GTTGACATAT TTCAGTGAAC CGGTACTACA AGGAAGACAG CTAAATGCGG GTGAAAGTAT AGAGTTCGTG AATGAGTTAG GGTTGACCGT GAAGGGGAAG GTATTTCTTC CTAGTAAAAA TAAATCACAG GAGAAGGTAC CAGTCGTGCT GTTGCTGCAT GGCGATGGTA ACAGCGAAGG ATGGCGTAAT CAGTGGATGC AGTATTGGAA TCAGAACGGT AGGTCTTCTA ATTGATCTCC TTTCCATTTC AGGACTGAAT ATCGCAGCCT TGACCAGTGA GGGATATGCT GTGGTTACTA TCAACCCTAC AGGCTCCGAA GGCTACGGTA ACTGTGAGTG CAAATGCATA TACGATCTTC ATTTAACATG TTACTAACTG GTTGCTAGAC TTTGCCCAGT CGGGCCGGTT TAATTGGGGT AATCAAACTA TAAACGACAT TTCCCGCGGT CTTTTCCATT CTTTCAACCT ATTTCCTAAC CTGAACAATA CCAGCGTCAC GGCAATGGGT TATGGCGCCT ATGGCGGATT TGTTATTCAT TGGATACAAG GCCACTCCAC CGCCTTTGTC GCTTCCAATG GGGAGCCAGT GAGATGGAAG GGATTAGTGG TTCATGATGG CGTACTGTCT CCTAGGTGGT GGGCAGCAGA GACATCTTGC CCAGCGAAGG TAGAGTGGGA ATTTGGCGAA GTCTCGTATG ATGATGAGTC ACCATTGTAA GTTATTGGCA TCCCTGCTCC CGTGCCAATG ACTGAAATTA ATCTATGTTT ATCAGTTCGC TCTGGGATCC TGAGCGTTCG TCCCGTGAAT GGGCCATACC AGAATTAGTT ATCCACGATG GTCGAAGTGA GTTCCGAGAT GTGTGCACGT AATCCACGCT AAAATCTGCC CCAGATGACT GCGACGCTGG CCCTCTATCG CAAAGCTACG CCTCTTTTGC GCTTTTGCAA AGTCGAGGAG TCAACAGTGA GATACTGGTA TCAGATAGAT GGGCGTTTTC CAAATGGCAT AGGGCCATAT TTGATTTCCT TGAGTCTTTG TGATATCCCA TGCAGAATGA TGGATACCAT TACAACTGGA ACAATTAATG TGAATCATAC AAACTCAATC GAACTCCATA TGCACTAGCG CTATGAATGA CTGTCGGTTG GCATATGAGC TAGAAGCCTG GATATGTCAA TGA
|
Protein sequence | MNDMPLIFVR QKASKFVYIP HVSSVLHKQS VLGKRSRCPH HWRRNRSRPS ITSLVASLSK LDVLINCAGI VLPDTPCNQF TPLPELQAAL LASPHETWSR TFSTNVESLF FLSASALHLL AAAPAGGRII NISSIGSTMS DPVISQPAYQ ASKAALNHLT LLQASKFHFL SSNHPQMRTS ASFLIVFAAA ISLAAPYQQK TFNNGALSSS GAFSENTGLP WQDVVSIPII DTWTLNPAGD ASVIRTATID VETDIPSYDL HLLAINAQYT VPTALHGPTH PAALYSFITN DIFVTLLPSQ SSFGSWDLTL QVLNYTTLPP AWAPTASEPT ILKTVRLGNR RPESMLYSAG PKILSIVLGG GSKDEFAGSL GPLTLLSISK LDEVWTVEQL TVFLENDLLY FLDLFSPSFA VQISSGSFGA VFSPALSSNG QIAWLEQRDN GNWGGRKDLW MYDGHTPWKV PFKDWDLSPS RVIFSENSEA LNLLTLNDQD TSLFHIWTPT RSSPPSTPVR IPSNGTIHSV YHVGITPLDH SHLIGVMSSL TSAHELWVIS HSPHDDPTYN YENIRLTYFS EPVLQGRQLN AGESIEFVNE LGLTVKGKVF LPSKNKSQEK VPVVLLLHGD GNSEGWRNQW MQYWNQNALT SEGYAVVTIN PTGSEGYGNY FAQSGRFNWG NQTINDISRG LFHSFNLFPN LNNTSVTAMG YGAYGGFVIH WIQGHSTAFV ASNGEPVRWK GLVVHDGVLS PRWWAAETSC PAKVEWEFGE VSYDDESPFS LWDPERSSRE WAIPELVIHD GRNDCDAGPL SQSYASFALL QSRGVNSEIL VSDRWAFSKW HRAIFDFLES FAMNDCRLAY ELEAWICQ
|
| |