Gene CNK02820 details

Gene Information

Locus tag: CNK02820
Symbol:
ID: 3254679
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 830413
End bp: 834465
Gene Length: 4053 bp
Protein Length: 858 aa
Translation table:
GC content: 47%
IMG OID: 638253773
Product: conserved hypothetical protein
Protein accession: XP_567877
Protein GI: 58260934
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
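
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; both are assumptions about this record, and the function names are hypothetical rather than part of any IMG tool. Under those assumptions the gene span works out to 4053 bp and the coding portion to 2577 bp, and the difference is consistent with the gene being flagged as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases implied by a protein length (3 bp per residue).

    Assumption: one stop codon is counted as part of the CDS.
    """
    return 3 * protein_aa + (3 if include_stop else 0)


span = gene_length(830413, 834465)   # values from the record above
coding = coding_length(858)          # Protein Length from the record above
non_coding = span - coding           # rough estimate of intronic bases

print(span)        # 4053
print(coding)      # 2577
print(non_coding)  # 1476
```

This is only a consistency check: the non-coding figure is an estimate of total intron length under the stated assumptions, not an annotation of intron positions.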


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.438895
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATA TGCCGCTGAT TTTTGTGCGC CAAAAGGCTT CGAAATTCGT CTATATCCCA 
CATGTCTCAT CAGTTCTCCA TAAACAATCT GTTCTCGGTA AACGGTCTCG ATGTCCTCAT
CACTGGCGCC GGAACAGGTC TCGGTGAGTG AAGCCTTGGT ATGGAAGGAA AATCTAATGA
ACTAAAATAT ATACAGGACT CTATATGGCC AAGGGCATGG CCATGAACGG TGCAGTCACC
CATATCGCAG GCCGCAGACG CGAAAAACTG GAGGAAGCCA AGGTCACGAT TCTTGAACTT
AACCCAGAAG CTCACGTTCA CATGTGAGAG ACCGTTGGTC TGTAATCAGT ATCATTATTC
TCATACGTAC CCAGTCATGT GGCAGATATT TCCGACAGAC CCTCCATCAC CTCACTCGTC
GCTTCCCTGA GCAAGCTCGA CGTTCTAATC AACTGCGCAG GTATCGTGCT TCCCGACACG
CCTTGCAATC AGTTCACCCC TTTGCCCGAG CTCCAAGCTG CCCTTTTAGC TTCGCCGCAC
GAAACATGGT CTCGCACTTT TTCAACAAAT GTTGAATCCC TTTTCTTTTT ATCGGCATCC
GCGCTTCATC TACTAGCTGC AGCTCCAGCG GGCGGACGAA TCATCAACAT CTCGTCCATC
GGATCCACCA TGTCCGACCC TGTCATAAGC CAGCCTGCCT ATCAAGCGTC AAAGGCGGCT
CTCAATCATC TCACTCTCTT GCAAGCCAGT AAATTCCGTG AGCATGGGAT CCGTGTAAAT
GCAATCTCTC CTGGGTACTT CCCTAGTCAG ATGAATGATC CTAGCAATCC TAAGAGCATG
TTAGCCAGGG CACATGAGCT AGTACCGCTC AAAAGAGGTG GCAAGGAGGA GGATATAGCA
GGTACAGCAA TCTGGTTGGC TAGTCAAGCG GGAAGCTATG TGGATGGGCA GGTTATTGTC
TTAGGAGGTG GCAGAGAATG GGCTTGAGAT GCATTATAGC ATTTACTATT TTAAACCTGA
TCAAAAATGG CGATGCATCC CTCTAATTAG CCATTTCATA ATATAACCAG TAATGTGGAT
TTCAGACGAC TACATACTTG CATTGTTGTT GAAATCTATA TGCAGCGCCG GTTGATTATT
ACTGCAGCTC CGAGCTTGAT GCCCGCGTCC GGGGGGAAGA CCTGGTCATC AAGTTCGGAG
CTGAGGAGTT ATGTACACAA TCAGCGCCTC AGCATGTCTT TCATCGTCAA ATCGCCAATA
TGTCGCAAAC ACATCAAATA TGTTTTCCTT ATTCCATGGT ATACAACATG AACCCAATCA
TGAACAGACT TCCTATCAAG TAACCACCCA CAGATGAGAA CATCAGCTTC TTTCCTAATC
GTTTTCGCTG CTGCCATATC GCTTGCGGCT CCTTACCAAC AAAAGACCTT CAACAACGGA
GCATTATCCA GCTCAGGAGC TTTTTCGGAG AACACTGGAC TGCCATGGCA AGATGTAGTC
TCCATTCCCA TCATAGACAC TTGGACACTT AATCCTGCCG GTGACGCTTC TGTCATCCGC
ACTGCTACCA TAGATGTTGA AACAGATATG TAAGTGCCTT TGGGGCCAGT AGCTCGTTCT
GCTTATCATA ACTAGCTCAT AGTTGATCAT GATACATCGT AGACCATCGT ATGACCTCCA
TCTTCTTGCT ATCAATGCCC AATACACGGT CCCCACCGCC CTTCATGGTC CGACCCATCC
GGCTGCACTC TATTCGTTCA TCACCAACGA TATCTTCGTT ACCCTTCTGC CTTCACAGTC
TTCCTTCGGT TCCTGGGATC TTACGCTTCA AGTTCTCAAC TATACAACTC TTCCACCAGC
ATGGGCACCA ACTGCCTCTG AGCCAACTAT CCTGAAGACC GTTCGGTTGG GAAATAGGAG
GCCAGAGAGT ATGCTGTATT CTGCCGGACC GAAGATACTC TCTATTGTTC TTGGTGGGGG
TTCCAAAGAC GAATTTGCTG GATCTCTTGG CCCGTTGACA CTACTGTCCA TTTCTAAACT
AGATGAGGTC TGGACAGTCG AGCAATTAAC CGTTTTCCTC GAAAACGACC TGGTGAGTCC
GATCATCCTG GCTGACCTAT GAATAGATAC TTGATTGAGC GCCCAAAAAT GCAGATTATC
GACGAACTCG CCGCAAATGA TTCTGCCATA GCTTTCACTG CTCATAAGCC TGTCATTAGG
CCCAATAATG CTACCCGAAG CATCGTAAGT ATGAATTCTA GTCATACTCG TATTTGATGA
AACCAAAGGC TGATTTGCAT GTAGCTCTAC TTCCTTGATT TATTCTCCCC TTCATTCGCG
GTACAAATTA GCTCAGGCAG TTTTGGTGCT GTATTTTCTC CAGCCCTGAG CTCTAATGGT
CAGATCGCTT GGCTTGAGCA ACGAGATAAT GGGAATTGGG GAGGACGAAA AGACCTATGG
ATGTATGATG GTCATACCCC TTGGAAAGTT CCATTCAAGG ATTGGGACCT GAGTCCATCA
AGGGTTATCG TAAGCCATGA AATCCGCTGA CCAGTGACAC AAGCGGCACA TCATTTGACT
CACTTGGATC CACAGTTTTC GGAAAACAGC GAGGCTCTCA ATCTTCTTAC TCTCAATGAC
CAAGACACAT CACTTTTCCA CATCTGGACC CCTACTCGAT CATCGCCCCC TTCTACACCT
GTGCGGATCC CGTCTAATGG CACAATCCAC TCTGTATATC ACGTCGGCAT TACCCCTCTT
GATCATTCAC ATTTAATAGG CGTGATGTCT TCTCTTACAT CGGCTCACGA ACTTTGGGTC
ATTTCACACT CGCCTCATGA TGATCCGACC TACAATTATG AGAACATCAG GTTGACATAT
TTCAGTGAAC CGGTACTACA AGGAAGACAG CTAAATGCGG GTGAAAGTAT AGAGTTCGTG
AATGAGTTAG GGTTGACCGT GAAGGGGAAG GTATTTCTTC CTAGTAAAAA TAAATCACAG
GAGAAGGTAC CAGTCGTGCT GTTGCTGCAT GGCGATGGTA ACAGCGAAGG ATGGCGTAAT
CAGTGGATGC AGTATTGGAA TCAGAACGGT AGGTCTTCTA ATTGATCTCC TTTCCATTTC
AGGACTGAAT ATCGCAGCCT TGACCAGTGA GGGATATGCT GTGGTTACTA TCAACCCTAC
AGGCTCCGAA GGCTACGGTA ACTGTGAGTG CAAATGCATA TACGATCTTC ATTTAACATG
TTACTAACTG GTTGCTAGAC TTTGCCCAGT CGGGCCGGTT TAATTGGGGT AATCAAACTA
TAAACGACAT TTCCCGCGGT CTTTTCCATT CTTTCAACCT ATTTCCTAAC CTGAACAATA
CCAGCGTCAC GGCAATGGGT TATGGCGCCT ATGGCGGATT TGTTATTCAT TGGATACAAG
GCCACTCCAC CGCCTTTGTC GCTTCCAATG GGGAGCCAGT GAGATGGAAG GGATTAGTGG
TTCATGATGG CGTACTGTCT CCTAGGTGGT GGGCAGCAGA GACATCTTGC CCAGCGAAGG
TAGAGTGGGA ATTTGGCGAA GTCTCGTATG ATGATGAGTC ACCATTGTAA GTTATTGGCA
TCCCTGCTCC CGTGCCAATG ACTGAAATTA ATCTATGTTT ATCAGTTCGC TCTGGGATCC
TGAGCGTTCG TCCCGTGAAT GGGCCATACC AGAATTAGTT ATCCACGATG GTCGAAGTGA
GTTCCGAGAT GTGTGCACGT AATCCACGCT AAAATCTGCC CCAGATGACT GCGACGCTGG
CCCTCTATCG CAAAGCTACG CCTCTTTTGC GCTTTTGCAA AGTCGAGGAG TCAACAGTGA
GATACTGGTA TCAGATAGAT GGGCGTTTTC CAAATGGCAT AGGGCCATAT TTGATTTCCT
TGAGTCTTTG TGATATCCCA TGCAGAATGA TGGATACCAT TACAACTGGA ACAATTAATG
TGAATCATAC AAACTCAATC GAACTCCATA TGCACTAGCG CTATGAATGA CTGTCGGTTG
GCATATGAGC TAGAAGCCTG GATATGTCAA TGA
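
The gene sequence is listed in space-separated groups of 10 bases across many lines, so it has to be joined before any computation such as a GC content check. A minimal sketch, assuming plain A/C/G/T text with whitespace as the only separator; the helper names are hypothetical, and the result for the full sequence may differ slightly from the reported GC content depending on how IMG rounds it.

```python
def clean_sequence(text: str) -> str:
    """Join a grouped, multi-line sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two groups of the gene sequence, used here only as a small example.
example = "ATGAACGATA TGCCGCTGAT"
print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 45.0
```

Pasting the whole listing into `gc_percent` as one multi-line string gives the figure to compare against the GC content field.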
 
Protein sequence
MNDMPLIFVR QKASKFVYIP HVSSVLHKQS VLGKRSRCPH HWRRNRSRPS ITSLVASLSK 
LDVLINCAGI VLPDTPCNQF TPLPELQAAL LASPHETWSR TFSTNVESLF FLSASALHLL
AAAPAGGRII NISSIGSTMS DPVISQPAYQ ASKAALNHLT LLQASKFHFL SSNHPQMRTS
ASFLIVFAAA ISLAAPYQQK TFNNGALSSS GAFSENTGLP WQDVVSIPII DTWTLNPAGD
ASVIRTATID VETDIPSYDL HLLAINAQYT VPTALHGPTH PAALYSFITN DIFVTLLPSQ
SSFGSWDLTL QVLNYTTLPP AWAPTASEPT ILKTVRLGNR RPESMLYSAG PKILSIVLGG
GSKDEFAGSL GPLTLLSISK LDEVWTVEQL TVFLENDLLY FLDLFSPSFA VQISSGSFGA
VFSPALSSNG QIAWLEQRDN GNWGGRKDLW MYDGHTPWKV PFKDWDLSPS RVIFSENSEA
LNLLTLNDQD TSLFHIWTPT RSSPPSTPVR IPSNGTIHSV YHVGITPLDH SHLIGVMSSL
TSAHELWVIS HSPHDDPTYN YENIRLTYFS EPVLQGRQLN AGESIEFVNE LGLTVKGKVF
LPSKNKSQEK VPVVLLLHGD GNSEGWRNQW MQYWNQNALT SEGYAVVTIN PTGSEGYGNY
FAQSGRFNWG NQTINDISRG LFHSFNLFPN LNNTSVTAMG YGAYGGFVIH WIQGHSTAFV
ASNGEPVRWK GLVVHDGVLS PRWWAAETSC PAKVEWEFGE VSYDDESPFS LWDPERSSRE
WAIPELVIHD GRNDCDAGPL SQSYASFALL QSRGVNSEIL VSDRWAFSKW HRAIFDFLES
FAMNDCRLAY ELEAWICQ