Gene CNG02020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG02020 
Symbol 
ID3258908 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp564324 
End bp569195 
Gene Length4872 bp 
Protein Length1248 aa 
Translation table 
GC content47% 
IMG OID638257820 
Productendocytosis-related protein, putative 
Protein accessionXP_571901 
Protein GI58269490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATTTTGCCA TCCTTGGCCG CTAGCCCTCG TCATACGGAG TGAATTCATC CAATGCAGGA 
TAGAAGACAA CCACCAATTG CCATACCGCC CGCGCCGCTC CTTTCAGCAT CAGTCTCAAC
TCCCCGTCGC CCTACACCGC AGAGAGAGCA AGAACCGTCA GACGACACAG TCCGGCGGGT
ACCGTCGTCG TACTTTTCTC TTCCTTACAG CTATAGTAGT CCTGCAGGCC CATCACACTG
GAGGAACAAG CCGCTTGACA GTGGCTCAGC TCGTCGTCGC GACGGCAAGT TCCGTAGCGC
TAGCGTTAAA ACAGTGGGCA GCTTTTCTTT GAAGACACCG AATCGGCACG AAGATCTCGA
CAACTGGCAA TCTCTTCAAG ACATAACTGG AAGTGACGTG GATGAAGATC AGATTGAAGA
CAAGGCAACA ACAAATAGCC GATCGTCTTC GTTAGCAACT CCTAGCGCTC GGAGCGTACG
GTCACTTTTC ATATCTTCGG ACTTGTCTGC AGACGCCTTG AGTATACCTG AAGGAAGAAG
CCACACAAGC GATCGCATTT CTATAGTTGA TCGGTTATCA TTGTCTCCTA CTCAACTGGA
GGCCGGAGGT GATCATATTC TTTCTGTATC CCGAGATGAT TCTCAACAAG AAGAAGGCAC
GCCTCGTGCT GCTGTACTAC CATTACAATC GCCGGAACCC ATTGAAAGGC TGACTTCTGC
TACACCCCTG TTGGAAGCCT CATTTCCTTC AGATTTCCTA TCATCGAGAC AACCTATTAC
TGAGACAAGG TCCATTGTAG AGAAGAAGGC TTCCACTTTG GAAGGTATGT CTTACTTGTT
ACTTAGTTTC ATGCATCTGT TGACCTCTGT TGAAGGACAT AGATATAGTT GGATTCCTCC
CACATGGGCC ACAAATATAC TCAAATGCTC TATTGCCTAC CTCATAGCCT CTTTGTTCAC
GTTTGTCCCT GCCCTTGCGT CTATGTTATC GACCCAGTCG GAGACCGATG CGCATGGGCG
AGTTACTGCG GTACCCGCGT ATTCCGCACA TATGGTAGCA ACTATCGTTG TATACGTGAG
TAGGCGATGG TGCGTGACTA CGTGCTGAAT CCTAAAATCC GCCTAGTTCA ACCCGGCTAA
AACATTGGGA AACATGCTTC TGTCAACCAG ATATTGCTTC GTGCTCGCCG TACTTTCCAC
GGTAGTGTCC CTCTTAGCGA CCCTGACGAT TCGGTTATTC GATCATTACT CACCCTCACA
CGGAGAAAGA TGGGATTTTA TCAGTGAAAT GGGTGATTGG ATCGTTTGTA TAATTTGGAT
TGGAGGGACT ATGGGTGTGT TAGCTTGGTC CAAGTTATGG GTCGGTAACC CAAGTTTCAA
TTCAGGTAAA GTAAATGTCT CTTCAACAGC TATACAGCGA AACTGATTGG ACGAATAAGG
CTGCTCAATG GCCGCAATGA TTCTGTACAA TGTTGTCATA AAAGAGGGTG CAGTGCCAAA
ATTAATAGAG ATTTTGCTCA TCGTCTTCAC GGGAGGTCTG TGCGATGGTA GTCAACTGAA
CTTCTCACTG ACTTGATATG ACTTTAGTTT GCATCACAAA CCTTGTGTGC ATCACCGTCT
TCCCAGTGTC GGCGACTTCA AACCTTCAAA AATCAATATC CAAATCACTC AATTCTTTTT
CCACACTCCT CGACCTCCTC ACATCAACAT TCCTCCTCGA GAAAAGCACA GTCAAAGAGA
AGGGACTTAC TCTCAAGGAT GCAGTTAGAG ACCATTCTGC CGCTTTCAAG ACTCTGCAAA
AGGACCTTTC AGAGGCTAAA CACGAGCGAG TCCTTGATGG GAGGATCAGA GGCAGGAACT
TGCAGCTCTA CGATGCAGCA ATCGTCAGCC TTGGTCGGTT GGCCCAACAT TTGAGTAGTT
TGAGGAGTAG CACCAGGCTC CAAGAAAGTC TGGTCCGCGC CAGTCGCGAG GGGCGGATCA
GTTTGGAGGT CGGTGCAGAG CGTGGCCGTT CCAAAATATC CATTTCTGAG GTGGATGCCA
TTGACGATGA GAGAGGACAA GGATTATCTG AGGACATGGA CATTGCTACA AGCGTGCACC
TTTTCTTGAA ATTCAGAGAA ATTGCCGGGA CACAGATGGA TGATTTGAAC GTAAGTAACA
ATTGTTTTGA TGATGTAAAT TGGCTTGACT GTGAGCAGAC TCGGTGCGAT CAAGCCCTGG
AAGCTGTACA GGCTCTTTCC CAAGCTCGTC AGATGCCCTG TATCGATCTC CCTTTGATCC
GCTCCAAGCT CGCCACTTCC CTCAAGGAAT TCACTCTTTC ATCGAGCAGA GCCATCAAAA
GGGTCTACGC GGGACCCCGG CGGAAGAAGG GTGTCTATTT CAAAACTGAT CCAAGCTCTA
GTGACAGTGG AGAGAGCGAG AGTGATGTGG AGAGTGAAAA GATTTACGGG GACGGAAACG
AATGCAAGCT GGAGGCCACC GAAAACTTGC CCGTTGAAGA TCAGCCCGAC ATTAACCACG
GTCCTAATGA AACGGTCTTC TGGATCTACT TGTGAGTGTT CGGTCAACAT TAAGATAGGA
AGGCTGACGA CGCACGCTAC AGTTTCCTGT TTACTTTTGA AGAATTTGCA CGTGAGATGA
TATTTCTTGT GGATACAATG GAGGAGGTGC GTTTCTGGCT TTTTTGTCGT ATCAGTTATT
AACGTTTTCA CAGATTGTTA CGACTGAGAA AGTCACTTTC TGGGAGCATC TCAAAACAGT
TATAATGCCA AAAAGAGGGA GGAAGGAGAA GAAAAGCGAG TACCTCTATA AACAGCTTCG
TGAGTCTTGT AGTCATTATA TTTCAAGATA TTTTCCCAAC CTCGCAACGC AGAAAACATT
GTCCCAATCG ACCCTTCTCA ACTTCAGCCG CCATTGTACC CGAAAAATGG TCGTGATTCC
ACTGGACCCG TTATCGTGCC TGATTTGAAG TCACTCAGTT TTATTGGAAG GATCAAACAG
ATGTTCTGGG CGTTAGGCGA AAGGTCTAAA CAGCCTGACG CGCGTTATGC CATCAAAACT
GGTCTCGGAG GCGGTAGGTT GGAAGACAGC CTTTTAGGAT TCGCAAATTC TAATAGAAGA
TGCTTAGCTA TGTTGGCTGC GCCAGCATTC ACAGAAATTG GACGACCGAT ATTTCTGAGA
TTCAGAGGGG AATGGGCGCT GATCGCATAC TTTGCTACCA TGAGCCAAAC CATCGGGCAA
ACCAATTTTT TGTGAGTTGT TGTGCAACAA AACATTCGCT TGACATATAC ATTGCCCCGC
TTACAATGAT TTAGATCGAT GATGAGAATA TTGGGAACTT TGTATGTTGA AAATAAAGCC
TATGATTAGC AGATGCTGAC TTTATTATAA TGTAGAATCG GAGCGGGAGC TGCTGTCCTT
TTCACCAAAT TGTGGGTCAT AGTTTAAAAT GCGAAGGATT GTCCCTTATC TTCTGCTCAG
GTTTCCAGAC AACAACGTTG CTCTTCCTAT CCTAGGGTTT TTCTTCTCGA TCCCCTGCTT
CTATATTATA ACTCAAATGC CCGATTACAT GAATGCAGGG CGATTTATCC TCCTTACATA
TGTAGGTACA ATCTTGTATC ATGAGGAAAA TGCTGAATCG ACTTAGAATC TCACGTGCTT
GTATACTTAC AATACCAGAA CGAGAGGAGA CGTTACTGTT GAAATGATTG CCTATCGCCG
ATCAACAAGT GTTATTGTGG GCGTTCTGTG GGCTGCAATT GTTTCCAGGT ACTGGTGGCC
GTTCACCGCC AGGAGGGAGC TGCGTATGGG CCTGAGCGAG TAAGTCAGAT GTCACCTTGC
CCTTTCAGTT GCATTTTGAT CATCGTCATA TTGTTTGCTA GCTTCTGTCT TGATCTGTCC
TATCTATATT CAAAACTCCT CACAACTTAT GGTAATGGCG TCGACTACAA TGGACTAAAC
GCCGTAGGAG CAGATACTGA AGAAGGAGAG TTGGAACCTT TACTGCCGAC AGATGCTATC
AGGCATTCTC ATCTCGATCA CGGAGTAAGA CAATTTATGG CCATGTAAGT CTGCGCGAAT
GAACATTTTG TCCATAATCT CTTAATTTAC TCGTGTCATT AGGGAATTGC ATCTTCAGAG
TCAGCTTGAC AGTATGAAAA GCTTGCTGGC CCACACAAAA AACGAACCAC GCCTAAAAGG
ACCGTTTGCA TATGGATTCT ATAAAGAGGT CCTACTTAGT TGCGAGCGAA TGCTCGATAG
GCTGCACAGT ATGCGCTGTG TTACGACTAG AGATGAGTGG TAAGTTTACT GCCCAGTTTG
CGGCCCAATA CCAGTGGCGT TGACGAAAAC TTATTAGGGA CAACAATATC CGGGATACCT
TTGTTATACC AGTCAACAAG GAACGCCGAG AGATGGCTGG GAACGTCATT CTATATGTGA
GTACGCTTGC AATCAGTGCC TTGTGGCTTA CCGTTTACTC ACTCTTACTG TAGTTCTATA
CTTTGTCCGC CGGATTCCGG CTGCGGACGC CCGTGCCCCC ATACCTTCCT CCAGCTGAGG
AGGCTCGTCA GCGCCTCGTT GACGCCATTC GTTCCCTCGA TGTCGTTCGA CGCAGAAGCG
TTCGTGGAGG TGGAAGACAT CTCTTGTTCT TTGCATATAC ATTGGCCATG CAGGTGGGTC
ATTTTTAAAA AAAAACTTTT AACTGATGGC TTTTATCAAT AGGAGGTGAT AGCTGAGCTG
GAGTATCTGG GAGCGATGAT GCAGGAAGCT TTCGGTGTGA TATCTGTCAG TTCCGCTGAC
GACTTTGAGG ACTTATTCGA GGAGCCAGTA GAGGAAGTTG CCAAAGTGGG GAAAAGCCCG
AAGAGCCTGT AA
 
Protein sequence
MQDRRQPPIA IPPAPLLSAS VSTPRRPTPQ REQEPSDDTV RRVPSSYFSL PYSYSSPAGP 
SHWRNKPLDS GSARRRDGKF RSASVKTVGS FSLKTPNRHE DLDNWQSLQD ITGSDVDEDQ
IEDKATTNSR SSSLATPSAR SVRSLFISSD LSADALSIPE GRSHTSDRIS IVDRLSLSPT
QLEAGGDHIL SVSRDDSQQE EGTPRAAVLP LQSPEPIERL TSATPLLEAS FPSDFLSSRQ
PITETRSIVE KKASTLEGHR YSWIPPTWAT NILKCSIAYL IASLFTFVPA LASMLSTQSE
TDAHGRVTAV PAYSAHMVAT IVVYVSRRCE MGDWIVCIIW IGGTMGVLAW SKLWVGNPSF
NSGCSMAAMI LYNVVIKEGA VPKLIEILLI VFTGVCITNL VCITVFPVSA TSNLQKSISK
SLNSFSTLLD LLTSTFLLEK STVKEKGLTL KDAVRDHSAA FKTLQKDLSE AKHERVLDGR
IRGRNLQLYD AAIVSLGRLA QHLSSLRSST RLQESLVRAS REGRISLEVG AERGRSKISI
SEVDAIDDER GQGLSEDMDI ATSVHLFLKF REIAGTQMDD LNTRCDQALE AVQALSQARQ
MPCIDLPLIR SKLATSLKEF TLSSSRAIKR VYAGPRRKKG VYFKTDPSSS DSGESESDVE
SEKIYGDGNE CKLEATENLP VEDQPDINHG PNETVFWIYF FLFTFEEFAR EMIFLVDTME
EIVTTEKVTF WEHLKTVIMP KRGRKEKKSE YLYKQLQNIV PIDPSQLQPP LYPKNGRDST
GPVIVPDLKS LSFIGRIKQM FWALGERSKQ PDARYAIKTG LGGGRLEDSL LGFANSNRRC
LAMLAAPAFT EIGRPIFLRF RGEWALIAYF ATMSQTIGQT NFLIGAGAAV LFTKLFPDNN
VALPILGFFF SIPCFYIITQ MPDYMNAGRF ILLTYNLTCL YTYNTRTRGD VTVEMIAYRR
STSVIVGVLW AAIVSRYWWP FTARRELRMG LSDFCLDLSY LYSKLLTTYG NGVDYNGLNA
VGADTEEGEL EPLLPTDAIR HSHLDHGVRQ FMAMELHLQS QLDSMKSLLA HTKNEPRLKG
PFAYGFYKEV LLSCERMLDR LHSMRCVTTR DEWDNNIRDT FVIPVNKERR EMAGNVILYF
YTLSAGFRLR TPVPPYLPPA EEARQRLVDA IRSLDVVRRR SVRGGGRHLL FFAYTLAMQE
VIAELEYLGA MMQEAFGVIS VSSADDFEDL FEEPVEEVAK VGKSPKSL