Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG02020 |
Symbol | |
ID | 3258908 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 564324 |
End bp | 569195 |
Gene Length | 4872 bp |
Protein Length | 1248 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257820 |
Product | endocytosis-related protein, putative |
Protein accession | XP_571901 |
Protein GI | 58269490 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATTTTGCCA TCCTTGGCCG CTAGCCCTCG TCATACGGAG TGAATTCATC CAATGCAGGA TAGAAGACAA CCACCAATTG CCATACCGCC CGCGCCGCTC CTTTCAGCAT CAGTCTCAAC TCCCCGTCGC CCTACACCGC AGAGAGAGCA AGAACCGTCA GACGACACAG TCCGGCGGGT ACCGTCGTCG TACTTTTCTC TTCCTTACAG CTATAGTAGT CCTGCAGGCC CATCACACTG GAGGAACAAG CCGCTTGACA GTGGCTCAGC TCGTCGTCGC GACGGCAAGT TCCGTAGCGC TAGCGTTAAA ACAGTGGGCA GCTTTTCTTT GAAGACACCG AATCGGCACG AAGATCTCGA CAACTGGCAA TCTCTTCAAG ACATAACTGG AAGTGACGTG GATGAAGATC AGATTGAAGA CAAGGCAACA ACAAATAGCC GATCGTCTTC GTTAGCAACT CCTAGCGCTC GGAGCGTACG GTCACTTTTC ATATCTTCGG ACTTGTCTGC AGACGCCTTG AGTATACCTG AAGGAAGAAG CCACACAAGC GATCGCATTT CTATAGTTGA TCGGTTATCA TTGTCTCCTA CTCAACTGGA GGCCGGAGGT GATCATATTC TTTCTGTATC CCGAGATGAT TCTCAACAAG AAGAAGGCAC GCCTCGTGCT GCTGTACTAC CATTACAATC GCCGGAACCC ATTGAAAGGC TGACTTCTGC TACACCCCTG TTGGAAGCCT CATTTCCTTC AGATTTCCTA TCATCGAGAC AACCTATTAC TGAGACAAGG TCCATTGTAG AGAAGAAGGC TTCCACTTTG GAAGGTATGT CTTACTTGTT ACTTAGTTTC ATGCATCTGT TGACCTCTGT TGAAGGACAT AGATATAGTT GGATTCCTCC CACATGGGCC ACAAATATAC TCAAATGCTC TATTGCCTAC CTCATAGCCT CTTTGTTCAC GTTTGTCCCT GCCCTTGCGT CTATGTTATC GACCCAGTCG GAGACCGATG CGCATGGGCG AGTTACTGCG GTACCCGCGT ATTCCGCACA TATGGTAGCA ACTATCGTTG TATACGTGAG TAGGCGATGG TGCGTGACTA CGTGCTGAAT CCTAAAATCC GCCTAGTTCA ACCCGGCTAA AACATTGGGA AACATGCTTC TGTCAACCAG ATATTGCTTC GTGCTCGCCG TACTTTCCAC GGTAGTGTCC CTCTTAGCGA CCCTGACGAT TCGGTTATTC GATCATTACT CACCCTCACA CGGAGAAAGA TGGGATTTTA TCAGTGAAAT GGGTGATTGG ATCGTTTGTA TAATTTGGAT TGGAGGGACT ATGGGTGTGT TAGCTTGGTC CAAGTTATGG GTCGGTAACC CAAGTTTCAA TTCAGGTAAA GTAAATGTCT CTTCAACAGC TATACAGCGA AACTGATTGG ACGAATAAGG CTGCTCAATG GCCGCAATGA TTCTGTACAA TGTTGTCATA AAAGAGGGTG CAGTGCCAAA ATTAATAGAG ATTTTGCTCA TCGTCTTCAC GGGAGGTCTG TGCGATGGTA GTCAACTGAA CTTCTCACTG ACTTGATATG ACTTTAGTTT GCATCACAAA CCTTGTGTGC ATCACCGTCT TCCCAGTGTC GGCGACTTCA AACCTTCAAA AATCAATATC CAAATCACTC AATTCTTTTT CCACACTCCT CGACCTCCTC ACATCAACAT TCCTCCTCGA GAAAAGCACA GTCAAAGAGA AGGGACTTAC TCTCAAGGAT GCAGTTAGAG ACCATTCTGC CGCTTTCAAG ACTCTGCAAA AGGACCTTTC AGAGGCTAAA CACGAGCGAG TCCTTGATGG GAGGATCAGA GGCAGGAACT TGCAGCTCTA CGATGCAGCA ATCGTCAGCC TTGGTCGGTT GGCCCAACAT TTGAGTAGTT TGAGGAGTAG CACCAGGCTC CAAGAAAGTC TGGTCCGCGC CAGTCGCGAG GGGCGGATCA GTTTGGAGGT CGGTGCAGAG CGTGGCCGTT CCAAAATATC CATTTCTGAG GTGGATGCCA TTGACGATGA GAGAGGACAA GGATTATCTG AGGACATGGA CATTGCTACA AGCGTGCACC TTTTCTTGAA ATTCAGAGAA ATTGCCGGGA CACAGATGGA TGATTTGAAC GTAAGTAACA ATTGTTTTGA TGATGTAAAT TGGCTTGACT GTGAGCAGAC TCGGTGCGAT CAAGCCCTGG AAGCTGTACA GGCTCTTTCC CAAGCTCGTC AGATGCCCTG TATCGATCTC CCTTTGATCC GCTCCAAGCT CGCCACTTCC CTCAAGGAAT TCACTCTTTC ATCGAGCAGA GCCATCAAAA GGGTCTACGC GGGACCCCGG CGGAAGAAGG GTGTCTATTT CAAAACTGAT CCAAGCTCTA GTGACAGTGG AGAGAGCGAG AGTGATGTGG AGAGTGAAAA GATTTACGGG GACGGAAACG AATGCAAGCT GGAGGCCACC GAAAACTTGC CCGTTGAAGA TCAGCCCGAC ATTAACCACG GTCCTAATGA AACGGTCTTC TGGATCTACT TGTGAGTGTT CGGTCAACAT TAAGATAGGA AGGCTGACGA CGCACGCTAC AGTTTCCTGT TTACTTTTGA AGAATTTGCA CGTGAGATGA TATTTCTTGT GGATACAATG GAGGAGGTGC GTTTCTGGCT TTTTTGTCGT ATCAGTTATT AACGTTTTCA CAGATTGTTA CGACTGAGAA AGTCACTTTC TGGGAGCATC TCAAAACAGT TATAATGCCA AAAAGAGGGA GGAAGGAGAA GAAAAGCGAG TACCTCTATA AACAGCTTCG TGAGTCTTGT AGTCATTATA TTTCAAGATA TTTTCCCAAC CTCGCAACGC AGAAAACATT GTCCCAATCG ACCCTTCTCA ACTTCAGCCG CCATTGTACC CGAAAAATGG TCGTGATTCC ACTGGACCCG TTATCGTGCC TGATTTGAAG TCACTCAGTT TTATTGGAAG GATCAAACAG ATGTTCTGGG CGTTAGGCGA AAGGTCTAAA CAGCCTGACG CGCGTTATGC CATCAAAACT GGTCTCGGAG GCGGTAGGTT GGAAGACAGC CTTTTAGGAT TCGCAAATTC TAATAGAAGA TGCTTAGCTA TGTTGGCTGC GCCAGCATTC ACAGAAATTG GACGACCGAT ATTTCTGAGA TTCAGAGGGG AATGGGCGCT GATCGCATAC TTTGCTACCA TGAGCCAAAC CATCGGGCAA ACCAATTTTT TGTGAGTTGT TGTGCAACAA AACATTCGCT TGACATATAC ATTGCCCCGC TTACAATGAT TTAGATCGAT GATGAGAATA TTGGGAACTT TGTATGTTGA AAATAAAGCC TATGATTAGC AGATGCTGAC TTTATTATAA TGTAGAATCG GAGCGGGAGC TGCTGTCCTT TTCACCAAAT TGTGGGTCAT AGTTTAAAAT GCGAAGGATT GTCCCTTATC TTCTGCTCAG GTTTCCAGAC AACAACGTTG CTCTTCCTAT CCTAGGGTTT TTCTTCTCGA TCCCCTGCTT CTATATTATA ACTCAAATGC CCGATTACAT GAATGCAGGG CGATTTATCC TCCTTACATA TGTAGGTACA ATCTTGTATC ATGAGGAAAA TGCTGAATCG ACTTAGAATC TCACGTGCTT GTATACTTAC AATACCAGAA CGAGAGGAGA CGTTACTGTT GAAATGATTG CCTATCGCCG ATCAACAAGT GTTATTGTGG GCGTTCTGTG GGCTGCAATT GTTTCCAGGT ACTGGTGGCC GTTCACCGCC AGGAGGGAGC TGCGTATGGG CCTGAGCGAG TAAGTCAGAT GTCACCTTGC CCTTTCAGTT GCATTTTGAT CATCGTCATA TTGTTTGCTA GCTTCTGTCT TGATCTGTCC TATCTATATT CAAAACTCCT CACAACTTAT GGTAATGGCG TCGACTACAA TGGACTAAAC GCCGTAGGAG CAGATACTGA AGAAGGAGAG TTGGAACCTT TACTGCCGAC AGATGCTATC AGGCATTCTC ATCTCGATCA CGGAGTAAGA CAATTTATGG CCATGTAAGT CTGCGCGAAT GAACATTTTG TCCATAATCT CTTAATTTAC TCGTGTCATT AGGGAATTGC ATCTTCAGAG TCAGCTTGAC AGTATGAAAA GCTTGCTGGC CCACACAAAA AACGAACCAC GCCTAAAAGG ACCGTTTGCA TATGGATTCT ATAAAGAGGT CCTACTTAGT TGCGAGCGAA TGCTCGATAG GCTGCACAGT ATGCGCTGTG TTACGACTAG AGATGAGTGG TAAGTTTACT GCCCAGTTTG CGGCCCAATA CCAGTGGCGT TGACGAAAAC TTATTAGGGA CAACAATATC CGGGATACCT TTGTTATACC AGTCAACAAG GAACGCCGAG AGATGGCTGG GAACGTCATT CTATATGTGA GTACGCTTGC AATCAGTGCC TTGTGGCTTA CCGTTTACTC ACTCTTACTG TAGTTCTATA CTTTGTCCGC CGGATTCCGG CTGCGGACGC CCGTGCCCCC ATACCTTCCT CCAGCTGAGG AGGCTCGTCA GCGCCTCGTT GACGCCATTC GTTCCCTCGA TGTCGTTCGA CGCAGAAGCG TTCGTGGAGG TGGAAGACAT CTCTTGTTCT TTGCATATAC ATTGGCCATG CAGGTGGGTC ATTTTTAAAA AAAAACTTTT AACTGATGGC TTTTATCAAT AGGAGGTGAT AGCTGAGCTG GAGTATCTGG GAGCGATGAT GCAGGAAGCT TTCGGTGTGA TATCTGTCAG TTCCGCTGAC GACTTTGAGG ACTTATTCGA GGAGCCAGTA GAGGAAGTTG CCAAAGTGGG GAAAAGCCCG AAGAGCCTGT AA
|
Protein sequence | MQDRRQPPIA IPPAPLLSAS VSTPRRPTPQ REQEPSDDTV RRVPSSYFSL PYSYSSPAGP SHWRNKPLDS GSARRRDGKF RSASVKTVGS FSLKTPNRHE DLDNWQSLQD ITGSDVDEDQ IEDKATTNSR SSSLATPSAR SVRSLFISSD LSADALSIPE GRSHTSDRIS IVDRLSLSPT QLEAGGDHIL SVSRDDSQQE EGTPRAAVLP LQSPEPIERL TSATPLLEAS FPSDFLSSRQ PITETRSIVE KKASTLEGHR YSWIPPTWAT NILKCSIAYL IASLFTFVPA LASMLSTQSE TDAHGRVTAV PAYSAHMVAT IVVYVSRRCE MGDWIVCIIW IGGTMGVLAW SKLWVGNPSF NSGCSMAAMI LYNVVIKEGA VPKLIEILLI VFTGVCITNL VCITVFPVSA TSNLQKSISK SLNSFSTLLD LLTSTFLLEK STVKEKGLTL KDAVRDHSAA FKTLQKDLSE AKHERVLDGR IRGRNLQLYD AAIVSLGRLA QHLSSLRSST RLQESLVRAS REGRISLEVG AERGRSKISI SEVDAIDDER GQGLSEDMDI ATSVHLFLKF REIAGTQMDD LNTRCDQALE AVQALSQARQ MPCIDLPLIR SKLATSLKEF TLSSSRAIKR VYAGPRRKKG VYFKTDPSSS DSGESESDVE SEKIYGDGNE CKLEATENLP VEDQPDINHG PNETVFWIYF FLFTFEEFAR EMIFLVDTME EIVTTEKVTF WEHLKTVIMP KRGRKEKKSE YLYKQLQNIV PIDPSQLQPP LYPKNGRDST GPVIVPDLKS LSFIGRIKQM FWALGERSKQ PDARYAIKTG LGGGRLEDSL LGFANSNRRC LAMLAAPAFT EIGRPIFLRF RGEWALIAYF ATMSQTIGQT NFLIGAGAAV LFTKLFPDNN VALPILGFFF SIPCFYIITQ MPDYMNAGRF ILLTYNLTCL YTYNTRTRGD VTVEMIAYRR STSVIVGVLW AAIVSRYWWP FTARRELRMG LSDFCLDLSY LYSKLLTTYG NGVDYNGLNA VGADTEEGEL EPLLPTDAIR HSHLDHGVRQ FMAMELHLQS QLDSMKSLLA HTKNEPRLKG PFAYGFYKEV LLSCERMLDR LHSMRCVTTR DEWDNNIRDT FVIPVNKERR EMAGNVILYF YTLSAGFRLR TPVPPYLPPA EEARQRLVDA IRSLDVVRRR SVRGGGRHLL FFAYTLAMQE VIAELEYLGA MMQEAFGVIS VSSADDFEDL FEEPVEEVAK VGKSPKSL
|
| |