Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC03240 |
Symbol | |
ID | 3256155 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1021589 |
End bp | 1024826 |
Gene Length | 3238 bp |
Protein Length | 938 aa |
Translation table | |
GC content | 48% |
IMG OID | 638255547 |
Product | protein-Golgi retention-related protein, putative |
Protein accession | XP_569646 |
Protein GI | 58264980 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.114288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCACCGCCA GCGCGACACT GATACTACTG TACAGTAATC TCGTCTCATC CACCTGCATC TTACAACATC AAGCCCACCA TGGACGAAGC CAAGCTTCTT TCAGATGCCT TGGCAAATGT CAAGGTTCAA ACTGTACAGC TCAAGAGATG CCTCGACCAA GACGAAATAA TGGAAGCTCT CAAGGCCGCC TCTTCGATGC TTGCTGAACT TCGAACATCA TCATTATCGC CAAAACAATA TTATGAACTG TACATGTCTG TATTCGACAG TCTGAGGTTT CTGAGCAACT ACTTATATGA GGCCCACACC GAGGGGAAAC ACCATCTGGC TGATCTTTAC GAGCTGGTCC AGTAAGTCCC TGATAGGAAA TCGAGATGGG TTGGGCATGG GCTAACTTAG GAAAAGATAT GCAGGTAATA TTGTACCACG GCTGTATTTG ATGATTACTG TCGGGTCTGT TTACATGTCC GTGCCAGATG CTCCCGTCAA AGAAATCATG AAGGACATGC TCGAGATGTC CCGAGGTGTA CAGCACCCCA CTCGAGGTTT GTTTCTTCGA CACTATCTCT CTGGTCAGAC CAGAGACTTC TTGCCTGTCG GCAATAGTGA TGGGTATGTG AAACAGTATC CACCAAACAT ACTAACAGTC ATGCAGTCCT GGCGGCAATC TTCAGGATTC TATTGGTTTT GTTCTCACAA ACTTTATCGA GATGAACAAG CTTTGGGTGC GACTTCAACA CCAAGGTCAT TCTCGTGAAC GCGAGAAGCG CGAGATGGAA CGTCGGGATC TTCGAATCCT TGTCGGCACC AACCTCGTCC GTCTATCCCA GTTAGATGGC GTTGACTTGG ACATGTACCG CAAGATCATA CTTCCATCGG TCCTCGAACA AGTTGTCAAC TGTCGTGATG TTATCGCGCA AGAGTACTTG ATGGAAGTCG TCATACAGGT ATTTACAGAC GACTTCCATC TCCACACACT CACACCTTTC CTTGGCGCTT GCGCCCAATT GCACCCACGA GTTAATATCA AGGGTATTGT CATTGCTCTG ATAGACAGAC TTGCTGCATA CGCAGTGAGG GAGGCAGAGA GCGAAGATCC CGAGGAGAAG AGGAGAGACG AAGAAGAAGC TGCAAGAAGG TTGGCAGAAA AAGTCAAAGG CGCTAGAGGA AAGGGAAAGA ACGTGGAGGA GGGCGAGAAG AATGCACCTT CTCCGGTGGC GAAGCCTGCT GAAGCAGATG TGTGGGGAGC TACAACTGAC ACAACCTCTA CAACTCCTGT CACTGAAAAT TTAAGCGGGG AATCATCCAA GAGTCCCGTA GAAGGAGAGA AATTAGGTGA ATCACCTGCC CCGACTCCTG CACAGATGGA GAAGGAAGAA ACCGCGAAGA AATTCAGAGG AATTCCTGAA GATGTCAAGC TTTTCGAAGT TTTCTGGCAA CAAGTAGTCG AACTTATCAA GGTACGTATT GCCAAGATTA TGAAGACGCA ACTGATGTTC TGTAGGCTCG ACCAGACTTG TCTATCATGG ACATCACCGC TCTCTGTGTC TCGTTGACAA ACCTTTCTTT GAGTTGTTAC CCTGATCGGC TTGAATATGT CGATCAGGTC TTGTCTTTTA CTCACGGAAA AGTACACGAT TACTCTCAAA AGTATGTGCA CATGCTCATC TTCTACCTGA TAGAGCTAAT ATTGGACTCA GCCCCGATCT GCACTCTTCT CAAACTGTCT CAAATCTCCT TGCACTCCTT CTCGCACCAA TCAGCTCATA CGTATCTATT CTCACTTTAC TCGCCATCCC CTCATATCTT CCACTTTTGT CAGTACAGCC GTATTCCACC CGTCTATCCA TTGGTCAAGC GGTGGTCTCC TCCGTGCTTA AGAACAATAC GCACATTGAA ACCTCTGACG ATGTCACCGG TGTGCTTGGT CTTTGCGCTG TACTTGTCAA AGACCAGAAA GATCATACTA TTGGCGGCGG CGCACCGCAG AGAAGAGGTC AGGCAATCGA CTGGAGAGAA ATGGCGGAAG AGCAGGGATG GGTCGCAAGG ATGGTGCATC TCTTTAGGGC CGATGATCTT GGTGTCCAAT TTGAATTGCT GCAGACAGCG AGGAGACATT TCACTGAAGG CGGTGAGAGA ATACGGTTCA CTTTCCCGCC CTTGATTGCC TCTAGTATTC AACTCGCTAG ACGCTTCAAG ACGAGGGAAA GCGTCGAGGA CGAATGGGAA ACCAGGGTAT CGGCCTTGTT CAAATTTATA CACCAGCTCA TTTCCATCTT GTATCACAAG GTTGAAGCTC CGGAGACATG CTTGCGTCTC TTCCTTCTCG CTGCTCAAGT CGCTGACGAC TGTCGCCTTG AGGAACTTAC CTACGAATTT TTTGTCCAAG CATTTGTCAT TTATGAGGAG TCCATATCTG AATCTCGAGC ACAGCTACAA GCTATTACCG GTATTATCTC GTCTTTGCAA ACAAGTAGAG TGTTCGGAAC AGATAATTAC GATACTTTGA TAACCAAGGC CGCATTGCAT GGGAGCAGGC TTCTCAAGAA AAGCCACCAG GCTACAACAG TGCTTTATGC GAGTCACATG TGGTGGCAAG GAGATGTTCC TGGACGGGAG AAGAATGACA AGGTATGTTA GGTGTGTTGG ATTGTAAACC GCTAATGTCT GTCAGCCGCC GTTCCGGGAC GGCAAGCGAG TTCTCGAATG TCTTCAAAAG TCTCTCCGTA TCGCTTCATC TTGCATCGAT GAAATCACCT CTGTACAGCT GTACGTTGAT GCCCTTGATC GATATGTCTA TTATTTCGAG CAGGGAGTGG AAGCTGTCAC GCCCAAATAC GTCAATTCTC TGGTGGAGCT TATCACCTCG AATATCGATT CGGTGAATAG TGGCGGAGAC GTGCATCCCA GCTCAGCCGG TGGAGGACTA GTGGAAGGTG TCAGTGGCGG GGATATGATC ATCAAGGTGA GCGCTCTTTC TAACTGTTCT ATTACTGTAA CTGATGGACG ATGAAAGCAC TTCCGAAATA CATTATTATA CATTCGAGGC CGACAGCGAC AGGCTCAAAC AGACGTTGTT GATCAGGGAG ATGAACGAGA AGGGGGGGAA GAGAAGAAGA AGGTTGATTG GGAAAGCGTG GACGTAGCGG GGGGTTGCTT GAAGATGGGC CTTACGCACT AAGCAGGGCA AAGTAGTAAC TGAGCGATTA TACAGAGTAA GGTATGGCCG AGCTAATA
|
Protein sequence | MDEAKLLSDA LANVKVQTVQ LKRCLDQDEI MEALKAASSM LAELRTSSLS PKQYYELYMS VFDSLRFLSN YLYEAHTEGK HHLADLYELV QYAGNIVPRL YLMITVGSVY MSVPDAPVKE IMKDMLEMSR GVQHPTRGLF LRHYLSGQTR DFLPVGNSDG PGGNLQDSIG FVLTNFIEMN KLWVRLQHQG HSREREKREM ERRDLRILVG TNLVRLSQLD GVDLDMYRKI ILPSVLEQVV NCRDVIAQEY LMEVVIQVFT DDFHLHTLTP FLGACAQLHP RVNIKGIVIA LIDRLAAYAV REAESEDPEE KRRDEEEAAR RLAEKVKGAR GKGKNVEEGE KNAPSPVAKP AEADVWGATT DTTSTTPVTE NLSGESSKSP VEGEKLGESP APTPAQMEKE ETAKKFRGIP EDVKLFEVFW QQVVELIKAR PDLSIMDITA LCVSLTNLSL SCYPDRLEYV DQVLSFTHGK VHDYSQNPDL HSSQTVSNLL ALLLAPISSY VSILTLLAIP SYLPLLSVQP YSTRLSIGQA VVSSVLKNNT HIETSDDVTG VLGLCAVLVK DQKDHTIGGG APQRRGQAID WREMAEEQGW VARMVHLFRA DDLGVQFELL QTARRHFTEG GERIRFTFPP LIASSIQLAR RFKTRESVED EWETRVSALF KFIHQLISIL YHKVEAPETC LRLFLLAAQV ADDCRLEELT YEFFVQAFVI YEESISESRA QLQAITGIIS SLQTSRVFGT DNYDTLITKA ALHGSRLLKK SHQATTVLYA SHMWWQGDVP GREKNDKPPF RDGKRVLECL QKSLRIASSC IDEITSVQLY VDALDRYVYY FEQGVEAVTP KYVNSLVELI TSNIDSVNSG GDVHPSSAGG GLVEGVSGGD MIIKHFRNTL LYIRGRQRQA QTDVVDQGDE REGGEEKKKV DWESVDVAGG CLKMGLTH
|
| |