Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB05570 |
Symbol | |
ID | 3255659 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1565426 |
End bp | 1568531 |
Gene Length | 3106 bp |
Protein Length | 829 aa |
Translation table | |
GC content | 52% |
IMG OID | 638255199 |
Product | ER to Golgi transport-related protein, putative |
Protein accession | XP_569300 |
Protein GI | 58264288 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0446639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCCCGACCA TACTCCGTAC ATCCCCGTTT ATCCCCCTTC CGTCCTTCCT CCTACCTGCA CCTGCCTGAG TAACCATGGT GAGTGACAAT CCCAGCTAAC AACGCCCCCT TGCTCTCGCT TACCACCCAA ACACAGCCCA TGTTACTCGA CATCACCGTA CGTCACATAC CTCCAGCCTG CGATCCATCA CCTTCCTCAC ACGTCCACTA ACCCCCCTTA CCTACAGAGG AAACTACTGG CCCGATCAGA CAGGGTCAAG TCAGTAGACT TCCACCCTAC AGAACCCCAT GTCATCTGCG GTTTATAGTA AGTCATATAC CGTCTCTGAA CCTTCACCGG TGCCATGCGC ATTGGGTGTA GCGAGAAAAA GGGCGGAAAG AACTTAAAAG CTGACTGGCA TGCGCTTCTA TATAGTAATG GTCAAGTCAA GATATGGGAC TATGAGACAG GCACAGATGT CAAGGCTTTT GAAGTGACCG ACGTCCCCGT CCGATGCGTC CGCTATATCG CCCGAAAGAA CTGGTTCGTT TCTGGCTCTG ACGATTTCCA ACTCCGAGTA TACAACATCT CTACCGGCGA GAAGATTACC CAATTCGAGG CGCACCCAGA TTATATCAGA TGTCTGACTG TTCACCCGAC GTTGAGCTTG GTCTTGACTG GTAGTGACGA TATGACTATC AAGTGCTGGG ACTGGGACAA GGGATGGCGA TGTGTTCAGG TGGGTTTCTT GTTCAATCGT TCTTTCGTCG GCATCAAAAA GCTAAACCCT AGGGAAGATA TTTGAAGGGC ACACCCACTA CATTATGGCT CTCGCCATCA ACCCAAAGGA CCCTCAGACA TTTGCGTCCG CCTGTTTGGA CCATACCGTC AAGGTCTGGT CTCTTGGAAA CTCGGTCCCC AACTTTTCTC TCGAGGCGCA CGAAAAGGGT GTCAACTATG TCGACTACTA CCATGGCGGT GATAAGCCCT ATATCGTGAC AACTGGTGAT GACCGGTAAG TTTCTTACTT GCCCTTCGTT TCTGTTATGC ACAATGTTAA AACATTCCAT TCCCATAGAC TTGTCAAGAT TTGGGATTAC CACTCTAAAT CCTGCGTGCA AACCCTCGAA TCACACACTG CCAACGTGTC GTTTGCCATC TTCCACCCAT CACTGCCTAT CATCGTCTCT GGTTCCGAAG ACGGGACCGT CAAGATCTGG CACTCTGCTA CTTATCGACT GGAGAACACT CTAAGTTATG GGCTTGAGAG GGCTTGGTGT GTGGCGTACA AGAAGAGCGG GAATGAAGTT GCTGTCGGTT TCGATGAAGG TGCTGTTGTC GTCAAAGTGA GTCTTTTCTT GTAGGTTGAT CTCTTTTAAC GTTTTGTTGC AAAGCTCGGT CGAGATGAAC CCGCCGTATC AATGGACACT TCTGGCAAAA TCGTTTACGC TCGCAACACT GAAATCCTCA CTGCCAACCT CTCCACTCTC TCCGACAGTG AACCCCTCGA AGACGGACAA CGTGTCCCCC TCCCACTCCG TGACCTCGGC ACCACCGAAG TCTACCCCCA ATCTCTCCAA CACAGCCCCA ACGGCCGATT CGTCACTGTC TGCGGCGATG GCGAGTACAT CATTTACACT TCCCTTGCGT GGAGGAACAA GGCGTTTGGG AATGGATCGA GCTTTGCTTG GGCGGGCGAC TCGAATACCT ATGCTGTGCA AGAAGGCAAG GCTAAGATCC GTGTTTTCCG AGCGTTCAAG GAACGACCCA ACCTCTTAAA ATCGGCGGGC AATTGGGCTG TGGAAGGTAT CCATGGCGGC ACGTTGCTCG CAGCCCGTGG CAATGGGTTC GTCATGTTTT GGGACTGGGA GACTGGATCG GTCGTGAGAA GGATAGAGGT GGATGCTACT AGTGTGAGCT GGAGCGCGAC GGGTAATTTC GTCGTTATCA CTGCAGAGGA CTCGTTCTAC GTGCTCTCGT TCAATAGGGA GGCGTATGAT GCCAAGTTGG ATAGTGGAGA GTTGATTGGA GATGAAGGTG TGGAAGAGGC GTTTGAAGTT ATCGCCGAGA TTAGCGAAAC GTGAGTTTTT TTTTTTTTTT CATTAGCGAA GAGTAAGAGC TGAAATAAGT ATATATGAAT CAGGGTCAAG ACATCTAAAT GGGTCGGCGA CTGCTTTGTC TACACTAACT CCACCAACCG CCTTAACTAC CTGATCGGCG ACCAATCCCA CACTGTCAAC CACTTTGATC AAGGCATCTA CCTCCTTGGT TACCTGCCTT CCCACAACAG GATCTATGTC GCCGACAAGG ACATGAACAT CTATACCTAC GCCCTCTCCG TCTCCGTCGT TGAATACCAA ACCGCCATCT TGCGAGGCGA CCTTGATGCA GCTGCAGAGA TCTTGCCGTC CATTCCGCAA GATCAGCGAA ACCGGATCGC GCGATTCTTG GAGGCGCAGG ACCTGAAGGA ACTAGCACTC TCAGTGTCGA CGGATCCGGA TCAGAGGTTT GATCTGGCTG TATCACTTGA TGATCTCGAA ACGGCGCTTT CACTCGTGCG CGCTGCCGAC GAGTCTGCTG CTACTCCTTC CGGTGACGCC GCCGGTGTCG GTGCAGGCGT CAGTGTAAAC CAAGCGAAAT GGAAGGTTGT CGGCGACAAG GCCCTCGCCG CATGGCAAAT GGACCTCGCA AAAGAAGCGT TCCAGAATGC CAACGACCTC TCATCCCTCT TACTCCTCTA CACCTCCCTC TCCGACCGAA CCGGTCTCTC CTCCCTTGCC CAAGTCGCCT CCCAAAAGGG TCTTAACAAT CTCGCTTTTG CAGCATACCT CCAGCTTGGC GACGTTGCCG CCTGCATCGA CCTCTTGGTC AAGACTGATA GGCTGGCCGA AGCGGCATTG TTTACAAGAA GTTATGCGCC TTTGACAAAT GAGACGGAGA AGGAATGGGG AGCGACGGTC AAGCTTTGGA AGGAGCAGTT GGTAAAGGAT GGAAGAGGGA AAGTGGCGGA GAAAGTGGCG GAGCCGGGGG AGGATAGAGA GTTGTTTGAC TAACGACTGC TCTCTACCAT AATAAATCAA AAGATATATA TATGAACCAT GCATACGAGC AGATGACTGA TACACC
|
Protein sequence | MPMLLDITRK LLARSDRVKS VDFHPTEPHV ICGLYNGQVK IWDYETGTDV KAFEVTDVPV RCVRYIARKN WFVSGSDDFQ LRVYNISTGE KITQFEAHPD YIRCLTVHPT LSLVLTGSDD MTIKCWDWDK GWRCVQIFEG HTHYIMALAI NPKDPQTFAS ACLDHTVKVW SLGNSVPNFS LEAHEKGVNY VDYYHGGDKP YIVTTGDDRL VKIWDYHSKS CVQTLESHTA NVSFAIFHPS LPIIVSGSED GTVKIWHSAT YRLENTLSYG LERAWCVAYK KSGNEVAVGF DEGAVVVKLG RDEPAVSMDT SGKIVYARNT EILTANLSTL SDSEPLEDGQ RVPLPLRDLG TTEVYPQSLQ HSPNGRFVTV CGDGEYIIYT SLAWRNKAFG NGSSFAWAGD SNTYAVQEGK AKIRVFRAFK ERPNLLKSAG NWAVEGIHGG TLLAARGNGF VMFWDWETGS VVRRIEVDAT SVSWSATGNF VVITAEDSFY VLSFNREAYD AKLDSGELIG DEGVEEAFEV IAEISETVKT SKWVGDCFVY TNSTNRLNYL IGDQSHTVNH FDQGIYLLGY LPSHNRIYVA DKDMNIYTYA LSVSVVEYQT AILRGDLDAA AEILPSIPQD QRNRIARFLE AQDLKELALS VSTDPDQRFD LAVSLDDLET ALSLVRAADE SAATPSGDAA GVGAGVSVNQ AKWKVVGDKA LAAWQMDLAK EAFQNANDLS SLLLLYTSLS DRTGLSSLAQ VASQKGLNNL AFAAYLQLGD VAACIDLLVK TDRLAEAALF TRSYAPLTNE TEKEWGATVK LWKEQLVKDG RGKVAEKVAE PGEDRELFD
|
| |