Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE00910 |
Symbol | |
ID | 3257990 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 244558 |
End bp | 247792 |
Gene Length | 3235 bp |
Protein Length | 932 aa |
Translation table | |
GC content | 49% |
IMG OID | 638256677 |
Product | Golgi to vacuole transport-related protein, putative |
Protein accession | XP_570825 |
Protein GI | 58267338 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.752282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGCAAAATA ATGTTCGAGA GGACGCTTCA GGATCTCATC CGAGGTCTAC GCGCCCACAA GGGTGCCTCG AAAACCCAGG AAGATGCCTT CATCGCGGAG GCTATGACGG AGATACGCGA CGAGCTCAAG GGCAAGGACA TGGCGCTCAA GGCGGAAGCT GTCATTAAGA TGTGCTACGT GAGTATCCTG CTGTCTAGGG CATTACAAAC CTGACCGTTA TACATATAGT TGATGATGCT TTACCCCATT CCGCCGCCGG CAGGGTTCGC CTTTCATGTC GTGGAGGTTA TGAGCTCCCC GAGATATCAT CTCAAACGTA AGTGACAACG GGTTCAGCAG CTTTCTATTA TTAATGCATA ACCTAAAGAG CTGGGATATC TTGCTGCACC TATGGCCTTC TCTGGAGATA CTGAAGAAAT TGTCCTTACC GTCAACGGCA TCAAAAAGGT AAATACCTTA GAGACCTGAC ATCGCATCTG CTAAGTGATT TACTGCAGGA TCTTCTGTCA CCTCATGTTC CTCTACCTCC CCTTCCACTC ACAGCCCTTC CACAACTTCT ATCTCTCTCC TCTTCATTAT CAACATCATT GCATCCCGAC CTCCTCCATC TTCTCACCCA CTCGTCGCCA CGTATCCGCA AACGAGCAGT TCTGTGTCTG TTACCTTGCT GGGAAGCATT CCCCGAAGGT TTGCGCGAGG GCTTCCCAAG GTTAAGAGAA AGGCTGCAGG ATGAGGATCA GGGTGTAGTG GGCGCCACTG TGGGCGTTGT AATGGAGTTG GCTAGGAGAC AGGGCGGGAA GAACTACTTG CCCTTAGCAC CCGAGTTGTT CGGGATCTTG ACGGGAAGTA GCAATAACTG GATGCTTATC AAGGTTGTCA AGCTGGTATG TCAATGTGAT CGCCATTTTA ATCACAGGCT GATGCAGCGT TCCAGTTCGC CATATTAACA CCCCTCGAGC CTCGTTTGGT CCGAAAGCTT CTTCCTCCTA TCACCACACT CATCTCCAAC ACCTCGGCTA TATCCTTATT GTACGAATGT GTGCGGACAT GTATCGTGGG CGGCATGCTG AACGCAGATA GACCAGAAGC CGACGCCCTT GCAAGAGTAT GTGTGGAGAA GCTGGGAGGC TACCTGAGGG ACGAGGGAGG TGACCAAAAT CGTGGGTACC GCCTTTCTCA CAGTAACCGA TATGCAAGCT TATTTGAGTA GTGAGATACA TTGCCTTGTT AGCAATGGTT AAAATTATTC CAACCCATCC ACAGTTGGTC GCAGAGTATC AAGACGAGGT TCTACAGAGT TTGGATGACC CCGACGTGTC TATCCGAATG CGAGCTCTAG AGCTCGCTAC GAACATGGTA AATCAATATA TCGTCTTGAC AAATATTAAC GTACTAACAC CAATCTAGGT CGATCCCAAT AACCTCCAAA CAATAGCAGA CACACTTCTG TCTCACCTTG CACCTGTCTC CCCTGTATTG CCATCAGCTG CCGCATCACT GGCGGCTATC GCTTCCTCAT CTGGCACGTC GAGTAACGCC TTGCCGTCTC TCTCGCCCGC ATACCGTCAT CTCCTCTCCA CTCGCTTACT AGCCATTCTT TCCCACAACA CCTATGCCAA CGTAACGGAT TTTGAATGGG TTCTCAGTGT GTTGGTGGAC GTCGCCTACG TCTCAAGAGT AAACGTTAGT CAGGATATCA AGAAGATGAT TTTGGACGTT GTTGCCAGAG TGAAGAGTGT TCGAAACTAT GCCGTGTCTG TCTTGGAAAA GGTGTTGGGA GATGACGACT TTAAAGAAAG GCTAGGAGAC GATAACGAAA GTGCCGACGG CCTGATTGAA GCTGCCGTTT GGGTCTGTGG CGAGTATCCT TCGGAGCTAT CATCACCTCT CTCTGCCATT TCCAACCTTC TTTCACCCTC TACCTCGACC ATTATTACTT CCCTTTCTAT ACAAGCAGTC GCCAAGATCT TCGGTTATTA CTGCACAATC GCTGCCTCTT CTTGGTCTGG AGATAAGTTT GAAGAGACCA AGGCGCTGGT AGCGAGCATT GACAAAGGTC TGACGGAGGT CGAGCGAAAT GCAAAGGGAG ACATGGAAGT TGTCGAACGA GTTGGCGAGA TCAAGGGTTT ATTAGGATTC GTCAAAGCTG ATCTCGAGCA CCATCTCCCA CCTCAGAATA CGTTACGAAG GGATAGTGGA TCTTCTATCC CTGAGCTGGA GGGAGGGTTT GAGGCCGAAG CAAAGCAAAC CAATCAGAAT GACCCACCAT ACCCGAAGTC ACTGTACATC TTCCCACCCC TATCCACGTC CCATCCTCTA AATGCAGTTG CGTCCTATGC GCAATCATCA ATCCGTATAC CAGAGGGACT TGATCTAGAT ACCGATCTTG TCCCAGGCGG TGGGTGGCCG GAGGATATTG AAGAAGTTGA CGAAAGCGAG GAAGAAAGAG AGAAGGGTCT ATTGAATCTG GGAGAAGGTG GTGGGGAAGG GATGGAAGAG TTGAGGAGGG TATTGAGAGA GGGCAGAAAG AAGAAGAAAG GGAAGAAGGG TGAAGAAGGC GAGGATAGGG TTGAAAGAGA GAGGGTAAGT TTGAATCTTA AGAATCACGG TTACATTGCT CAATTTTTTT GGCAGCGGAG GGCAGCGAGA CGGGCAAAGC ATAAAAATGA TCCATATTAT CTCTATGATA GAGAGGACGA GGACGTGGAC AACATTCCGA TTGTTAAGCT CGACGATTCT GAGCTACCAC GTAAGTGATT CATGTGTTGT ACCTATTAAG ACTGACACAA TATCTTTCCA GACGACATTA CAGATCCCTC ATCTCGGCCA AAATCCAAAT CAAGACAGAA GAAGAAAGCC CCGCCTGAGT TTGATCGCAC AGGCGAACTT CCCGAGGGCG TTTCTTCTCC CCAAATTCCA ACACCGTCTT CTCGCCTGAG CCCATCCGCA TCACGGATGA ATTCCACTAC CGGTTTGGCT GCCGTGGATC TTTCCGCCTC GGGCTCTATG AGCAAGCCTG TTTCAAGAAG TAGTAGCCGC TTTGAAGAAT ATAAGCTACA TGAGGAGGAA GGTCTTTCAG GTGCCACGTC ACAAGTAGAC TTCCAGAGGA ATGGTGGTGA GGATGTGCCT GTTGCCAGCG TGCCAGAAGT ACAAGTCGTG AAGGTTAAGC GGAAGAAGAA GGCGGGGGAA AAGAAGAAAA AGAAGGAAGG AAAAAGGGAG AGTCCAGCAG AATAA
|
Protein sequence | MFERTLQDLI RGLRAHKGAS KTQEDAFIAE AMTEIRDELK GKDMALKAEA VIKMCYLMML YPIPPPAGFA FHVVEVMSSP RYHLKQLGYL AAPMAFSGDT EEIVLTVNGI KKDLLSPHVP LPPLPLTALP QLLSLSSSLS TSLHPDLLHL LTHSSPRIRK RAVLCLLPCW EAFPEGLREG FPRLRERLQD EDQGVVGATV GVVMELARRQ GGKNYLPLAP ELFGILTGSS NNWMLIKVVK LFAILTPLEP RLVRKLLPPI TTLISNTSAI SLLYECVRTC IVGGMLNADR PEADALARVC VEKLGGYLRD EGGDQNPMVK IIPTHPQLVA EYQDEVLQSL DDPDVSIRMR ALELATNMVD PNNLQTIADT LLSHLAPVSP VLPSAAASLA AIASSSGTSS NALPSLSPAY RHLLSTRLLA ILSHNTYANV TDFEWVLSVL VDVAYVSRVN VSQDIKKMIL DVVARVKSVR NYAVSVLEKV LGDDDFKERL GDDNESADGL IEAAVWVCGE YPSELSSPLS AISNLLSPST STIITSLSIQ AVAKIFGYYC TIAASSWSGD KFEETKALVA SIDKGLTEVE RNAKGDMEVV ERVGEIKGLL GFVKADLEHH LPPQNTLRRD SGSSIPELEG GFEAEAKQTN QNDPPYPKSL YIFPPLSTSH PLNAVASYAQ SSIRIPEGLD LDTDLVPGGG WPEDIEEVDE SEEEREKGLL NLGEGGGEGM EELRRVLREG RKKKKGKKGE EGEDRVERER RRAARRAKHK NDPYYLYDRE DEDVDNIPIV KLDDSELPHD ITDPSSRPKS KSRQKKKAPP EFDRTGELPE GVSSPQIPTP SSRLSPSASR MNSTTGLAAV DLSASGSMSK PVSRSSSRFE EYKLHEEEGL SGATSQVDFQ RNGGEDVPVA SVPEVQVVKV KRKKKAGEKK KKKEGKRESP AE
|
| |