Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE01600 |
Symbol | |
ID | 3257819 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 440222 |
End bp | 442999 |
Gene Length | 2778 bp |
Protein Length | 728 aa |
Translation table | |
GC content | 50% |
IMG OID | 638256750 |
Product | conserved hypothetical protein |
Protein accession | XP_570779 |
Protein GI | 58267246 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5533] Ubiquitin C-terminal hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGCTT TCCGCAAGTT TGTCGGTAAC AAAAACGCGG GCATATCTGC ATCAGAGGGT GATCAAGATC CAACCGATGC ATCCTTGCTA CCGCGCGCAA ACGCCGACGA AGAGCGTCTG TGGGGGATAG AAAACTTCTC CAACTCTTGC TTCTGCAACT CAGTTTTGCA AGCTCTATAC GCGTGTTCCA CGTTCCGCGA TTTCGTGGAA GCTTATCCCG ATATCCTGCC GCCTCGGCGA CCGATTGGAC CTTCACAGGT GGAAAAAAGG TATCCCAGCG TCGAATGGGA TGGCCCAGTT CCGGGATGGG ACCCATCCAT TAGACTAAAT AAGGAACATA GGGCTTTCAT TGCCCTGCAG GAGGCGAATA CTTCAATGAC TCCTGGCGGA AAGGAAAAAA GGAATTGGAT GGGTCGGAAA ATGTCTTCCG CACAATCCGC TCCGACATTG GCGACCCTTC AAGCCAGCCA GCCCGAACCA CTTCCAAGTT TACCAAACCC TTTCCCCAAT TTTGACGATC CTCTGCTGGT CAGAACGCCT GCATCGGACA ATGACCCGCC TCTTACATTG TTCCAGACTT TCCAGACGCT TTTCTACTAT TTCTCACATT CTCCGCCTCA CATGCCCATC AAAGGCAAGG GTCAGACGAA GGACTCTGAA GGCGCTCTCA TTGAGTATAC CGAAGAGGAA AAAGTGGATG ATTCGGGTGT CGAGGTGCCT GAGAAAGCCT CTTCATCGAA TCAAGGCCAG CCCGCTCAAT CAACTTCAGC ATCGTCGCAG CAAACTCAGA GTCAAGGCCA GAACGCTCAG TCTTCAGGTC CTTCGAAACT TGCCTCCCTT CCACCGCCCT CCACATTTCG TGAAACAGGT GCTTGGCGAG CAGGTCAGCT TGGCTGGGGT GTTGTCCAGC CTAACGACGT CATGGATGCT GTCAAGCGCT CTGCGCCATC TTTCAACAAC GACGATCAAC ATGATGCACA CGAATTCTTC AGTGTTGTAG TCAACACTCT CGCCAAAGAA GTCGACGCTG TCAATGAAAA ATTGAGAGCG CAAGGGAAAG AAGTGGCGAA GATGACTGCG CCCTGGGCAA AGACATTTGT CGAAGCACTA TTTCAAGGTA TCACTACCAG TGAGACAAAG TGTCTGAGCT GCGAAACAGT GAGTTGCGTG TGCTTCATTC TTCGAACTTT TCTGACTACG CACGTGCCTT ACTAGATATC TTCTCGAGAC GAAGAATTCA TTGATTTGTC TGTGGATATT GAGCAGCACT GCTCTATCAC CTCCTGCCTC CGCCAATTTT CCTCAGACGA AATGATGTCC GGAAGGGAGA AATTCTCTTG CGAGTCCTGT TCTGGTCATC AAGAGGCGAA GAGAAGGTGA GTGAGCCAGG TTCAGCGCCA TAACATCGCT GATCCATTTC TATAGCATTA GGATCAAACG GCTCCCGCCT ATCTTAGCTG TCCATCTGAA AAGATTTGCC CACAATGAGA GTTATAGAGC CATCAAGTTG TTTTACCGGG TAAATCATCC CACGACTCTG ATCCCACCCA ACACAACAGA TAATTGTGAA AACCCCGACC AAATTTATGA CCTTGTGGCG ATTATGGTGC ACATTGGAAA GTATGTCAAA CTTTGAGTAT TATTATTAAT ACACTGTCGC TCACCTTTCA AACTAGCGGT CCTGTTCAAG GCCATTATGT AACGGTCAAG CGAACGCCTT CTGGTCGTTG GGTCATGTGT GATGACGATA ATATTGAGGC CATTGAGGAA AATCAGCTAG AGTACTGGCT TGGTAACCGC ACCCAGGGCC AAGGCTATGT CTTATTTTAT CAGGCGCGGG GCATCACTGC CGAACACTTG GGTTTGAAAG TGGAAAAGCG AAAGCCCAAT GGAGTATTTG AGCCCGTTGG AGAGGGCGTC AGATATGCTG ATGGTTATGG ACAAGCTCCC GCTATGGCGC CGGTAGCCTC AGCTACCCTG AATAATGTCA GGACTGTGAA TGGTGTAAGG GAGGAAGAGG AAGAGGAAGT CGCCTCGAGT TCGGGGTCAA TTTCAGTCTC TGTACCTGGT CTAAAGCCGT TGTCATTGGC TTCGCCAGTC TCGTCACCTT CCGCTACACC CAGCAGTGTT GGCTCCCGCG TTGATCGCCA ACTTGGCTTC AATACACCCG CAAATGGGGT AGCGCATGAG TCTACGGCAC GACCACCACT CAAAAAGGAA CTTAGCGACA AAAAATGGTT TCGTCGCATG TCCATGTCAG GTATATCCTC ATCTGCCAAG GACAAAGAAA AGACCTCTAA TAGCAAAGAC AAACCGGTTA ATGGCCTTTC TAACGGAGGT ACTACCCCCC TTGCCGAGTG CCGGACGGAC ACGTCATCCT CGGCGGACGT GACCTCCCTA ACGACTGGTC AACCAACGTC AATCCCGAAG TCGCAGAAAA GATCCCTAAA CAGTGCCATA CCCTCTGCCC GCTCTTGGAT GGGAAGGGCG GAAAAGACAC ATGGAAAAAT ATCGAGATGA TTGCGCACAG TGTGGAGTTG AGGTCCGTAC GCGAGGCGGG ATTGAGGTGC GAGATGGAGG AGAGAGCGCT AGCGGAAAAA GTGAAGCAGG TTTGGCTACA AAGGGAGAGA AGGTAGGGAT GGGAGGAAGG TTGGTACTTG GGATAGGCAG TGTATGTACA GTGTCAGTAT GTAGATGGAG TCGTTGTACA GTAGGTATAG GGGATTATAC GAAAATACAT CTTCACAGGT TCTCACTATG CAATATCTCT GCTATATA
|
Protein sequence | MSAFRKFVGN KNAGISASEG DQDPTDASLL PRANADEERL WGIENFSNSC FCNSVLQALY ACSTFRDFVE AYPDILPPRR PIGPSQVEKR YPSVEWDGPV PGWDPSIRLN KEHRAFIALQ EANTSMTPGG KEKRNWMGRK MSSAQSAPTL ATLQASQPEP LPSLPNPFPN FDDPLLVRTP ASDNDPPLTL FQTFQTLFYY FSHSPPHMPI KGKGQTKDSE GALIEYTEEE KVDDSGVEVP EKASSSNQGQ PAQSTSASSQ QTQSQGQNAQ SSGPSKLASL PPPSTFRETG AWRAGQLGWG VVQPNDVMDA VKRSAPSFNN DDQHDAHEFF SVVVNTLAKE VDAVNEKLRA QGKEVAKMTA PWAKTFVEAL FQGITTSETK CLSCETISSR DEEFIDLSVD IEQHCSITSC LRQFSSDEMM SGREKFSCES CSGHQEAKRS IRIKRLPPIL AVHLKRFAHN ESYRAIKLFY RVNHPTTLIP PNTTDNCENP DQIYDLVAIM VHIGNGPVQG HYVTVKRTPS GRWVMCDDDN IEAIEENQLE YWLGNRTQGQ GYVLFYQARG ITAEHLGLKV EKRKPNGVFE PVGEGVRYAD GYGQAPAMAP VASATLNNVR TVNGVREEEE EEVASSSGSI SVSVPGLKPL SLASPVSSPS ATPSSVGSRV DRQLGFNTPA NGVAHESTAR PPLKKELSDK KWFRRMSMSA KTNRLMAFLT EVLPPLPSAG RTRHPRRT
|
| |