Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC00670 |
Symbol | |
ID | 3256475 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 190671 |
End bp | 194144 |
Gene Length | 3474 bp |
Protein Length | 869 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255284 |
Product | nucleus protein, putative |
Protein accession | XP_569358 |
Protein GI | 58264404 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.86397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAATGCGCCA GCTATACGCA AAATGAGCAA CGACGAGAGC AAGACAAACC AACAGCAAGC GCCTCTTAAG AGGAGACGTA TCACTGTGAG TCCGTCATCT TGGAGTTCTT GTAGGGTGCC CGCGGTTATT TTTGGGGAAA AGAAGTTGGA ACGACAGTTG TCATATGGCA TGTATACTGA CACATTGCTA TGCAGAGGGC GTGTGACAGA TGCCATAGAG GCGGTATCCG TGTATGTTTA AGAATCTTCT TCGGGATCAA ATGAGGTGGA TTGCTGACTG ATGGCGGATG TCCGATGTCG CTCAGTGTGC AGCCAGTTCA AACCCATCGG TATGCGCACC ATGCGCAGAC TTTGGTTCAG AATGCACTTA CAACAGGCCC ATGAAACGGC GTGGTGTAAG CCATTGTTCT CTACAGCTCC CTTGACGAAT ATATTGCTGA CAAATGATGT TACCATTGAT GCAGCCTCCG CCTTCGAAAG CGCGTGAAAG TTATGGATCG GCGTCAGGCA TTGCTCTGAC AAGATGGACA CTTCCTTTAC CGAGCGATAA CTGGACATAC CGAGAGATAG CCTCTCATGC GCACATAGAA TCGCTCGTTG AGGCGTTCTA TGCTATTGTG TACCCCATGT AAGTCAGAAT AACCTTCCGT CACATTTCCG ACTGACCAAT GGAAAGTTAC CCGATGTTTC ACTGGCCAAC ATTCACAGCA AATATCCGCC GTCGAGTGTA CACCACCTAT CCAGCCTTTC ACGCCTTGAC AATGTCGGTA TGCGCCATCA CTTCCGCACG TCTTCGTGAT GGCGCTGTTC CCTCGCCAAA TTCTAGCACC CCAACTTCCG ACCCACAGCC CCCTACGTCG GAGACTTTCT ACCAAGCTGC CGTAGCCTCG TACCCTCGAG ATATCACCAC CGCTTCAGAC TTCGATTACA AAAGGGCCAA GCCATTGTTG GCGACTCTGG CCATCCAGTA TGGTCAGATT CCTGCTGTTC ATGCGCATAT CGGCGACTAT ATGACTCTTT GTGCTATTGA TGGGTTCCAT AACGAGTCGA GGTGGCCGAA TGACCTAAAC GAGATTGAGG TGCAAGAGCG TCGTCGCTTG GTCAGTAATA ATCTGTCATG TCCCCTTAAT ACACGTGCTG ACACTTTTTT CTAGTTCTGG CTCGCATATC AGCTTGACGT ATATGCCGCG ACAACCTGGG GAGGTATTAT CCGTCATCGT GAATCACAAT CTACGGTCTT GTATCCAGCC GAAGTCTACT CGGACGAAGA AATAACCCCT ACAGGTATTG TAAAATCTGT CAATCCAGCT CATCCAGTGT CATTCTGGCG AGGATGGAAT TTTGTTGTCG ACTTGTACCG TATCCTCGAA CATGCTGTTA CGAGGTTGAG GGCAAGAAAC CACACTTTTG ACGCTGGTAA TCAGATTGCC GCTTTATTTT CAGAAGGTCG AGTGGGGTCG GGGACAGAAT TGAAGCCAGG CGATTTGTTG GTTCTTGTAG AGAGGCTGTA CCGTGCCTTA CCCCCAGAGC TAAGAGCCAC GAGTGAGATG ACGGGGGATG TTGAAAAAGA TCGTTATGGT TTCCAAGGTG AGTCTTCACA AGTGCAAAAA AAACCAATCC TTACGAGAGA GACAGCTGCA AACATTCTTG TGACAATGCA GACCATGAAG ATGGTTGTTG CAGGTATGGC CGAATGGTCC GTCGAACAAC GCTGTTCTAT TGCCGGTGAA CTTCTAGATG CGTTCGCGAC GGTACCTAGA GCGTACATCC AGGCTATCAG CACTCCTCTT GTAGGTCATC ACGCTGTGCC GCTAAGTCAG GTGCTTACTT TGGGGATAGC TTCATCATCT TGCTGGTGTT GGTCATCTAC TTGCTTCTAT CATTCACTCA CCCCTTTCTC CCGCTGCTTA TTTGCACGTT CGCACCGTAC TCCTCTCCAT GGCTGACCTC TTATCCTCAC TTGAGTCACA TCTGACTTCC ACTGGAGGCA TTGCCCCTAA GTTGAGAGAG CACGTGGAAA GGATTGACAG ATACATGACT AGTGCGACGG AGAGTAATGG CAAGCTGGCT ACTTTCGTGG GTACCTTGGC CTACATTCTT TGGGCATGTG ATACTAAGAG ATGTGTAATA GGCATCACCC ATCCATGATG TGCAAACAAG GCAACATCTA GCTACGCCTC ACCACGCTAA AGATACAACA CCCACTACTA CATTCTCTTT GCGCAACGTC AGCGCCGTTC CGGGAACTGC GAGCAGTGTA ACTGGTACAG GTAGCTTAGG CATGAGCACT GGTACAAATT CATCAAATGT TCGTACTACT TCAAGACCGT CCACGAGCGC CAACGCGGAT GTCTACAATC CGAGCCCGCT GGGAAATGTC AATGCGAAGT TTCAGTTCAA GCCTAATTCT ACATCTTCCT CCAGCTCTTT ACCTCTTCCT CATCTTCCCC GACCACCAGC ATCTCAAATC CAAAACACTA TTTTACCGTC AAGTTCATCC TCCGTTCCCC CCCCAATGCA TCTTCCATTC GGAGGTATTG GGGCAGGTGT AAGTGACACT GGTGCTGGCG AGAGTCTTCA AAGACGTGAG CCTCCAGTGA TCACACCTAG CTCTTCTCAT TCTTCCCACT CTACGCCATC ACAGTTCTCA CCCAAGTTCG TGAATACTTC ATTGGTCTCG CACCCCGTCC CTCCGCCATC TATTGGACAG CCTCAAGGCA ATTTGGCTGG TTTACCACTA CCTCTTCAAC CTTCTCATTC TCAGAACCAA TCCCAAAGCC GCACACAGCA CAACCAACCT CAACAACCGA TCGAAAATCC TTTCCACCTG ACGATATCAA ACTTCCCGCC CGAAACTCAG GAGCCAGCGT ACCAATTGCC CGACGATCTA TTTGTAGATT GGCCATTTTT ATTCAATGAG TTTGGGTTCC AAGGGGACGC GTTCGATTTC TTGAGTTCGG GTATCGGCGG CGCTGGGGGG GGCCCTGGCG GTGAGAGTGC GAGTACAGGT GTATTTGATG GTGTTCAGGC AGGAGCTGGT GGTAGTACAG CGCCGGTCGG TAACTTCAAT GGGGTAAATA TGACTGGGAT TGGTGGAGGC AGGATACCAC CCGCGCTTGA GGGTGCGAAC CTCCAGTAAA GCACGGCAAT GAGATTGGTG GTTTCCTTTC CAGTCTGGGA GCAAAAACTG AATCGCGCAG GCGCGTAAGT TTATGAAAGA CAAAGAAGAA GGTTAAGCAA GGAATGTGTC AGTTGGGGTT TTGTCTCTTT ATTTCTCTCT AATTTGATTC ATGTCTGAAT AGGTATTCTC GTGACACTCG CATCCTTGGA TCCTTTCATA TCTTGGAAAA AAATTCTATA TTTTCAATAC ACCTCCTGTC CATTATATAC AGGATTAATC CAATCACTCA GTGGGGTTAC TCACTTAATA GAACGTTACG ACGTTAAGTT GATGTTCTCA ATAATAGAAT CTTA
|
Protein sequence | MSNDESKTNQ QQAPLKRRRI TRACDRCHRG GIRCAASSNP SVCAPCADFG SECTYNRPMK RRGPPPSKAR ESYGSASGIA LTRWTLPLPS DNWTYREIAS HAHIESLVEA FYAIVYPIYP MFHWPTFTAN IRRRVYTTYP AFHALTMSVC AITSARLRDG AVPSPNSSTP TSDPQPPTSE TFYQAAVASY PRDITTASDF DYKRAKPLLA TLAIQYGQIP AVHAHIGDYM TLCAIDGFHN ESRWPNDLNE IEVQERRRLF WLAYQLDVYA ATTWGGIIRH RESQSTVLYP AEVYSDEEIT PTGIVKSVNP AHPVSFWRGW NFVVDLYRIL EHAVTRLRAR NHTFDAGNQI AALFSEGRVG SGTELKPGDL LVLVERLYRA LPPELRATSE MTGDVEKDRY GFQAANILVT MQTMKMVVAG MAEWSVEQRC SIAGELLDAF ATVPRAYIQA ISTPLLHHLA GVGHLLASII HSPLSPAAYL HVRTVLLSMA DLLSSLESHL TSTGGIAPKL REHVERIDRY MTSATESNGK LATFASPIHD VQTRQHLATP HHAKDTTPTT TFSLRNVSAV PGTASSVTGT GSLGMSTGTN SSNVRTTSRP STSANADVYN PSPLGNVNAK FQFKPNSTSS SSSLPLPHLP RPPASQIQNT ILPSSSSSVP PPMHLPFGGI GAGVSDTGAG ESLQRREPPV ITPSSSHSSH STPSQFSPKF VNTSLVSHPV PPPSIGQPQG NLAGLPLPLQ PSHSQNQSQS RTQHNQPQQP IENPFHLTIS NFPPETQEPA YQLPDDLFVD WPFLFNEFGF QGDAFDFLSS GIGGAGGGPG GESASTGVFD GVQAGAGGST APVGNFNGVN MTGIGGGRIP PALEGANLQ
|
| |