Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF04250 |
Symbol | |
ID | 3258177 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 1233521 |
End bp | 1236757 |
Gene Length | 3237 bp |
Protein Length | 606 aa |
Translation table | |
GC content | 45% |
IMG OID | 638257543 |
Product | expressed protein |
Protein accession | XP_571705 |
Protein GI | 58269098 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.583211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTAA GTCAACATCT GATCAATTTA GGGCGACTGA GCCCTGGCTG ACCGATTTAT TTTGTTCCAG CCATCACTCT ATCTTCTGAT AGATTGACTC TGGGGTAGGT AAAACTGTGC GGGAAAGATA AGATGGATTG CTTACCTATG ATAAAGGGTA AATAAGACGT TGGGCGCTAT GACAGGTCTT GAGTTTGACA ATGTAGATGT CTTGGGTCCT GTGTCTGGCC GTATTGGTAT GGGGTGAGTG ACATCTTCTT TCAGTCGATT GTGAGAGGGC GGATCTCACA TGTCATTCTG GTAGCTATTT TGACTGCTAT TGCATTCCTG CCGTAAATAC TTCAGTATGG ACACCGGACA ACCCAAACCC ACAATCATCT CGTCATTGTC GGTGTTCATC CACCAGCTAA ATATTCCATT GAATACTCAT GTGTTATCGT AGATTATGTC GGCCAATCTC AGATGGCCAG TTTCGAAGTC CATCAAGGGA ATGACAGAAA CAATAACCCT TGGATTGGCT TGATCATGAG CGAAACGGTT CGTCTAATCA TGTAGCTTAT ACCTGAAATA TCAACTGATG CCTATCAAGT ATGAGGGTAC AGGTCAATTC TTCCAGCAAT GGTGGTTCTT ACGTGATGGC GAACCTGGCT ACCACACCTT TACTAGACTC CAGTTCGACA ATGGGACTGA GGGTATTGAT TTAGGTAATT TGCAAGAGTT CCGCCAAGTG GTGAGTACAA CCGCCGGCTC TATGACTTGC GTGTGCACGC TTGCCTAATT TAGTAAAACA GATCTCCCCT AATAGCGACA TCTGGACCCA TCTTGTGACG AACGACGTCC AATACGGTCA TCTCCCCTCT GATGAAGCTA CGGGTAACCA GACGTGAGTT ACCTGTGTTG CATTACCCGT GCATTCATAT TGATGTGATT TGCAGTACTG TTCAAGATGC TACATGGTAC TACAACATCA CTGAAGGGAA CCCCTATAGA ACCGAATCGT GAATCCTTTG TTTTCAAAAT TCAGCGCAGC TCGCACTTAC TTGAAGTTAG TTCTGACTAT TTTACCAAGT ATGAATGGGC AGATCTCTAT GGTAAGTTCC CCAATCTGCG AAAATGGCTC TATGTTGATA AGGCTATTAA GGTGATCACT TTGCCCACGG TTTCTATGCC GACGGAACAG CCTCAAATGG TTCAACCCTC GGCTTCTGGA CTGTGTTCAA CAATCTCGAA GGATACACGG GTGGACCGTT GCGCTCCGAT CTAGTGGTTG ATACCAACAT CTACAATTGT ACGTTATTCT GAGATTACCC AGTTTTGTGG ATGCTGATGA CGCGTTGGTG TAGATTATGC CTCAAATCAC CGTGGTGCCT CAACTATGAA TATTACCTCT GGCTTTGATA GGATCTTTGG TCCGACATTC ATCTATTTGA ATAAGGACGG TGACCTTCAT AATCTTTATG ACGATGCAAA GAGTTACGCG TGAGTGATTG GATCGGCATC TTGGAAGTGT ATGCTTATAC TGATGTTGCT TTAGCAATAC TTCTTTCGCT GCCGACTTTT ACGACGATGT CGCCGACCTC ATTCCAGGTT ACGTCAATTC GTCTGGTCGA GGCGACTTCA AAGCTCAAAT TAGCTTACCG GAGGGTGCAT CCAACCCCAA AATTATCCTA GCGCAGAGCG GAGTTGATCC TCAGGATAAT GTGGATTACA GTGAGTCGTT GCTCGAGCAA TGAAACTAGG GCATCTATCT GACATTGAGG TGTATAGCTG CCAAGCAGTA CTGGACCAAT GTATCATCCG ATGGCAGGGT CACCATCCCT CGTGTTCAAG CAGGCACTTA TAGAGTTACC CTTTACGCTG AAGGTAATTT TATCCTTTCT AAAGCAAAAC AGTTGTACTT ATATATACGC GTAGGCATCT TTGGCCAGTT TGAGCAAGAT GAAGTTGTGG TCAGCGCCGG TGACGGAGAT GGTGCTGAGT TCCGAATCAA TTGGGAAGCC GAATGTCACG GTGAGTAGTA TATCCCTTTA ATTTTGACTT GATGAACGTT GCTAACAGCA TCCTATTCAA GGTATCGAAC TTTGGAGGCT TGGGGTCCCA GATAAGGTGC GTTTTGACGA GGTGATCCCA TACATACCTG TACAACGGAT CACTGAGTTC AATTCCATAG ACCGCCGGCG AGTTCCTTCA CGGCTTCGCC AAAGATCCCG ACCATACCCT TCACCAAGAA GAATATCGAG AATATTGGGG TGCATGGGAC TTCCCCACCG ATTTCCCTGA CGGTGTGAAC TACACGATCG GCACCAGTGA CCCTTCGAAG GACTGGGTAA ATGCTGCTCT AATGACGCAC CTTTGCATTG CTGACGATTT AATACAGAAC TACGTGCATT ACAGCCGATA TGGAGGCTCG TTTACCCGAC CGGAGTATGT ATTGGATAAC GTCAATGCCT GGACTGTCAA CTGGGTACCC GAAGGTGGGC TGGATGTCAG TGGGAAGACT GCGTAAGTTT GGTTGCAACT TCTTGCAAGT GTCCGATACT GTACTGACAC TTGACCGCTT TTGGCTGCAG TGCTCTTACT GTCCAACTCG TATGTTCTTT ACAGCTTTTT TCATCTTATA CTGACCCTCG CTAATTTCTG TGTATTTTCT TCCTTTAGGC TGGCGCACGA ACTGCTTCCG GTAATCTTGA CTTACCCGAG CCGACTTCCA ACTATTCCAA TGTCGATTAC ACTGTCAACG TGAACGGTAA CCCTCTCACT TGGACCATCC TTTACAATCA GTCTTCATCT TGCTCAGATA GGAGTGGCAT TGCTTGTTAC AATTTGAGGA ACGTGTAAGT CCCGCATATT TTTTCGTAAC GATTTTTTCA AGTCAGACGT TGAAACTTGC GGCAGGTTCA CGTTCCCTGG AGAATGGCTG AAGAACGATA CCAACAACGT GTTCGAGTTT GCTCTGCCTT ACAATGCTAG CGGTGGGGAT GTGAACTTCA GGAATTACTC GATTTCTGTT TTGTGAGTCT CTTTACAACC CTCTAGTTTG CATGGGGATG CTTATATGTC GCTACAGGTA TGACGCTATC CGGCTTGAGC TCTCCGATTA GATGCCAAAT TTGAGCGTGA AAAGGGCTTT TGTACATATG TAAAAAAGGA TACTTTTCTG TATCATCAAA TGTCAGGGAA AAAGAAATTG GAAGAAGTTT GAACATTGTG TAAGAGCTTT ATAATGAGGT ATTTTGT
|
Protein sequence | MTTLGAMTGL EFDNVDVLGP VSGRIDYVGQ SQMASFEVHQ GNDRNNNPWI GLIMSETYEG TGQFFQQWWF LRDGEPGYHT FTRLQFDNGT EGIDLGNLQE FRQVISPNSD IWTHLVTNDV QYGHLPSDEA TGNQTTVQDA TWYYNITEGN PYRTESSDYF TKYEWADLYG DHFAHGFYAD GTASNGSTLG FWTVFNNLEG YTGGPLRSDL VVDTNIYNYY ASNHRGASTM NITSGFDRIF GPTFIYLNKD GDLHNLYDDA KSYANTSFAA DFYDDVADLI PGYVNSSGRG DFKAQISLPE GASNPKIILA QSGVDPQDNV DYTAKQYWTN VSSDGRVTIP RVQAGTYRVT LYAEGIFGQF EQDEVVVSAG DGDGAEFRIN WEAECHGIEL WRLGVPDKTA GEFLHGFAKD PDHTLHQEEY REYWGAWDFP TDFPDGVNYT IGTSDPSKDW NYVHYSRYGG SFTRPEYVLD NVNAWTVNWV PEGGLDVSGK TAALTVQLAG ARTASGNLDL PEPTSNYSNV DYTVNVNGNP LTWTILYNQS SSCSDRSGIA CYNLRNVFTF PGEWLKNDTN NVFEFALPYN ASGGDVNFRN YSISVLYDAI RLELSD
|
| |