Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND04970 |
Symbol | |
ID | 3257296 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | - |
Start bp | 1361853 |
End bp | 1364819 |
Gene Length | 2967 bp |
Protein Length | 813 aa |
Translation table | |
GC content | 54% |
IMG OID | 638256433 |
Product | conserved hypothetical protein |
Protein accession | XP_570440 |
Protein GI | 58266568 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.96924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTCC ACGCCGCCTC AAACGCCCCC GCGACGAGCC CTCCGCTGCC GCCCGTGCGC CACGCCCTTT CGATCAAACT GCCCGAGCGC TCCTTCTCGC ACTCCACCTC CACCGACGCA TACACCCCCC TCACGCCCAC CAACGAGAAC GGCCACATCG GCATCACACG CAGGCGCCAC TCCGTCTCCT CGCCAAAGTC CCGCGCCGCA TCGCTAGACC ACCAGCCACG GCTCGCACCG AAAAACAGCA CCGCCTCGGA CTGGAGGATG AACGGCTGGA CAAAGCACGG TGAACCTTCG TCCTCCCCCG CGCCCGACGT GAGCGAGCAC GAGACCACCC CGGTCGCCCA TCCGATCGAC CTCACGCCGC CGCTTCTCCC TGCCGCTTCC AGTCTCCACC CGCTTCCCCG AGACCCAAAC AACATCCTTC CTCTCTCCGT CGCGTCCTCG CCGTTCGTCA CGCCCTCCGT CTCGCGCGCG CCGTCCCCTC GTCCTGCGCC CAGAGTGGAC AGCAGCGCCG AACAGATCAC AGTCTCTACT GGCATCGTTC CCGCGTCTTC GTCTGCGCCC GGCTCGAGCT CCAGTCCGAG GACTTCAATC TCGACGTCCT CGTCCCGTGT TTGGCCGGCA GGCCGATCGG CCAGCAGGAG CTCAGCCGAA GACGATGATA CAGGTCCCTA TGGCTCAGCG GCCGGACCGT CGTACCATCC CCGTTCATGG TTTTCTCGCG CAGCAGGGTC CATATCCCCA AAGATAAATG CGCCTACAGC ACCCCGCATC ATGCGACTCA GCTCGGGGAG GTTGACTGGG ACGAGACGGT GGGGATGGAT CTTTGAATGG CTTGGGGTGC AGTCTAAGCC TGAACTGCCG AAACGAGGAG GGCTGAGCAA GAGGAGCGAT CGAGAGAGGG AGAGATTGAT GGGTCAAGGG GTAAGGCGAA GAGGAGATGT AAAGATCCTT GGGTCAAAAT GGCTAGCCAG GGTTATAGCC TTTATACCCA CAGAACCGTG GAGCATTGTG CGTTCATCTT TTTTCTTCTT GCCTATTGCA ACACCGCTAA CTCGTCGCAG AGCCTCTTCC TTATCTTCTT TGCAGTCTTT GCAATCACAC TAACGTTTAC CATAAAGCAC ATCCTCAACC CAGATAAAGA AGCATTACCA TGGCGTCAAT ACTGCACGAC CACCTACCCA TCTCTCTATT CTCTGCAATA CCCCTCCGAC CAGCCTCATA CCAAACTCAC CCTCTCACCT CTCTCTCCTG ATCACCCCGC ATGGCCTTAC AAACCCCATA CATCTCCGCT ATGGACAGCG GACATGCCGC AAGCGGATCT TGATGCGGCG CTCGAACCTG TAGGCGTACT CCTCGGTGTC TTTACCACAG ATGCCGGTCT CGAGCGTCGG CATATGATAC GGCAAAGCTA TGCGAGTCAT TGGCGAAGTC GTCGAAAGGG GACGGAAGGA GTGAGAATCA AATTCGTGAT GGGAAGACCT AGGAAACGTT ATGAGAAGGC TGTCCAGCTC GAAATGGAAG GTGAATTCCT AGGCGCTCTG TTTAAAAGGA GGAATAACGC GCTAACAAAT TGCAGCATTC AATGACATTC TGCTACTGGA CATTGATGAA AATATGAACA ATGGCAAAAC GCACGCGTTT TTCTCTTGGG CTGCCGAAAA TGCATCTGTA CCGGACTGGG AATATCCATC CCATCCCCGA TCCGACTCTG ACTATGCCAA TTCTGGCACG GCAATCGAAG CTGCGCAAGG AGGAAATTTG CACGCCCCTG TTTGGCGAGG TGAAAAGAAG CCGCAATATG TTGTAAAAGC AGATGAGGAT TCGTTCATTA TGCTTGGAGA GCTGGAGAGA AGGCTAAGGG TAGTGCCGAG GATGAAAACC TACTGGGGCT GTGAGTTGTG CATCCCTCCG TACGGAAAAA AGATGGTGTA AGCTTACAAT AAACGTAAAT GTGTAGATCT GGTGAAAAAC AAATTCATGG CGGGTGAATG CTATGCTCTG TCTTTTGATT TGGTCGAGTA CATTGCTGCC TCCCCAGCGC TCAAAACTCT CACCAAAGGC AAGGAAGATA AGCTTGTTGC CAAGTGGATA GGGATGCATC CCCAAAAGGA GGAGATTGTC TGGTCGACGG ACAGATGTTG GATATATGAC CACCCCAAAG CTGGTACCGT GTAAGTGTCC AGATTTTTTC TTTTCTTTCT TTTCTTGCAC TCGACGACTT GCGCTGACAT CGAGAACAGT TACTCACACG GTTTTCTGTA TCCTTCCACA GTCGAACAAG TCCGCGTAGA AAATCAAACT GGACTTTCAC CTTTGACTCT TGCCCAGCGC GGCGGGCCAG GAGCGGCCGA CGCTTATTCC ACCGTCTCCA AATTCGGAAC CGCCTACCGA CCGCTTTCCA CCGACATGTC TGCGGCCGAG CAAGTAGAAG CTCTTGTCGA AGGCTCTCCT CTCTCCAGAT TAAATGAAGA TGAGCTATCC TCATCGAGTC GCAAAGTCCA GCAAGCATTT TCTCCCACAG AGTCTCTTCG TCAAAAGATC GATCGACTAT ACTCCTCAAG GCCGACTAGG ATAGAGAGAT TCTTGGGCGA TGAAGAGGAA CGAGGAGGAA CGGTGGTGGT ACATTATATA AAAAAGGCAG AATGGTTTGT GGAGACCATG ATAGCCATGC TGGGCACGGC CGAAGAGCAG AGAGTCTGGC ATCGTGGTGT AGGGAGTGGG CTGGGCGCTT TGGAGAGGCG AAAAGGGCGA GTGCCTGTAT CAGGAAATGG ACAGGAAGGA TTCGATGCCG GAAACAGGGT CAGACTTAAA AAGGAAGACG GCCTGTAGAT ACGTTGTAGA TTATATGACT CGAGTTTAAG AGTCGAGATT TGCTGTCGAA GATCACCAAT GTTATTCGAC CCACGCACGC ACGCTTCGTG CTGTAAAATA TCATAAATCT ATATCATCAT ATCATAG
|
Protein sequence | MPFHAASNAP ATSPPLPPVR HALSIKLPER SFSHSTSTDA YTPLTPTNEN GHIGITRRRH SVSSPKSRAA SLDHQPRLAP KNSTASDWRM NGWTKHGEPS SSPAPDVSEH ETTPVAHPID LTPPLLPAAS SLHPLPRDPN NILPLSVASS PFVTPSVSRA PSPRPAPRVD SSAEQITVST GIVPASSSAP GSSSSPRTSI STSSSRVWPA GRSASRSSAE DDDTGPYGSA AGPSYHPRSW FSRAAGSISP KINAPTAPRI MRLSSGRLTG TRRWGWIFEW LGVQSKPELP KRGGLSKRSD RERERLMGQG VRRRGDVKIL GSKWLARVIA FIPTEPWSIP HTKLTLSPLS PDHPAWPYKP HTSPLWTADM PQADLDAALE PVGVLLGVFT TDAGLERRHM IRQSYASHWR SRRKGTEGVR IKFVMGRPRK RYEKAVQLEM EAFNDILLLD IDENMNNGKT HAFFSWAAEN ASVPDWEYPS HPRSDSDYAN SGTAIEAAQG GNLHAPVWRG EKKPQYVVKA DEDSFIMLGE LERRLRVVPR MKTYWGYLVK NKFMAGECYA LSFDLVEYIA ASPALKTLTK GKEDKLVAKW IGMHPQKEEI VWSTDRCWIY DHPKAGTVYS HGFLYPSTVE QVRVENQTGL SPLTLAQRGG PGAADAYSTV SKFGTAYRPL STDMSAAEQV EALVEGSPLS RLNEDELSSS SRKVQQAFSP TESLRQKIDR LYSSRPTRIE RFLGDEEERG GTVVVHYIKK AEWFVETMIA MLGTAEEQRV WHRGVGSGLG ALERRKGRVP VSGNGQEGFD AGNRVRLKKE DGL
|
| |