Gene Information |
Locus tag | CND04280 |
Symbol | |
ID | 3257114 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1190945 |
End bp | 1194082 |
Gene Length | 3138 bp |
Protein Length | 695 aa |
Translation table | |
GC content | 53% |
IMG OID | 638256363 |
Product | hypothetical protein |
Protein accession | XP_570125 |
Protein GI | 58265938 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
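The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the source database. Because the gene is spliced, the difference between the gene length and the coding length is taken here to be introns plus any untranslated sequence included in the gene span.

```python
# Values copied from the Gene Information table above.
START_BP = 1190945
END_BP = 1194082
PROTEIN_LENGTH_AA = 695


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per codon).

    Assumption: the stop codon is not counted in the protein length,
    so it is added back when include_stop is True.
    """
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


length_bp = gene_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
# Remainder: introns plus any untranslated sequence within the gene span.
non_coding_bp = length_bp - cds_bp

print(f"gene length:       {length_bp} bp")
print(f"coding length:     {cds_bp} bp")
print(f"non-coding length: {non_coding_bp} bp")
```

The computed gene length agrees with the "Gene Length" field of 3138 bp, which is consistent with the inclusive-coordinate assumption.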
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.441651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCCC CGGACGACTC CCCCGCCGGC TTCCACGCCC TCCTCCACGA CACCCTCGCC GCCCCCCGCC TCTCGGCCTC GAAAGTCTCC CAGCTCTCCC GCCTCGCACT CCACAACGTC GCCCACGACC ACCGCATCGT CACCACCCTC TACAAGCTCA ACGCCGCCCT CCCGCCCGCA TCCCCGTCCC GGATATCCAG CCTCTATGTC TTTGATGCCG TCGCACGCGC GGCAAAGGCG GCAGTCAGCA AGGGCGTCGG CACCGAGCTC ACGAACGAGC GGGGAAAGGG CACACAGGCC AGTCTGCTGC GCAAGCTGGA GGGCGTCGTC GCCAGCTGGG TCGACGGCAT GATTGACGAT GGCAAACGCG GCGTGTGGGT CGAGGGCAGG GTACGTCGCG TTACTTTTTG ATATCGTGCT GCGTCCTGAC GTCGGCGCGG TGTCGTCTGT TTGTCATTAG GAGAAGACGC GGAAAATTGT TGACATCTGG CAAAAAGGCG CTACTTTTCC GCAACCGTGT CTAGATGAGC TCAAGGTAAA GTTGAATAAC GCTGTGGCGT CGGCTGGTCC GTCAAACTCT GCAGACAGTG CGGCTGTCAA GAAGGAACGG AAACCTTTAT CGTTAGCGGA CGGTGAACCC AAGGGATCAG GTAAATCTTT TCTTTCTCTT CTCTCACAAA CGTTTCTTCT GTTTCTTCTT CCGGCTTCAA TGGAGATCCT GTATGGTGCA ATAGTCTGGC AAGGGGTTAT GGCCAAGTGC CTCGACGGAG AGGTGGAAAA GGAATTCAGC AATCCAGGAC AAATTAGGCA CGCAGCTGCT TTTGACGGTT GCTTGTAAAC GTACCCTTGT AGGCATTTGT GCAGCAAAAG ACGACAAACA ATTGCTCTTA CTTTCTGTCA CTGATCGGCA GTGCGGGCGA TTGCAAGTTG AGGCTTGTTA CGGATATATA CCCTATCCTC ATCTCGTCCG GTAAAAAGAG TCACTCAAAT CCTCGTCAAC CATTTGGCAT GCATCTCACA TTGCGGCTTG CAGCCAAAAG AAAGACCAAA AAAAAAACCA AAAAAAAGCC AAGTGAAGGC CTAGATTAAT TGTCTCGTTC AATCCTGCAT ACACAATAGC CTCGAGTGTA TATACCGAGG ACAGCGACTG TCTTGAGGGG GGTGGGGGAC GGGGTGCATG CGGAAGCTCT TTTGTTCTAC GAGGTGCATC GGGTATCTAC CAAAGAAACC CCAGTACTCT TGTAGACCAG CCCCACCCCG TCCAAATCTA CACCCCCCCT CGTTGGTTTT GATCTGAGCG ACCTCGGGAA GACAGTTACC AGATCCCACT GCCTCAGGCC AGGAGAGAAC ATCCTTGTCG CCCATTTTCT CTTTTCCCCT TTTTTATTAT ACTGCATCAT CTTGTACCCA TTTTTGAACG CGTTACTGAG ACATACACTT TCCGTCCCCA CAGGTCCATA TAATCGATCG ACCACACCGC CCTATCCGCC ACCCGCTTAC ATTCTGGCAA AATATGCACG CAGGTCGTCA AAAAGTAGTC GTAAAGTGGC AGAAAGCAAA CCCGAGCAGC CCGGACAGGC TCAACCGGGC GAATTACCGC CAGAAGTCGC CAAATTGTTG GGAATCGAAA AGCAGGAATC GGCCAAGGAT GAAAAATCAC ATACACCCTT GAGGTGAGTC TTTTCGAGTG CTTTTCAATA ATAAAAGAAG GCCGGTATAG ACTGACCGAA TGCATGGCAA TTCAGCGGTA TTCCAGCCAA CGTTGCTGCC CTCCTTCCTT CTACTTTCGA TCCATCATCG 
CACTCTTCCT CGCCGCATAC AGTTCCGTCG GCGGCGGCAG CGGTAGCCTC GCAAGAGTCA ACAGCGCCGC CACCGTTGGC GATTAACCAC GAGCAATTGG CGGCATTGGC TAAATTTGCA AGCAACAGTC AAACTACAAG CGGCGGGCAG CATGAATTGC ATGAATTGCC CCCTTTCCCT CCTCCACATC CTGTTCCGTC TGGCGTCATC ACCCCACAGC CGTACATCCC ACCGCCCAAA CCCAAACCCA TCTCGCCCCC TGGAGCCCGG TATCATACTT CCTCTTCTTC TCGCAGCGAT AGAGGTCTTG GGGAAAGAGG TTCACCTAGA CACATTGATA AAGAAAGAGG AGAACGAGAT TACCAACATC GACAAGATGA TCATTGGAGG GCGGGTAGGC AAAGAGACAA TCGTTCCCGG TCACGCTCGC CTGATAGACG TACGAATGGT CATCAGCGGA TGAATATACC TCGCCGCCGG CCGGCTGCAT CGTTACCCAA TCCACTGCCA CCACCACCAC CGGCCGCGAG ATTGGGTCAA GGTCGAGGTA TCGGGGAGGG CGATGGAGAA GAGGATATGG CGCTAGATGT TTCCGACGGA GAAGAAAGAG CAAAGCAGGC CGGTGCTGCC CAAGCAGCAG CGTATCCTCC CCCCATCGTC GCCCCGCCAT ACGTGTCCGT ACCGCCTGAT GTCGTCGCTC CGCCATTTCC TCCCCCTTCT AGTGCGCTGC AACATCAACC ACCACCCATA TCCATGCCCA TGTCATCTGG ACAGCCTATG AAACGGCGAA ATGACCTCGT CACTCTGCAT ACTTTTCCAC TCCAAACGTT TGATCCCTCC AGTGCGGATT CATGGGCAAA ACTCGGTGAA GCTTGGCAGA ATAGTACAGG AAGGAAGCCG GAGCAAATGG AATTGATGCA TTGGCTGGCA ACCGGCCAGG TTATGGACTT TAAGATGATG CTTATGGGTA CGGGCCATGA GAATGGAGAT AGCTCGAGTC AGAGTCACGT ATCACCAGGT GACATGAATG GGCTATCGTC AGCAACTGCA AATCCAGCGA TAGGGCTACT GCCACCGTCA CAGATGAGGA TGGATGTTGG TGCGGGATCT ATAGGTCAGC CGCCGCATTT GTCTATAGGA GGAGGAGATC AGTGGAGGAA TGTGCAGGGA CAAGCACAAT GGGGCAATGA CGATTCTCAA AGGCTGTGGG GAGCACAAGG TCAGGGGTAT TAAATGTATA TATAGATATA TTTTTCCACC AGTCAGACCA GTGGAATCAT TATGCTAGTT GATTAGATTA TACAGCAGTT CTCTTTTGAA GAGAATTCCT TGAGGAAT
|
Protein sequence | MSPPDDSPAG FHALLHDTLA APRLSASKVS QLSRLALHNV AHDHRIVTTL YKLNAALPPA SPSRISSLYV FDAVARAAKA AVSKGVGTEL TNERGKGTQA SLLRKLEGVV ASWVDGMIDD GKRGVWVEGR EKTRKIVDIW QKGATFPQPC LDELKVKLNN AVASAGPSNS ADSAAVKKER KPLSLADGEP KGSGPYNRST TPPYPPPAYI LAKYARRSSK SSRKVAESKP EQPGQAQPGE LPPEVAKLLG IEKQESAKDE KSHTPLSGIP ANVAALLPST FDPSSHSSSP HTVPSAAAAV ASQESTAPPP LAINHEQLAA LAKFASNSQT TSGGQHELHE LPPFPPPHPV PSGVITPQPY IPPPKPKPIS PPGARYHTSS SSRSDRGLGE RGSPRHIDKE RGERDYQHRQ DDHWRAGRQR DNRSRSRSPD RRTNGHQRMN IPRRRPAASL PNPLPPPPPA ARLGQGRGIG EGDGEEDMAL DVSDGEERAK QAGAAQAAAY PPPIVAPPYV SVPPDVVAPP FPPPSSALQH QPPPISMPMS SGQPMKRRND LVTLHTFPLQ TFDPSSADSW AKLGEAWQNS TGRKPEQMEL MHWLATGQVM DFKMMLMGTG HENGDSSSQS HVSPGDMNGL SSATANPAIG LLPPSQMRMD VGAGSIGQPP HLSIGGGDQW RNVQGQAQWG NDDSQRLWGA QGQGY
|
| |
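The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence; the helper names are hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence, used here only as a small example.
example = clean_sequence("ATGTCCCCCC CGGACGACTC")
print(len(example), gc_percent(example))
```

Applying the same two functions to the complete gene sequence gives a value that can be compared with the reported GC content of 53%.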