Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL06220 |
Symbol | |
ID | 3254961 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 720749 |
End bp | 723847 |
Gene Length | 3099 bp |
Protein Length | 805 aa |
Translation table | |
GC content | 51% |
IMG OID | 638254097 |
Product | expressed protein |
Protein accession | XP_568236 |
Protein GI | 58261652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.067136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTTAACTGA ACGGCGACTG CCCTCCGTCC GGCTTCACCC TTGCTCGCGA TATACGTGTC ACCTCGGCAT ACGCAGCCGC CGACCTTCAC AGTACAACAT TAGATTCATC GTATATAGAT CCTATATAGT ATCCTGCTCG CAATGCCTTC CGCTTCAAAT GGTTGGAAGC CATCTACGTC GCCCGTAGGT GAAGGGTGAG TACATTTGTC TCTTTGTCGT GTGTGATCTG CATGCATGGG AAATGCTGAT CTCCTTTGAC TCCACGTTTT CCTTCCCCTG CGCCCATGCC CCGTAATACA CTATTACACT GGTTTTACGA CATCCGCTTC CCCTTATCGG CACTCCTTTA TCCGCTTTTG ATAATACGCA TCTTACACTT GCCCGAATTA TATTTTATTT GTCAACTGTC AATTCATACA TGTTCATCGA TATCGGTCTA CATCTACAGC CTCACTGCAC CTAGAACGCC CGAGCAATCG ACAAGCTCCT TGGCCGCCAC TTCCCCCTCC TCTTCCACCT TCCCTCGGCC AAAAAAGCTC AGGTCGAGGA CATCGACCAT GACCAACATC TACACTCCCC CTCTTCCGCC CGTCATGTCA CGCGAAAATT CTGCAGAAGA TGTCCGGGGA GAGTACGGTC AAGATTCATA TCGTCCTCCG CCGGGTACTC TTCTTCCGTC AGCACACGGC CGCCGAGCTT CCAAATCGGT GTCATCTGTT CGGCCGAGTG TAGTTGTCAC TCCTTCGAAA TACTACACGC CTCAAACGCA CAATCAAATC TCTTTTGTCA ATACGCCAAA TTCCAACACC GGCAATGGTG TTATGGACAG TCCCGAGGCG GATAAAAAGA ATATGCTTAA GAGGCGAACA TCGACGCCGA GTTTGTTTAA GCGTGCTACT CGAGGTGACG AAGACGAAGA GGACGAAGGG TTGCGAGGCA AATTGATGTC AAAGGAAGAG CCTCAGCAAT CCCAACTGTT GCAGCAGCAG CAAAGATCTG AAGTGAGAGC GAGGTCAACG TCTACCAGTG TGGTCGTTCC TCAGACCCAA GGCCTGACTG AGCACTCCAC TTACACTTCC AACCCTATTC GCCCTTCGAT GCCACCCAGT TCATTTACCC AACAACCCCC TCCACCTCGT CCATGGTCCC CTGGACCTGC TAGTCCTAGT TCACATGTCG AAGGGGCCTC ACATGAAAGA TCTTCCGGTT TCCTGTCGTC GGCTTACAAC TACACCGAAT CAGGCATTGT GCAACTGTTC AATTATGTTC GACCTTCTTC CTTTTCACAT CATTCATATG CTCGACATGA ACCTTCCGAT GTTGATTCCG AAAAAGGTCT GAACGGTTCC GAAGACGAAA CGGAGGAAAG CAGCGTTAGC TACTTTACTT TACCACCTAC CCCTCCCGAA CAATCCGAAT TCTCGTCCTT TGCCTCTGCC TTGCCATCAG ACTCTCTCCC CACCCCGACA CTCTCCACAC AATCGCTGTC TCGAGACGAA CCAAAACGTG GCAAACTACG CAGGGCATTT CGTAGACGAT CCGGATTGGA AGGTGATAAC GGGAATGGAT TGTTGTCTGC TGTTTGGAGC AAGGTCATGG GAAGTGGGGG TGGGAATGGC AAATTGAGCG AAGTGCTGAG AGATTTGGGC TGGATTGTTG GAGTATTGGC ATTGACTTTT GTGGTGACGC TTGGAATTGT GATTTGGCTG ATCCAGGGGA TGCCCATGTG AGTGTGCTGT CTTAATCGAT CCATAAGGAC CTTTGGCTGA CAGTCTCGAC AGCACCACGC TGAAGCACAT TCCTCAATCG ACTACCGATG TCCAACTGCT GTCGGCTGAG ATTCGAGGTT ACATGGCTTC CAGCAGTTAT GGATGGTGGC ATACAGTTGG AGTATTGACC TTTGTTGGAT GCTGGAAGCA TGCCTGGAGT GTCCCTGGAG CTGTCGTCTT GGTGAGTTTT CACCTTGCGA AGTCGAGTGA AGATTTGTTG CTTATCCATT TTCAGAACAT TCTGGTTGGA TCTCTCTTGG AACCCATGCC GGCCCTTGGT CTTTTGACGA TTATCACCGC GTCCGGCTCT CTTGGTGCTT ACCTCCTCTC CCGCCCGCTC GCCCCTCTTA TCGCCGTCCT CTTCCCTAAA CCTCTTGCCC TCGTTCGGGC CGCTCTCGCT CCCGAGTCTA TCCCCGCTCC AGATTCTGTT GAACCAACAC TCAATGAAAC GATCACCCCT ATTCAAGCAT CTTCCGACCC TTCTGCACAA GCTATTGGCG GTCCTACTGA AGCATCTACC ATCTGGCGAA GGCTGCTGGT CATGCGTGCG ATGGGCTTTG TTCCTTGGAG TGGTATGAAC GTTGCGTGTG GTGTAGTAGG CGTTGACTGG AAAGTATTTT GGCTCACGAC CGCAGCGGGA AGTGCGAGTT GGAGTTATGT CACTGCTAGT GTCGGGAATA TCTTGTCGAG GCTCAAGGTG CCCAACTCGG CTATCTCTGC AGCACCGGGG GAGATGACTG GGGAGAGTTT GACGAGTCTG TTGAGAGACC CGGTGTTGAT CACCAAACTT GTCTTCCTTT CGGGTTTGAC TCTCTTACCT GTCATCCTTA AACGCCGATC ACCAGCTTCC CCATCACCCA CTCCCCGCTC CACTTCGTCA TTCGAGCTTT CGGAACTGCC TACTTCATCC TCTGCCTCTT CATCTAGACC TCTCAATCCC AAGATCAACA CCCTCCGTCT TTCAGGGCTC GATAACCAAC CCATGTCACC ACTCTCCCAG AGTTTGGCCA AGTTCACCCC CACGCCTCGC ATATTTGACC TTCTCAGTTT TGGACGGATA GCGGTGAGGC AGAGTGGAAG GATTGTCGTC GGTGGCGTAA GGAGTGTGGT TGGAGGAGTG AGGGGGGCTG TGAGGAGTGT GACGCAACAA TAAGCATATT CTTTTCTTTT TTTTTTCCGG TTCATCAGTG TATCTCATAT ACTCATCTTG TGTTATTACT ATCTCAAATT CGCTGTAGGC GTTTATCTGG ATTTGCTCTA GCATAGTACA GTGATAATCT TTTACAGAGC TCTTTCATCA CTGGAACGAT TTTTCATGTA GATTAGTCG
|
Protein sequence | MPSASNGWKP STSPVGEGLT APRTPEQSTS SLAATSPSSS TFPRPKKLRS RTSTMTNIYT PPLPPVMSRE NSAEDVRGEY GQDSYRPPPG TLLPSAHGRR ASKSVSSVRP SVVVTPSKYY TPQTHNQISF VNTPNSNTGN GVMDSPEADK KNMLKRRTST PSLFKRATRG DEDEEDEGLR GKLMSKEEPQ QSQLLQQQQR SEVRARSTST SVVVPQTQGL TEHSTYTSNP IRPSMPPSSF TQQPPPPRPW SPGPASPSSH VEGASHERSS GFLSSAYNYT ESGIVQLFNY VRPSSFSHHS YARHEPSDVD SEKGLNGSED ETEESSVSYF TLPPTPPEQS EFSSFASALP SDSLPTPTLS TQSLSRDEPK RGKLRRAFRR RSGLEGDNGN GLLSAVWSKV MGSGGGNGKL SEVLRDLGWI VGVLALTFVV TLGIVIWLIQ GMPITTLKHI PQSTTDVQLL SAEIRGYMAS SSYGWWHTVG VLTFVGCWKH AWSVPGAVVL NILVGSLLEP MPALGLLTII TASGSLGAYL LSRPLAPLIA VLFPKPLALV RAALAPESIP APDSVEPTLN ETITPIQASS DPSAQAIGGP TEASTIWRRL LVMRAMGFVP WSGMNVACGV VGVDWKVFWL TTAAGSASWS YVTASVGNIL SRLKVPNSAI SAAPGEMTGE SLTSLLRDPV LITKLVFLSG LTLLPVILKR RSPASPSPTP RSTSSFELSE LPTSSSASSS RPLNPKINTL RLSGLDNQPM SPLSQSLAKF TPTPRIFDLL SFGRIAVRQS GRIVVGGVRS VVGGVRGAVR SVTQQ
|
| |