Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE00050 |
Symbol | |
ID | 3257894 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 6398 |
End bp | 9319 |
Gene Length | 2922 bp |
Protein Length | 666 aa |
Translation table | |
GC content | 49% |
IMG OID | 638256586 |
Product | expressed protein |
Protein accession | XP_571077 |
Protein GI | 58267842 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGATACTTA TTGCAACACT CAATCATCAA CAACGCTTCC ACGATCTCAG TTATTGGTCC GACTGCTAGA AATTTCGCCA TACTCTAATC TTTATCCTAT TCTCAAATTT CGTTTTGAAG ATGACCCCCG CATACGAACG AGAAGAAACA ATGAACTCAC CCTATTCGTC GTCCCACTAT CTCGGTGTGA GCGAGGGTGG AAGTAGCGGG GATTCACAAA TAAACAACGT CAGGTCTGAT GCCGCTCAGA CAAGAAATCC ACTTCCTCGA CAAGAGACCA ATGCGACCTT GCCTCCTTAC GCTAGCTCGA CCGCAATCCC TGCCTACATC TCCAACAGCT GCCAAGAAGC ATGTTCAGTG GGCCGCGCAA AAGGCTATTT GCTCTTGCTT TCGGCTTTTT CTAAAGTACT CGATCAAGTC CGCAGGACGA GCTCGTGTAT TAGCTTGGAA CATTCCGTGA GCGAGGTTTC ACCCGTACAA GCTGTAAACG GCTCTACATC ATGGTCGTCA GATTGCGAAG GACCGTTGTC TGAAGAATGC AGGGTGGCTT TGTTTGTGGA AATGGCAGTC CGAAGGTTTG GCTACTGGCA GGAGCGGCTG CTAGCACTGA AAAGTACAGC ATTACCCCCT CTGGATGTCC TACTAGTGTG GGCGACTTAT TTGGGTTCCC CCATTTGGTA AGTAGTCTTA TCTTATGTTT CTTTCCTTAT GATTTTGACC CTGCCCCTCT TCAGGTATCA TGATGACAGC GTTCAGGGGG GTACCACCCA TCCTCAATCT GCATTCCCTG ATTTTCCACT TGAATCAGTT GTAAGTGAAC TCAGTCTATA CGCAGGTGCT CACACATCAA TTTGGATAAC AGATTGCATG CATCGATCCT GACACCCTAG ATTATCAACC GGTGGATCCT GGAGAGTGGA AAGCGGCTAC TGGAACACCC TTCGATCCGA TTGTTCACTT TGAATCATCA ACCAGTGCGG CCGTAAATTG TCCCGGGTGC CGAACAAAAT TTTCGTGGCG TGAGTAGGGC TGCAAAGTCG CGATAAACAC CGCAACTAAC AGGTTCAACC AACAGCTTGG ATCTTAGAGG GAGGGAAGGG ATATGCACAA TGCCTTTTTG TCGCCGAGTG TCCCACAACA AACTGTCGAC TACGAATTGA CAAAGAGGCT ATGGGAGTTG GACGTCTTGC ACGTCGGATC GCAGACATGT ATGATAGGTC TAATTGCCAA CTTGCGTGAG TTCTCTGCCA GATATTGAAC CCAACTCACC ATTGACGGGC CAGCACTCGG ATACTCACCG GTGGGCTGTT AAATCATCCG CTAGATATAC CTTTTAAGTC CATACAAACG TCAGAAGACA AGTCTAAAAG TTTATGTAAA CACTGGCGGT GGCGTCGATC GTACGCATTT GCATTCTTAG TGGAACAGCT ACCTCGCGTA GACAAGAGAG AAATCAGTAG ATTGTCAAGC GCCTTTGTGG GAGCTGAGGA AGCGAGCATT GATCTTGCAC CCGCGGTGAG ATCTGGCATT CCTCGAAATA ACAGTAGAAA GGACAGTAGC TGATAAAACT CGCCGGTGCC AGACCTTTCG CCATATCGGT TTCATCGATA CGCTCAAGCA CCTTGGATGG TTGAATCCTT CAACATGGGT TGATAGGTCA GAGGCTTTGT ACACGGCAAG GTTGCGTTAC GCCAAGTGAG TCAAGTCAAC CTGGAATCTG CAACGCATTA TCATGAAATA ACATTGATCC TCCAGATTTA TGAACCTCGC TCGATCCCCA AATCTCATAC CAGTCCCGAC ACTAGGCATC GAGATCATCT GGCAAACCCA CATGCTAGCC TCCACTACCT ACCGGTCAGT GAACGCATTA TATGCGCGTT GCTGGAGAGG CTGATCATAT TACATAGTCG GGAAACGCAA ATGCTTGTCG GCCGCGTTGT CGATCGTGAT GAAGCCGTAG AAGAGTCCGT CTTGGCGGAA GCTTTCACAG AAACCGCTGA AAAATGGAAG GTGAGGGTTT TGTCTGATGG CCATAGCTAA ATCTACCCAG GCCTCTTTTC ACGTCCCTTA TACCACATCC GGCATGCTCT TACCTCTGCC GCTATCAGGC CTTGCCGCGA AAGTAGCGTT GAAGCTGCAT TGGCGACCTC GTCCTCGGCG ACTATCCCTT GATAGGAAGG TCGATTATGA CAATCCCATC GTCTATGCAG TTGTCGATTC TACACTCTCT CTGCCTTGTG AAAGGAATTC AATTGCGTTA ATTGACGACG ATAAGGCGCG TACGAAGAGG GCACTCAGAC GGGAGGAATA CGAATTGTAC AGAGTCGCAG CGCTTCAATG GGCTGATAAG GGCAAGGTCG ACCCGGCATT GGCGGAAGGT CTAAGAAATC TCACGCCAGC ATTTATGACG ACCACGCTCG AGGATGCAAA GTGCGGGCCT GGAGGATTTC CTTGTTCGCC ATCATATTTG ACAATTTCGG GCAATCATAT TAGCGGCAAA CTTATTTCGT CCACCTCAAA TGGCATGGGT ATTGTAGGGT GAGTATGGTT TCTGTTCTTT CCCTACATGC AGGATCGCGA AGCTAATAGT GATAAACTGC AGCGCGGGGT TGTGGAGAGG GAACGATGCT GAGCTGGCCA AAATGCAACC ATCAACGTCA TTGTAGGGGT ACCCAAACAG GCTGGAAGAC TGATCATCCA CATCCGCCTA ACACGGACGC TTCGTTCATA TCAGCTTGAA GAATAGTAGC ATGGTCCCGG CTCATGAGTA CATATTGGTT GCACGTCTGC GGATTTTACT TTTAGATTTT CTTAAGGGTC GGTTGCAGAG GTTGTGGAAA AGAAACTACA GGAGTGTTTG CACGGTTTTC TTCAAAGCCT GGGTTTCAGG TATCCCAAGG GGTTCTGCAG CTTTTCTCCA GC
|
Protein sequence | MTPAYEREET MNSPYSSSHY LGVSEGGSSG DSQINNVRSD AAQTRNPLPR QETNATLPPY ASSTAIPAYI SNSCQEACSV GRAKGYLLLL SAFSKVLDQV RRTSSCISLE HSVSEVSPVQ AVNGSTSWSS DCEGPLSEEC RVALFVEMAV RRFGYWQERL LALKSTALPP LDVLLVWATY LGSPIWYHDD SVQGGTTHPQ SAFPDFPLES VIACIDPDTL DYQPVDPGEW KAATGTPFDP IVHFESSTSA AVNCPGCRTK FSWPWILEGG KGYAQCLFVA ECPTTNCRLR IDKEAMGVGR LARRIADMYD RSNCQLATRI LTGGLLNHPL DIPFKSIQTS EDKSKSLCKH WRWHKREISR LSSAFVGAEE ASIDLAPATF RHIGFIDTLK HLGWLNPSTW VDRSEALYTA RLRYAKFMNL ARSPNLIPVP TLGIEIIWQT HMLASTTYRR ETQMLVGRVV DRDEAVEESV LAEAFTETAE KWKASFHVPY TTSGMLLPLP LSGLAAKVAL KLHWRPRPRR LSLDRKVDYD NPIVYAVVDS TLSLPCERNS IALIDDDKAR TKRALRREEY ELYRVAALQW ADKGKVDPAL AEGLRNLTPA FMTTTLEDAK CGPGGFPCSP SYLTISGNHI SGKLISSTSN GMGIVGAGLW RGNDAELAKM QPSTSL
|
| |