Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF01930 |
Symbol | |
ID | 3258260 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 568883 |
End bp | 571977 |
Gene Length | 3095 bp |
Protein Length | 907 aa |
Translation table | |
GC content | 50% |
IMG OID | 638257318 |
Product | conserved hypothetical protein |
Protein accession | XP_571685 |
Protein GI | 58269058 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.386056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTAT CAACGTCAAC CTTACTCGAG CGTCGCCGAT CATCACTTTC TCCCCCTTCC TCTCCCCCAG CCGAACGAAG CAAAACACAG CCCTCAGTCA CTCCCGTCCC TTCTGCAACT CTTTCAGTTT ATCCCTCCCC CTCATCCATG AGCCTTCCCA CTGTCCCTCT CAAGGACGTT CCTGCTGATC AGTCTCAAGA AGGAGGATTA CAACCTTCAT CTCAACGCTA TCACCCCCCT CCTCTTCTCC CAAGGAGAGA GTCATTCCAT TGCTCTAATA ACAGACCCCA TCCACCGCCT TTTCGTACTG CGCTGAGTTC ATTAGGGCCG ATGACGACCT CCAATCTCAT ACTTGGTACT CCCACAGAAG AGAAGGCAAG CCCAAAATTC AGTATTAAAA GGGGTAAAGA AGAAAAAGTG GACAAAGAAG AGTTGACCAC CAAAGGAAGA AAAAGAAAGA GATTAGCCAA GGCGTGTAGT GCCTGTCATG TAAGTTGTTT TTCTATAGTG GTCAACCGTA TTCAGCTAAA TTAGAATCTA TCATTAGAAA AACAAGAGAA GGTGTGATGG TTTTGCCCCC TGTTCCAACT GCGAGTTTTC CAATAGACCA TGTCAATATC TCAATGCGCA AGGGGAGCCC ATTCCGCCAC CACGGACCCG CGACCCTTCC AACAACACCT CGAGCAAGGG CAAAGACGAC TGCAAAGCTA GCAGTGCCGA TGGGAGTGAC ATCGCCAGTC AGGAGGATAG GCGAGAAAGT GGGGAGAGCA ACCAGTCGCT GCGAGATGGA CAAATGGACC CTGAATCCGA AGTGGACAGA AAGCCATCCA TTGGTCCACT CCAGGTTGTA GACATGGATG TCTCACTGGG CGCTGAGCTT GTGGACAGTG AGTGAGGCTG AACAAGTCAA GTGTTTATAT ACTGATTCTG AATTATTAGT CTTCTTCAAG CGTTGCTTGC CTTTACCATT CATGCTACAC GCGCCAACTT TCAATTACCG CCTTTATCTC AACCAGGTCT CGCCCATCTT GCTCGATTCC ATGTACGCGT TTGCCGCCCG ACTATGCGAA AACCCGTTTT TTCTACAAAC ATTTCCCCCA AATCACCCCC CCCATTTGCG GGGTGAACTC TTCGCACTTC GTGCTCATCG TAGCGCAGAA AACTTGATCC AGCAGCGCAA CATATGGAGT GAAGAGACTC GCCGGGCCGA CCGAGGGTCA TGGCAAGAGA CGGAGTTAGC TCAAGCAGCC TATCTGTTGA GCGTCTACTT CACTTGTCTC CGTGAACCTA AGCTTGGTCT TTTCTATCTT GATGCTGGGG TCGATATTCT TCGTCCATCA CCCGCGACTT ATATACAACC GCCAGCCGCG CCGACAGGAG CGAGTCCTAT AGAGTATACA ACTCACATGG AGTGCCGAAC CCGCACATTC TGGGCCTTTG TATTGCATGA CCTGTGCGCG GCCTCCAATG GTAGGCCAAG GAAACTTGGA GAGGTAGATT TGGGAGCAAT TCCTTTACCA GGAACAGAAG CCCATTGGGC CAGATGGGGT GGTGGAGGCA TCGGGGGGAG AGAGCCCGGC AGAAGGGATG GCTTGATCGC GGGCACGGGG AATTGGCTCG GCGAAGAGGG AGCTGTAGGG GAGATAGGTA ACGTCATTCG GATTGTAAGT ACATCGTGAT GGGAAACGCA TAGATACTCA CTTCATCTCT TAGCTTTCCA TATTGGCAGA CATTATGTCT CTTGCGACTG ATCCTAATGC TGGTGACTCC AAACAGACTC TTGCTGCCAG ACTTGAGGCC GCTCTCAAGG CCTGGGCGAT GGCCTTACCC TCGCACATGC ACTTCAACGA GCCAAACCTG ACCATGGCCG TCTCCAAACT GTCATCTCCA GTGGCCGAAA TTAAGACGTC AGGATGGATG TATGCCTACA TGCATGCTGT CGCCGAGTGT GGGATGTTTT ACCTTCAGGC AGCTGTGGCA CCGGTCAGCG ACGGCGTGTT CACGGCTAGG AGGCAAAGTC AAGCAATTGA AAATCTAATC GTCATCATGG ACGCCATCAA TCAAACAGGC CGCGAAGGGT TTAGCTGTAA GTCTCCGTTT CATATGCAGA GGTTGCGATG ACTTAATTCC GGATTACAGT CTTATTCCCC CTACTCGTCA TTTCTAACTG GCAGGAGCAC CTTGAGAAAT CGGACCTTCT TGTCAGAGAC GTAAAACATC ATCTGACTGA GGAGCGTCTC AACCATTGGT GGTCGGAAAT GGCTCGGGAA TGGGGTGTCG AACGACATGA CGTTCTCAGG CGTGGATTTT ATATTCTGCC CATATCCCCT GTTGTACCAC AACGAAAATA TCGTTATTCG CAGTCTTCGC ATCCCGACCG CCCTTTCCTG GCAACTTCAA GTTTGGGACT GTACCAAGCG TCGCCTCCGA ACAGGATTTC TTTGGAATCT GCTGCTACTA CTTCACCTAC ATCAACAGCG GCAATTCCTG TTACCACTCC CAATTTCAAT CGTACGTCTC GCTTCAACCT TCCCACCTTG CCACCTCTTC GGCCCCGCGC AACCTCTGGT GCGAGTGTCT TGTCCAACTA TTATGGCCGC TCGCCTTCTC CTCCGCTCCA TCTTCCCTCT GTTGCATCAG CATTATCAGA TCGAGGAAGT GACAAGGAGC ACGAGCGATT TTCCCTCCCG TCGATCTCCT CAGAGCTACG TTATCGTGAG CCTTCAAGCC CCCGTCACCC TCTATCCCAG AGTATTAGTT TAAGGTATAT CCCGAAGCAG CACCCTTATG TGCGCGAAAA ACCTCGATCA CCCAGAAGGA GCATAAGTGA GAGGGATATG CGAGACGGGG ACCATATTAC CGGGATTGCG GCATTAGTAA CAGCTGCCGA GCGAGAAAGA GAAAGAGAGT CAGGAGGGCA GAATATTCGC TCGTAAAGTT GAAAAGGTTC TTTGTCTTGG GCTCTTCGAA TTAACCATGT GGGAGATATT GTAATTTAGC CTGTGATTTG TTGAAATTAA AACGCAAAGA TATGTAAAAC CAGCTATAAC ATTTGGAGAG GAGCCAAAAA TATGATGATG GATGTGTCAA CGACT
|
Protein sequence | MSLSTSTLLE RRRSSLSPPS SPPAERSKTQ PSVTPVPSAT LSVYPSPSSM SLPTVPLKDV PADQSQEGGL QPSSQRYHPP PLLPRRESFH CSNNRPHPPP FRTALSSLGP MTTSNLILGT PTEEKASPKF SIKRGKEEKV DKEELTTKGR KRKRLAKACS ACHKNKRRCD GFAPCSNCEF SNRPCQYLNA QGEPIPPPRT RDPSNNTSSK GKDDCKASSA DGSDIASQED RRESGESNQS LRDGQMDPES EVDRKPSIGP LQVVDMDVSL GAELVDIFFK RCLPLPFMLH APTFNYRLYL NQVSPILLDS MYAFAARLCE NPFFLQTFPP NHPPHLRGEL FALRAHRSAE NLIQQRNIWS EETRRADRGS WQETELAQAA YLLSVYFTCL REPKLGLFYL DAGVDILRPS PATYIQPPAA PTGASPIEYT THMECRTRTF WAFVLHDLCA ASNGRPRKLG EVDLGAIPLP GTEAHWARWG GGGIGGREPG RRDGLIAGTG NWLGEEGAVG EIGNVIRILS ILADIMSLAT DPNAGDSKQT LAARLEAALK AWAMALPSHM HFNEPNLTMA VSKLSSPVAE IKTSGWMYAY MHAVAECGMF YLQAAVAPVS DGVFTARRQS QAIENLIVIM DAINQTGREG FSFLFPLLVI SNWQEHLEKS DLLVRDVKHH LTEERLNHWW SEMAREWGVE RHDVLRRGFY ILPISPVVPQ RKYRYSQSSH PDRPFLATSS LGLYQASPPN RISLESAATT SPTSTAAIPV TTPNFNRTSR FNLPTLPPLR PRATSGASVL SNYYGRSPSP PLHLPSVASA LSDRGSDKEH ERFSLPSISS ELRYREPSSP RHPLSQSISL RYIPKQHPYV REKPRSPRRS ISERDMRDGD HITGIAALVT AAERERERES GGQNIRS
|
| |