Gene Information |
Locus tag | CNB01350 |
Symbol | |
ID | 3255863 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 403989 |
End bp | 406985 |
Gene Length | 2997 bp |
Protein Length | 858 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254785 |
Product | hypothetical protein |
Protein accession | XP_568807 |
Protein GI | 58262794 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
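The length fields in the table above can be cross-checked with simple coordinate arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the protein length does not count the stop codon. The function names are illustrative and are not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the Gene Information table.
gene_len = span_length(403989, 406985)
cds_len = coding_length(858)

# Bases in the gene span that are not needed to encode the protein.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span equals the listed gene length of 2997 bp. The 858-residue protein accounts for 2577 bp of it, and the remaining 420 bp is consistent with the gene being marked as spliced.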
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGTTT CTGGTCTTTG GGACGTGAGT CAATTTACTC CATTTTACTC GTTCTTAAGG CATTAGGTAC TGATATAAAT TAGCTTCTGA GACCAAGTGC GGCGAGCGTT ACGCTACATA CGCTCTCCAA AGAAGCCTTT TTGGAAAATA AGAACGGCCT GAGAGCACTC ACAGTTGGTA TAGATGCTTC GTAAGTGGCT CTTGTGGTAA ATGGAAGTCC TAGCTTATTG TACCTTGTGA CAGAATTTGG ATCTTTCATG CGGCTGTACC TCAACATGGT GAAAACCCTT TCCTAAGGAC CATATTTTTC AAGATCACAG CATTACTCCA ACATCCTGTA CTGCCAGTAT TTGTTTTTGG TTGGTGAATT TGGTCCTGAC AGGATACGAT AACTAATTAC CAATTAGATG GTCCCAACAA GCCTGCGATG AAAAGAAATC AGAAGGTCGG GGGAAAATTC GGAACCCATG ATTACCGAAG CAAACAGTTT AAGGCCTTAC TTGACACCTG TGGTCTTGAA TGGTGGAACG TGAGTAACGG CGATTTGGAA TTAATCGTCC AGCTGAAGGT ATGACAGGCG CCGGGAGAGG CGGAAGCAGA GTTGGCTGTA ATGAATCGGC AAGGAAAGAT AGATGCCATT TTGTCCGATG ACGGAGATGC TCTTCTATTT GGAGCGAAAT GTCTCATCAG GAAGTAAGCT TTTGGATGGG TTCTATACCT TGCATTCGTG CTGACAGCTA GCCAGTTCTT CCCCGACTCT CTCAGGATCG CTGGCTTCTT CAACGAAGAA TAATCCATCT GCCGGCTCTA AACGTGATTA CGACGTATAT ACACTATCCC GGATCTGTGG AGAATGGGCA AAAGAGCAGG ACACCGAACT GACATCTGAA GAAAGCTGCA CAATGGCAAT GGTATGGATT GCCCTTTTAA GTGGCGGAGA CTACACGCCC GAAGGACTCT ACAGTATCGG TGAGTGTCTC GAACTCAATG CGATAATATG TTTACTGACC TAATTCAAAT ACAGGACATA AAATATCCTA CGGTCTTGCC AAAGCTGGGC TTTCTGACTA CTTGAAAGAA TACTGCCGTG ACAAGCAAGC TTTCCTGAAG TCTTTGCCCG GGCTTCACGC TCGTATGGTG GAAGAACTCC GGACAAATTC TTCTAAACAG CAGGACAAGC GTTACCCTGA TCGCTCTAAC AAGCTCTCAG CAATGTCCCC TTCGCAGCTG TTTCCAACGT CCACTTTGGA CGCTTATCTC AGCCCATGCA CTAGCCCTTT GGACGACCCT TCTCAAGGAT GGCCTGGTTT CGGACAGGGA AGCTGTTCCA TGGCCAGAGG AAAGGCAAGG AGTGAAGGGA GAGGCGATAT GGAGGGCATG GCAGCAGCAT GTGAGAAATA TTTTGAATGG GGAACCAAGG ATCTTGTGTG CAAGAAATTT GCAGGAGAGT CTGTGGGTAT CTTTGGAGCA GAAATTATGA ACGCCGCTCG AGAGGCGGTA CGCGCTAGGG ATAGCTTGGG TCTAGGCGTT GGCATAGGTC CAGAGAAGAC GCCTTCGAGA ATTACTTCCT TCTTTCAACA GTCTGTCCCA TCTCCTATAT CGTCCAAATC GACCGGAGCT TCTCAGTTTC CTAACCCGGC AACTCAGCTG CGCGATACCG TTCCACCTCA TATCGTCCAA ATCCACTCAG AGAGGACAAG TAAAGATGGA ACGGAGAAAG ACTACCGCAT CTCATTTCAC CAGGATGTGT ATGTTGAACG TTGCCGCAAT GCCATGCTTG GCATACGAGT CGACCCTAGC 
GAACTTCCTC AAGAAGAAAA GAACAGACTA GGACTCGCCG ACCATGCTGA TAAAGACAAC GATGATAAAG TGTCCGCAAC TCAAACGGCC TCCAAGTCTG AAATCAGAGT CTGGCTTCCT CAATACCTCG TACGAGAGGC ATGGCCGGAA CTAGTCAAAG CTTACGATGA TAAGCTGGCT GCCAAAACGG CCAGTAAGTT GAAAACGCCT AAAAAGGATG TCCAGCCTTT GAAAACAAAT GCCGCTGGTA AGGCAAAGAG GGGAAGGGGG AAGAAGGCGC TTGAAGCAGA CGGGGAGGAT GTAAATGCGT TCACCTCCTT CTTCAGTCAA CGCCCCAAAG AATCAACACT AGATGCTTTT GAGGAGGAAG AAGAGCAAAT TGAGCCAACG CCACCGCAAA GTAAGGCTAC GCAAGAGGTC ATCGATCTCA GCCTGTCGCC TTCCCCTTCA CCACCTGCAT CTCCCACGCA AAAGTCCTCA AATAATGCCG AGAAAAAGAA GCTAGCACGA CCAGCTCTTA ACACATCTAC CTCCAGTACA CCGTCCGGTG GAGAGAAAGG TGAAGGCTCC AGTCGAACTG CCCGGCGTGC TCGTAAAAAA ATCCACAAAT CCACTTCACC TTTAAAAACG ATCACCGACC TCTCCAAGTC TTCCCCTTCT CCTGAGCAGT CCACGGCAAC TCCCACTATC ACCCCACGGT TCTCCAGTCG AACGTTCAGA AAGACTATAT CCTCGCCATC TGCCTTCCCA CCACGCCGGG TGCAGGCTCA GGAAGTTATC GATCTCTGCT CATCGAGTGA AGAGGATACA GCTCCAGCGA AGCCCATTCA TCGTAGACAA GCCGCCAGGT CGCCTAAAAT CTCATCCCCG CCATCTGGTA TTCTTTCTAG TTCCACTAAG ATCAATACCC AATCATCTCG TTCATCTCGC TCTACCTCTG TCTCATTTCC TGCCTGTCCT TTGAACAAAC CAAGCTCACG CTCGCCACGG GAAAGTTCGA TACTCAGCCA CCCTCTCCTT TCTGGGTCTA GCCAATCATC CTCCCTATCG CCTCCCCCGC CTACACCTCC TAATACACAA AGAGCAACAT CCCCAAAGAA AACAAAAGTA TCATCCCCGG CTAGGCGACG GCCAAAGTAC AAAATTATCA GCTCGACAAA AGACGGCGAA GTCATTGATT GTACGATGCG ACGATAA
|
Protein sequence | MGVSGLWDLL RPSAASVTLH TLSKEAFLEN KNGLRALTVG IDASIWIFHA AVPQHGENPF LRTIFFKITA LLQHPVLPVF VFDGPNKPAM KRNQKVGGKF GTHDYRSKQF KALLDTCGLE WWNELASSSP TLSGSLASST KNNPSAGSKR DYDVYTLSRI CGEWAKEQDT ELTSEESCTM AMVWIALLSG GDYTPEGLYS IGHKISYGLA KAGLSDYLKE YCRDKQAFLK SLPGLHARMV EELRTNSSKQ QDKRYPDRSN KLSAMSPSQL FPTSTLDAYL SPCTSPLDDP SQGWPGFGQG SCSMARGKAR SEGRGDMEGM AAACEKYFEW GTKDLVCKKF AGESVGIFGA EIMNAAREAV RARDSLGLGV GIGPEKTPSR ITSFFQQSVP SPISSKSTGA SQFPNPATQL RDTVPPHIVQ IHSERTSKDG TEKDYRISFH QDVYVERCRN AMLGIRVDPS ELPQEEKNRL GLADHADKDN DDKVSATQTA SKSEIRVWLP QYLVREAWPE LVKAYDDKLA AKTASKLKTP KKDVQPLKTN AAGKAKRGRG KKALEADGED VNAFTSFFSQ RPKESTLDAF EEEEEQIEPT PPQSKATQEV IDLSLSPSPS PPASPTQKSS NNAEKKKLAR PALNTSTSST PSGGEKGEGS SRTARRARKK IHKSTSPLKT ITDLSKSSPS PEQSTATPTI TPRFSSRTFR KTISSPSAFP PRRVQAQEVI DLCSSSEEDT APAKPIHRRQ AARSPKISSP PSGILSSSTK INTQSSRSSR STSVSFPACP LNKPSSRSPR ESSILSHPLL SGSSQSSSLS PPPPTPPNTQ RATSPKKTKV SSPARRRPKY KIISSTKDGE VIDCTMRR
|
| |
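The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for stripping that layout and computing GC content is shown below. The helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, so its result is not the gene-wide GC content listed in the table.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace and upper-case a sequence shown in spaced blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


# First two blocks of the gene sequence, copied as displayed.
example = "ATGGGTGTTT CTGGTCTTTG"

print(len(normalize(example)), gc_percent(example))
```

Passing the full gene sequence cell to `gc_percent` would give a figure to compare with the GC content field, and `len(normalize(...))` would give one to compare with the gene length field.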