Gene Information |
Locus tag | CND05810 |
Symbol | |
ID | 3257170 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1580090 |
End bp | 1583287 |
Gene Length | 3198 bp |
Protein Length | 851 aa |
Translation table | |
GC content | 48% |
IMG OID | 638256519 |
Product | conserved hypothetical protein |
Protein accession | XP_570618 |
Protein GI | 58266924 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.109949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTCTA ACAAACATCT TGCTACGGGC GATTCTCATG GTTGGCCCAT CCAGAGCGCA TCGTGAGCAT TCTCTTGTTC AATTTCAACG TAGACTTACT TGGCGAAGGC CTTCTATATT GCACAGAGGC ATGACCCCGA CGACATCGTT ACCACCTCCC ACAAATCTGG CTCCTAGGCC TCTGACTGAA AGAGAAAGTC ACTTAGTGAA GCACCTGAGC AGATTGCAGT TCGTGAGTAA ACACTATGGC TGAGGAAACT TTTAACCATA ACCTACCCAG TTTCTTGCCA CAGCTCCTAC GAGGTGGATG GGAACGGGAG AGCGCACGGT GAGGCTAATG CGGCGTTTAA ATCGTTTTGG AGAGTTGGCT CACAGCAGAC AGAACTCGTC ATTCCAGGAT AATTCACTTT CCTCTCCTCA TCCCAACTTG AATCGTTTCC TCTTGCCAAA CGGCGAGTAT GTAGCTTGTG TCTTCTGGAA TGGGCTGTAT CACATTACTG GTACCGACAT AAGTGAGCTC AACCGAGCGA ATCAACGAGA ACAACAACTA ACTAGGGATT AGTCAGAGCA TTGGTCTTTC GTTTTAAGGC CTTTTCCAGG CCTGTACGGA ATATGAAAAA GTATGCACGA TTCAGAGTAG CAGCGCTCCG ACTGATTTCA AGCTATCAGA TTCGAAGAAG GCGTATTCTC TGATCTGAGG AATCTCAAAC CGGGGACAGA TGCTTGTCTT GAGGAACCTA AGTGAGTGTA ATGGTTTTCG GGAGAAAATA ATTGGCGCTG ATATAGTTGG CAGGTCACCT TTCCTTGACC TTTTGTTCCG CAATGGCTGT ATTAGGACGC AAAAAAAGGC AAGTCTTACT TCTCGATAAC ATCAGTTTCG AACTAACACC TAGTTTTAGC AAAAGGTGAG GTGTCCGTTC ATTGATGATA GTACTTTTTT GTTTACGAGA CATGTAATCA ATCTATTCAT AGGTATTCTA CTGGTTAGTC GTACCCACAG AGGACTACGT AGCGAATTAA CGTTTTCCGC AGGTTCTCGG TGCCTCATGA TCGATTGTTT CTAGACGCCC TGGAGAGAGA TCTCAAAAGA GAGAAAATGG GGTTGGAACC AACGACGATC GTAGTGGGTG AGCCAGCACG GAGTTTTAGG TAAGTAAGGG AAGTCTTATG ACACGAACTA TGCCATTGAC CTTGTATTGC CAGGTATGAT CCGAGGCGAA GCCTGTTCGA ACAGTTCGCG GGCAAACAAC CTGGCTTAGA AGAATCTGTC AACTCAGTAA GCCTTGGTAT TGGGTATTGC ATGTTTTCTC ATTTGACGAT ATCTTTTCTC GCAGAGCACT CGTGATACCC CAGCAGAGCA GTTAGCGAAT CATGATCAAG GTGCAACGGT GCCCAAAACA CCCCTTATTT TGCCTCCTAA TAATCTTATT TCGTCATCTT CCTTGCCATC TTCTATTATC CAGAACAACA GCATCATTGT TAGCACCCCA CCGTCGCCCG AAAATATCCT GAACAACACT GAAGATCAAT TGAGATGCCT AACGACAGCC AGCAGCGTGT TGTTTCCACG CTCTTTGTTG AAGGGAAGTC CTGCCTATAA ACAGGGGAGA AGAAAGGCCT TGCGAGATAA CAAGCAGAGC ACTCGGCAGG ACAGCGTCAC AATCTCCGAT TGCGACACCG GAGATGATGA AAGTGGGTCT GAAAGCGAAA GAGCAGGCCG AGTCGAAAGA AATGGTTTCG AGAGGGATCT AGCCTTAACA TCACAATCCT CAGCACCTAC CCTTAATCTA TCTGAAGATA 
GTCATGCGAC GATGGCGATC ACCGCGTCCG AGTCGACCAG GTATATCTCA TTTCCCTCCT TACAGTCAAC CTCGGGAACC ACCCTTTCGA GCCACGAACT GCCGTCTCAG ATGTTATCGC GGCAGCCGTG GGCAACCAAC ATTATACCTA CCACTGCTGC TGCGACCGTC GAACAGTTAT CTGCCAATTA TGATCAGTTG TCTGCCGCAA ATGCATTCAA CATGGCTCCA AGGTTGTCCT TGCAGCCTCT CGGGCCTCTT ATTCCGGGCA GACCAGTCTT GCAGAACTCG ACAGTAAAAG GGTTCAGCTG TCCTCTTCTT TCGTGTGGCA GACTATTCAA GCGGCTTGAG CATCTCAAGC GCCATGTTCG AACACACACC CAGGAAAGAC CCTACGAATG CTCTCGATGT GCCAAACGAT TCAGCCGCAG CGATAATCTT ACACAGCATT ACAAAACGCA TGAGAGGCAA GATCGTGGAG GGAGAGATAG GAGTGAACGC ATGAAAACGG AGGCTTCAGA AGGCGCGGAT GATGATATAA CCACGTACCT CGAAGCTCAA GTGGATGCCA TGGCGGGAAG TGCCCATGTA TATGCGACGG CAACAGATAG TTTCGTGATA GGGGAAGAGT CCAAATCGAG TGCGGTCAGC ACGGAGGTAT CTCGTAGGTT TTATGCAAAC AGCAGGCATC GCTTGTAGCT GATACATGGT TGTAGCACCC AATCGCGGTA ACCAGACATC TATTTCCCTG TCTTCAGCTT CTTTGCTCGG CAGCCACGTT ACCAATGATA TTTCAGCGCA TTTCCTACCA GCCGGCGCGC CCCATGGTTT ACCAGTTGAC GTTGAATGGC CAAGACTGGG AGTACCACTT AATGCGATTT CCGTCGGAGA AGCAACCCAT GATACCTTGA ACCTAATAAA ACGTCATCGA TCAATGACAC CCAGCCTTCC TCCGTCGGGC CGCATTATTG GGTCGACCAG AGCTTTACAC TCATCACCAT ATCGCAATAA CCCCCTTTAT AAACCATATA ATCCATATAA CCCATATTCG ACCAATGCAG CCATACCAAA TGGTCATTCT TTTACACGTG CTGCATCTTT GGATCCATCG GTATTCCAAG ATCGAGCTGC GGTTCTCAGC CATTTTGTAA GCAGCAAACA GCATACGCCA TCGCAGACCA TCCCAGCTAT GTGTTCTGAT CGTAACCTGT ATACTACTCC GGAGGACCTC CCTGCACATC CCGTAGGGGT TGATGCTCGG GAGAACGCAA TCTTTATAAC CGGGGGGGGC GAAAGTAGAC TGGCCGCCTC TTCAGGACAC CAAACTGATT ATCAGCTATC GAATGGGCTG GGCGAGGCTA GTTGGGAGAT GAGGATGAGA AATGATTCCG TACCTTAG
|
Protein sequence | MGSNKHLATG DSHGWPIQSA SPSILHRGMT PTTSLPPPTN LAPRPLTERE SHLVKHLSRL QFFLATAPTR WMGTGERTNS SFQDNSLSSP HPNLNRFLLP NGEYVACVFW NGLYHITGTD IIRALVFRFK AFSRPVRNMK KFEEGVFSDL RNLKPGTDAC LEEPKSPFLD LLFRNGCIRT QKKARFSVPH DRLFLDALER DLKREKMGLE PTTIVVGEPA RSFRYDPRRS LFEQFAGKQP GLEESVNSST RDTPAEQLAN HDQGATVPKT PLILPPNNLI SSSSLPSSII QNNSIIVSTP PSPENILNNT EDQLRCLTTA SSVLFPRSLL KGSPAYKQGR RKALRDNKQS TRQDSVTISD CDTGDDESGS ESERAGRVER NGFERDLALT SQSSAPTLNL SEDSHATMAI TASESTRYIS FPSLQSTSGT TLSSHELPSQ MLSRQPWATN IIPTTAAATV EQLSANYDQL SAANAFNMAP RLSLQPLGPL IPGRPVLQNS TVKGFSCPLL SCGRLFKRLE HLKRHVRTHT QERPYECSRC AKRFSRSDNL TQHYKTHERQ DRGGRDRSER MKTEASEGAD DDITTYLEAQ VDAMAGSAHV YATATDSFVI GEESKSSAVS TEVSPPNRGN QTSISLSSAS LLGSHVTNDI SAHFLPAGAP HGLPVDVEWP RLGVPLNAIS VGEATHDTLN LIKRHRSMTP SLPPSGRIIG STRALHSSPY RNNPLYKPYN PYNPYSTNAA IPNGHSFTRA ASLDPSVFQD RAAVLSHFVS SKQHTPSQTI PAMCSDRNLY TTPEDLPAHP VGVDARENAI FITGGGESRL AASSGHQTDY QLSNGLGEAS WEMRMRNDSV P
|
| |
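The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the helper names (`gene_length`, `clean_sequence`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the stated gene length of 3198 bp. The coding-length estimate further assumes that the gene sequence includes the stop codon (it ends in TAG) and that the remaining bases are intronic, as suggested by "Is gene spliced | Yes"; the record itself does not list intron coordinates.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Strip the spaces and line breaks that split a sequence into 10-letter blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Values copied from the record above.
start_bp, end_bp = 1580090, 1583287
protein_aa = 851

length = gene_length(start_bp, end_bp)
# Assumption: one codon per residue plus one stop codon.
coding_bp = (protein_aa + 1) * 3
# Assumption: everything that is not coding is intronic.
non_coding_bp = length - coding_bp

print(f"gene length: {length} bp")
print(f"coding length: {coding_bp} bp")
print(f"non-coding (presumed intronic): {non_coding_bp} bp")
```

To check the GC content field, pass the full Gene sequence value through `clean_sequence` and then `gc_percent`; the result should round to the 48% listed in the record if the sequence was copied intact.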