Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND02010 |
Symbol | |
ID | 3257103 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | - |
Start bp | 540446 |
End bp | 543535 |
Gene Length | 3090 bp |
Protein Length | 785 aa |
Translation table | |
GC content | 49% |
IMG OID | 638256135 |
Product | expressed protein |
Protein accession | XP_570230 |
Protein GI | 58266148 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTCA CTTCAACCCA TTCACAAAAT TCAATCAGTC CTGGCGCAGT CCCAGACGCC CCAGGAGCAC ATTCGGCGCA ACCGCAAACA GAAGCTGCTG GGGTTGACCT CCGCGTCCGA TTTGCCCGCG CTTGTGATCG TTGCCGGTGG GTGATATCTA TCGTACACTA TTGCAAACTA ACGGGGCCGA CAGACACAGG AAAATAAAGG TTAGTAATGT GATATTTAGC AGGCATTAAT CCCCGCAGTG CGATAATGAA CACCCATGTG GAAAATGTGC GGCAGCTGGT GTTACCTGTA CTTCGGGAAT GAGCAGAGCG ACTGGTGTAA GCAGACGGCG AGCTTCTCGA ACAAGCTTTT CGGACACGCG GTGGGTATCG AACATAATTT AGACGTGATC TGACCTTGGA GTCTCAGCAA CCATCATGAA GCTTTCATTG AGAAAGAAGG GTCACCTTCC AACAGAGAAC ATCGATCTAA CAACAAAGGA CGCAAACGGA GAAGAGACTC TGAGGAAACA GATAAGATAG GAGCAACTGG CAACTCCAGT GAGTAAAGAG TCCGAAATTT CCATATTCGC CTGATCTTTC CCTATTACAG AAACGTCAAA TGTCCTTGAT CACCTCGATA CGGCGCACTT CTTCTTGTCG CAAGAAGGAA TGACCCGTTT CGCCGGCTCT ACTTCGGGCC TTCCAGTCCT GTGAGGAGGA TCTACCAGTT AGCTAGCTAT AACAGTCAAT AACATCACCT GTAGCGAAGC CACCCGACGT ATGCTGCAGA AAATACCTGA TCCCGTCAAC GACGCGCAAA GTCCCATGGG TGAACCAAAT TGGTCATGGC TCTCTTCATT ACTAGAGGAC GGAGAGGGAT CCTCAAAGAG CGGTAGGGAG TCTTTCCTTC GTCGTCAAGA ACGAGACGAG CCGGAATACT TTCCTGGAAG GGATCACGCC TCGGAACAAG GCGGAGAAGA CATTTTCGCA AGGATTTCGG AGATCAGTGA GCCACCATCT GCCTCTGGGA CTTACAAAGC TAAATTTGGA TAGTACCACC TGATTTGATG GCCTCTCTTG TTCAGACTTA TGTAGGTAAG AAGCGGTCAA CGTCCCTGAC ACGAATTAGT TCGCAGTTGT TCATCCTGTT TGGCCTATTA TCCACATCCA GTCATTCTTT GCAGTAAGTA CACGTAGCCA CCTCTTTTCC ATCAAAGGCC CAAGAATCTG ATAGCCGTGC CTAGGATTTC TATAAGTGGA CCAGCTATTC CTTTGCGGCG TTAGTGGTAT CTATGTGCAT GTTGGCGACC CGCTACACGA ATGACCCAAG AGTGCTAGCT GAACCGGGTG AGCTACAGAT GAATTTATAA CAACTAATTT GGCTGACTAA ACTGGCTCTT GATTTTAGAT ATCTCAGCCA GTGCAGGGTT CCAATATTTT GAGCTTTTCA GGCAATTGCG TGCACAAGCT TCGACAGAGG ATAACGTGAT TCAAGCAATC CAGAGCCTGT TTTTCGCGGC ACAATACCAT TGTGTTGATA ATGTACCCCA CCAGGTTGCA CAAGGTCTTT TTGCAGAGGC AGCAGCTCGA CTGCTCGATG GTGGACTACA TCGGTATGCA TTTGCCCGTT CGCGAGAAAG AAACGTCAGC TGAATTACTC GATAAGAGAA ATCTCAGATA TTGCGCTTGG CGATTCGCTG GAGAAGGAAA CGCGAACAGT GGGTCCAGTA GTATGCTCGC AGTTCCAAGA ACTAATAGAA AGCAGCGTAC TGCATGGGCA TGCTATAGCT GGGATAAACA ACTGGCCGCC ATTTGTGGTA AACCACCACT CCTACGCATT TGGGACTACG ACGTCTCATT GCCCGAAGTG TTTGAAGAGA CGCAATCGGC TTTTGCTGAG CTCCCCGAAA ATCAAGATGA GAAGCTGAGC GCAATCTTCA TCCAGCAAAT CCTGTTGTCA GTGGTACTTG AAAAAACGTT GACCTCGTGC ACCCATCATC CCGAATTTGA TAACTGCGAA ATGCTCAACA GATGGGCAAG GAGCATGCGC CCAGAAGTCG AAGATATGAA GGCTCTAGAT GATTCAATGC GGCTGCTGAA TGAATGGCGA GAGTGCGTTT AACAACTGTA TTTGATGAAG CACACAAGGA TGCTAATGTA ACGCAGGGCC CTCCCGCCAG CAATGTCAGA CCGCTCGATC GCTGGTCGCC TCGCTTCGCC GATGTACAGC GTCGAATACG AACAGATTGC TGTAATCGAG CAGACCATAG AAATGCTTAT AGCGGGCCGC AAACTGCAGC TTGCGACACT TGCCAAGACC CGGGAAAACA CGTCAACGAC TCGTCTGGAG TCTGCACGTG ATGCTATTCT CGAAGCAGGA AAACACACGC TAGCATCAGC AGTAAAGATG GGCTCTGCCA AAATGCTCGG CAAATGTGAT ATTCGTGAGT ACATATCATT TCAAATACCA TGCTGACATT TGTTTAGTCC TGGCATATCG TATATTGATG GCTGGTCGGT TTCTACTAGC CAGTCTGTTA TCAGCCCGCG CAGATTGCAA AGCCGAGCAG GAAGAAGAGG CGACCCGGGC AGTGAGAGCA GCGATTGTAC TTCTCCGTCA TTTCTCGGAC GTATTCCCCA TCTCACTTGG CTCTGCTGAA GTCTTGGAGG AAACATGCCG AGGTTGGTCA CTGCATACCA CTTTACTTGT CTGACTCTTT TACTGACTCG ATACAATCGC AGTTTGCCGG GTTGACATCT CTTTGCCTAC AGCGGCAACT CCCGGCCACC CTCGGCACAA CCTGTACGCC TGGCATCGGC CCCTCAGGCT TCGTGACAAG CATTCCAATG AAAAAGATGG CTGTCGTTCT CAAGTCAGGT CACCAACGAA TGTCATGTCT GGAGACGCCG CAGCGGACGC CATAGCGTCC ATGTTTTCCC CGGTGGACGC AGCATTTGGG TTAGGTTCTT TACCTTTACC TTTCCCTGAG ATTGGAACAG AAGGAGAAAG GCAGCCAGAC TTCTCATGGC TTCCTCCAGG TGGTAGTGTC CCTTACTACT TTCCCTCACA TCCATCAGAG TAACGTAGCA GTATATACCT
|
Protein sequence | MSVTSTHSQN SISPGAVPDA PGAHSAQPQT EAAGVDLRVR FARACDRCRH RKIKCDNEHP CGKCAAAGVT CTSGMSRATG VSRRRASRTS FSDTRNHHEA FIEKEGSPSN REHRSNNKGR KRRRDSEETD KIGATGNSKT SNVLDHLDTA HFFLSQEGMT RFAGSTSGLP VLEATRRMLQ KIPDPVNDAQ SPMGEPNWSW LSSLLEDGEG SSKSGRESFL RRQERDEPEY FPGRDHASEQ GGEDIFARIS EIIPPDLMAS LVQTYFAVVH PVWPIIHIQS FFADFYKWTS YSFAALVVSM CMLATRYTND PRVLAEPGNC VHKLRQRITL HKVFLQRQQL DCSMVDYIGM HLPVREKETS AELLDKRNLR YCAWRFAGEG NANSGSSSML AVPRTNRKQR TAWACYSWDK QLAAICGKPP LLRIWDYDVS LPEVFEETQS AFAELPENQD EKLSAIFIQQ ILLSVVLEKT LTSCTHHPEF DNCEMLNRWA RSMRPEVEDM KALDDSMRLL NEWREALPPA MSDRSIAGRL ASPMYSVEYE QIAVIEQTIE MLIAGRKLQL ATLAKTRENT STTRLESARD AILEAGKHTL ASAVKMGSAK MLGKCDILLA YRILMAGRFL LASLLSARAD CKAEQEEEAT RAVRAAIVLL RHFSDVFPIS LGSAEVLEET CRVCRVDISL PTAATPGHPR HNLYAWHRPL RLRDKHSNEK DGCRSQVRSP TNVMSGDAAA DAIASMFSPV DAAFGLGSLP LPFPEIGTEG ERQPDFSWLP PGGSVPYYFP SHPSE
|
| |