Gene Information |
Locus tag | CNH00940 |
Symbol | |
ID | 3259280 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 916243 |
End bp | 919215 |
Gene Length | 2973 bp |
Protein Length | 813 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258388 |
Product | transcription initiation factor TFIID 90 kDa subunit (TAFII-90), putative |
Protein accession | XP_572288 |
Protein GI | 58270264 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
Plasmid Coverage Information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.152566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAGATGTC GCAGTCCCCA GACCCATCCT CCGTCAAGGG TGGCGCCCAA GGGCAGCTAG CTCCCAAATC ATCCACACCC ACCCAACAAC CTCCTCTCAC AGACTCAAAG CTCCTCCGCC AATTCGTCAT GGAATACCTC CAATCTCATG GTTTCGACAA GGCTCTCGAA AAGCTCACTG AACAGGTGGA AGATGCGAAT TTGAGTACAA CTGCGAGCGC GGCTGTCTTG GGTGATGCAG ATGTCTCCAA AGAAGTTGCG GATGGGAAGG CAACGGCAGG TGAACAAGGC GGAGAAAAAG AAGCCATTTT CCGAGCTCCC GGTCCTGTAC CGTTAGAATC CAACATCAAA CGTAACATTC CCCAGGCCCA AGCTGTTTCT GCATCTACGA TGTCTGATCG GATTACTCCC GAATTCGAGG CCCAAGCAAA ATACATTATT GAGCAGCTTC AGAAAAAGGT CGAGGCCGTC CAAGAGGAGG GTGGAGATGA GAAGGAGAAG GAGCTGAGAG ATGGGATACC GGTTGATGGG GCGATGGTGG ATGTCAGTGA CAAAGTGGCT GGGTATGAGG CTTATAGGAG ATGGGTAGAT GGAGGATTGG ATGTTTGGAA GGTGGGTCTT AAGCAATCTA TACCGGGCCT GGAGCAAAGT GAACTGACAA TGGGATAGGC TGAACTGGAC AATCTGTCAT TTCCGCTTTT CGTCCTCTCG TTTTTGGACA TGATACATTC TGGTTTCCTC AAGACAGGTG TGTAGTCAAT TGCCTTTGTT CTTTCTGACA CCTTGCCTAG CTCGTGAATT CTTCGAAAAA CACAGTGCCC ATCATCGCGA ACTTCATTCC CAGGATCTTT CCTCATTATC ATCAGTATCA ACTGAAGAAC ACATAAAGCG AAATCCTTTC TGTGCTAGAA TACTGCGAGT GCTATATTCA CTCTGGGAAA AATGTGTTCA TTGCTGACTC CACGTAGGAG CGAAAAATAC GTTGTTCCTT TATCTCGCAA CGCTTATGAT TTACTTTTAC AATGGCTATC CGGAGCGAGC TTGGATGATG AATGGGAAGC TGGTCTCCAC AGCGCCCCGG GCAGACAAAA AGAGGCAATC AAGTTGATAG TGGTTAGCCA TTTGAACATA ACTGGCAAGT CAATTTCTTT TTCCATACAG GCCCCCGTAC TAACCAGAGC AATAGTTACC AACGATTCTG CGCCTCTCGA TAAAGTCACA ATAGCCTCTA AATCCGGTCT TATCTCATCT TCTCTCCCAC CCAACACCAA CATTGATGCT TTCAACACTG CCACCCAACT GAAGCTCGGT CCACCGCCAA TGACAGAAAA GCTAAAAGAG CAAGTTACGA GGACGCTGCA GGATGAAGGG GAGGCACAAG GTGCCAACGG CGACGCAAAC GGCGATATTG ACATGACTTC ACCCTTCCCG ACTAACGAAC CTGTTATCAA ACTCGAGCCC GAGACTGAAC GTGACCCCTC TGTGATCAGC CCTGACGAAT CCGAAACGCT GCCGCCTATC CCTGCTGTCT TTCGCATAGC AGACCTCAAG CGGGAAGTGG AAGCGATCAA AGATAAACGC AAGATGATCC GCCTTGGTCC TTCTGGATCA GGAGCATCAG CTGGATCCGG CGTCTTACCG AGTGTGGTAG CGTTTACGCT GTTTGACCAT GGTGAAAATG CCAACTCGGT AGAATTCTCG AGGGATAGCA GTTTGATGGC TGTTGGGAGC TCGGAGAGCT GTGTGAGACT TTGGAGTTTG AAAGGAGACA AGTTGAAGAA GAAGAAGGTG GATATAGAAG 
GGAACTTGGT GGAAGACGAA GGGTTGCCGA TGAGGAAACT CGTTGGCCAT TCTGGGCCAG TCTATTCCCT TTCCTTTGAT CCCTTGTATG GCTCTGCTGG TCCACCTTCC ACGCTTTTAT CTTCTTCACA AGATGGGTCT ATCCGACTTT GGTCCATGGA TACGTACTCA AACTTGGTAG TTTACAGAGG GCACGGTAAA GATCCTGTTT GGGATGTGGA GTGGGGTCCT ATGGGCGTGT ATTTTGCGAG CGCCAGTAGG GATCGGACGG CGAGGTTATG GAGTTCAGAC AGGGTTGCGC CGTTGAGAAT GTACACTGGG CATTTATCAG ACGTGAACGT GAGTTTTATT ATCTTTCATA TAAAACATTG GTAAAACGGG AGGGGGGCTG ATTGAGAGAT GCAACTCAGT GCGTCAAATT CCATCCCAAT TCGCTTTACC TTGCCACAGC GTCCACCGAC ACCTCTTGTC GATTATGGGA CGTCCAACGC GGTGCATGCG TCCGTCTCTT CCTCGGACAT ACCGATAGCG TCACAACGCT CTCCATTTCC CCGGACGGCA AGACGCTCGC GTCGGCAGGG CTGGACTCGT CCATCTGGCT CTGGGACTTG GGCTCCGCCC GACCGATCAA GAAGATGGAG GGTCATACTG GCGCGGTGAC GAGTCTAACT TTCTCCGCTG AGAGTTCCGT CCTGGTTTCC GGGGGTCTGG ATGGGACGGT GAGGTGTTGG GATGTGAAGA GTGCAGGAGG GGAGAGGAGT GTGGATGGGA TGTTTGGTGG TGGGGATGCG AGAAGAGGCG AGGAGAGGGG TGGGTTACCG ATGAATCCGG GTGATGTTTG GGATACTGCG CCTCATACGT GAGTGTTTTT ATTCTTTTCT TTTCTTTTCT TTTCCTATCC TCTTTTCTCT TGCCTGTACC ATGTGCTGAT AGGGTCGTTT TAGACCGGAC CTACTCGCAA CGTGGCCAAC AAAACGAACG CCCATTCTCA AAACACATTA TACCCCTCGA AATTTGTGCA TGGTAGCCGG GAGTTTTGTT CCACCTAGTA ACCACAGGAC GAGCGGCGGG GAAAAGTAGG AAATAGGGGA TGTGAGGGAA GAAGGAAATG AGGGTGGGCA GGAAGGGGCG GGTTTTCTGG TAACTGTTGT AGGAACAAAG AGTTTTATAT ATG
|
Protein sequence | MSQSPDPSSV KGGAQGQLAP KSSTPTQQPP LTDSKLLRQF VMEYLQSHGF DKALEKLTEQ VEDANLSTTA SAAVLGDADV SKEVADGKAT AGEQGGEKEA IFRAPGPVPL ESNIKRNIPQ AQAVSASTMS DRITPEFEAQ AKYIIEQLQK KVEAVQEEGG DEKEKELRDG IPVDGAMVDV SDKVAGYEAY RRWVDGGLDV WKAELDNLSF PLFVLSFLDM IHSGFLKTAR EFFEKHSAHH RELHSQDLSS LSSVSTEEHI KRNPFCARIL SEKYVVPLSR NAYDLLLQWL SGASLDDEWE AVTNDSAPLD KVTIASKSGL ISSSLPPNTN IDAFNTATQL KLGPPPMTEK LKEQVTRTLQ DEGEAQGANG DANGDIDMTS PFPTNEPVIK LEPETERDPS VISPDESETL PPIPAVFRIA DLKREVEAIK DKRKMIRLGP SGSGASAGSG VLPSVVAFTL FDHGENANSV EFSRDSSLMA VGSSESCVRL WSLKGDKLKK KKVDIEGNLV EDEGLPMRKL VGHSGPVYSL SFDPLYGSAG PPSTLLSSSQ DGSIRLWSMD TYSNLVVYRG HGKDPVWDVE WGPMGVYFAS ASRDRTARLW SSDRVAPLRM YTGHLSDVNC VKFHPNSLYL ATASTDTSCR LWDVQRGACV RLFLGHTDSV TTLSISPDGK TLASAGLDSS IWLWDLGSAR PIKKMEGHTG AVTSLTFSAE SSVLVSGGLD GTVRCWDVKS AGGERSVDGM FGGGDARRGE ERGGLPMNPG DVWDTAPHTP DLLATWPTKR TPILKTHYTP RNLCMVAGSF VPPSNHRTSG GEK
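The coordinate and length fields in this record agree with each other if Start bp and End bp are read as 1-based, inclusive positions: 919215 - 916243 + 1 = 2973 bp. The sketch below shows that check, plus a way to strip the 10-character block spacing used in the Sequence fields and compute a GC percentage. The function names (gene_length, join_blocks, gc_percent) are hypothetical helpers written for illustration and are not part of any database API; the inclusive-coordinate convention is an assumption inferred from the arithmetic above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def join_blocks(seq: str) -> str:
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(seq.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a (possibly block-spaced) sequence."""
    bases = join_blocks(seq).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(916243, 919215))
    # First two blocks of the Gene sequence field, joined into one string.
    print(join_blocks("AAAAGATGTC GCAGTCCCCA"))
```

Because the record marks the gene as spliced, the 2973 bp genomic span is expected to be longer than the coding sequence implied by the 813 aa protein (813 x 3 = 2439 bases, before the stop codon).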