Gene CNC05370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC05370 
Symbol 
ID3256201 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1603584 
End bp1607519 
Gene Length3936 bp 
Protein Length1069 aa 
Translation table 
GC content47% 
IMG OID638255755 
Producttranscription initiation factor tfiid 111 kda subunit, putative 
Protein accessionXP_569735 
Protein GI58265158 
COG category[K] Transcription 
COG ID[COG5179] Transcription initiation factor TFIID, subunit TAF1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAACGAAAG CCTCTCGCTT CGTCTTCCTT CCTGGGCACC ACTTCCTCAA AGCTCACGAT 
GGCCTCCGAG GACGAGACCC TTTCTTCGCT GGGTTCGCTT GGTCTTGGCC GTATCCTCGC
CTCCGCAGGT ATCGACCCAT CATCCATTGG CTCTTTTCTC GGTGACTCAG CACAATCGTC
CAAAGAACTC ACACAGGTTG AATTGGACGA AGATGATGCC AAGTTTGAAG ATGACATATC
CGATGGTGAG TTGCCTGAGG AAGGAGAAGA GGAAAGGCGA CAGCGGGAAA TAGATCAGGA
GGCGAGGAAA AGGGAGCAGG AAAGATGGAT GAAAAAGGGG TTGGAGATGA TGAAAAAGAG
CATGGAACAG CAACAAGCCA AAGACAAAAA AGGAAAACAA AAAGCAAATG ACGGGAAGAC
CCAGGAAGAA CGGGATCTTG AAGAAGCAAG GAAGATATGG CCGGACTTTG ATAAGGGGAA
GAGGCTGAGG ATGAGTGAGA TCTTTTATGA GACACCAGCG GATGTCAAAG CTTTCCAGGC
GAAGAGAAAG AAAAGAAAAA CGGAGATGAT GAAAGAAACA AAAACTTGTA AGACCTATCA
TGATTATGTT GAAGACCAGC TCGTTCATGC TAAACAAATG AACAAATGTC CAGTTACGTT
CACCGTCGCC CCTCCACCTA TACAATCTCT CCAATCTACT TTTCTGCTTC CATCACTCCA
ACCGATTCAA CTCCCACTAC CTGGCACACC AACCTATAAT ACACCCATAG GAGTATTTTT
CGATAAGAAA TGGATCAGGG AAGCGAAGGA TCGAAGAAGA TTAGAGATGA CTAAACCTCC
AGAAGGATTA GAATTAGAGG ACGTGAAGAA GGTTAGGTTC GCGAACGGGG CGAAAGACCT
GGATCTGGTG GATTGGGAAC AAAGTATCAT TATGAATTCC ATGTAAGTTT AGTGGCTGCT
GTAAAGCGAC GCTGCTAATC GGTCAACAGG GAAATGCCTG GCAAGGAAAT CGATATCCTC
GCACCCCGCA ATGATCACCT AGAATCAGGG GACTGGATCA CAAATGTGAT TTGGGATGCG
ACTCGCATCT CTCCTGAACT ACTGGAGAGT GACGAAGAGG ACGATGCCCA ATCCGAAGCG
GCAAAAAGAT CGGCGACAAA GAAGGGCGGG GCTGTTGCCA TTGTGAAGGA CACGAAGAAG
CTCGACCCTT TCAACATCTC AAACGACCCT TTGTACGAAC ACTCACGAGA AAGCAAGTAC
CGCATCCGAC AAACTTTTGG TGCTATCGAA GTATTCCACT CTATGCCCGC CAAGATTCTG
CAGTTACCTT ACGTGGGTGC CATCTTTGTA TTCTCATTGT CATTTACTTA CACATACCGC
AGTTCAAGAC TACTCTCAGC AAATCCGAAG CTCGAGCTTG GCATCGTCCT GCTCTTCAAT
TCCCGACCGG CGTGTCTCTT ACTTTCTCCA AACTCAAATC GAATCCTTCA GCCGCATTGA
ACGTGAAGAA AAAGCAGATG ATGGCAGATC CTTCGGAAAA GTTCAAAACG ACAAAGGATC
TAACATTGAC GGAGCAAGGT CCATTTGTGT TGTTGGAATT TTCGGTGAGT TTTAAGTCTT
CTTCGGAAAG AAGTGTATAC TCAGAATGGA CACGCAGGAG GAATACCCAC CAATCATGAG
CAATTATGGT ATGGGCACTA CCATCGTTAA TTACTATCGT AAAATCGATG ATAAAGACGA
AACCGTGCCC AAGCTTGACT TTGGCCAGCC TTCAATTCTT AATACTGGAG ATGCCGAACC
ATTCTTGCTA GGATATGTGG ACAGGGGCAA AGTGACTCAA GTGATTCACA ACAACCTTAT
CAGGGCGCCT ATTTTTAGGC ACAAGCCTGA GACTACAGAT TTCTTGTGTA TTCGGTATGT
TTGTTCTATG CGGGTGTACA TGTTTTTGTT AGTTGACCTC GTGATCAGAC AAACCGTCAA
TGGCCATGTC TCTTATCATC TCCGTCCCAT CAGCAACATC TTTACCGTTG GCCAAACAGT
TCCAAACGAG TCTGAAGTTC ACGGCCCGCA TGCGAGAAAA AATACCAACA CTGCCAAAAT
GCGTCTCATG ATTATCGCCT GGTTATTGAT CAATAAATCA AAGCAGAAGA GATTTAAGAT
TGGCAAGTTA CTAAAGTACT TCCCTGATCA GACAGAGTTA CAAATGAGGC AGAGGTTAAA
GGTGAAAGGG AACGTAAGTT ATTTTGTCGT TTCGTAAACC CACCAGCTAA AAGATCTTAC
AGGAATTTTT GATGTATGCA CGGAGTCCTG GTCCCAACCA AGGTTACTGG ATGCTTAACC
CCGACTATGC CTTCCCCGAC GACCGACGCC AAGTTCTCGA GATGTGTCCT CCAGAACACG
CGTGTCTTTA CGAAGCCATG CAAGTCGGAG CTCGCCATCT ATACGATGCT GGATACAAGA
AAACTGCGGA GGGTGGTCAT GAGGATGAAG ACGAAGCAGG ACTGGATATT GAGCAACGCC
TGGCGGTGTG GTCGACCACT CACAATTATA AGCTGGCGGA AGCTCAGAAG GCTTGGTTAA
TGGTCCATGG TGAGGGTGAT CCGACAGGAA GAGGCGAAGG TTTCAGTTTC TTGAGAGCGA
ATATGAAAAA TTACTTCTTG AGGAAAGGCG AAACTGAGCA AGGGAGGAGA TGTAAGTCCG
ACCCCTACAG TTTCCCGGAT CCCAAAGACT AATCTAATCT ATCCTATTAG TGGAAGCGGA
AGCCAGAGCA GGCGGCAACC CCGTGAAGAT CTCCAACGCC GAGCAAAATC GTATTTACGA
AGAAGAGAAA CGCAAGGTCT GGGATCTTCA GGCATCGGCA CTTAGCAATC CAGTCCCCCC
TGTTCTCACT GCCGCCGAGG AGGAAGCCGC CCGTAATGCT CAACCCCCTG TGATGCCCGG
TCTCGCACCA AAGATTCACC GTGGGGATAG CAGGAGAGCT TTTTCCAGGG GTACTTCCAT
GGCCGCGACC CCGAGGGGCT TCGATAGTCC AAGGGATAGA AGCCCGAGTG TTTTCTCAAT
GGACGGCGGA GAGAGTCACT ATTCAGGAAA TCCGCTGGCT GGTAAAGTGT TGAGGATCAA
GAGGATGGTA TGTTTAATCT ATCTTCGCTA CCGACTAATG GGGGCTTACA AACCATTCAG
GTCAAGGGGA AACAGCAAAT AGAAATCGTA AGAGACCCGG CTGTCATTGC GAGCTACTTG
AGAAGAGTCG AGGAGAAGAA GATCGAGTAT TACATGGAGC ATCCTGATGA GCTGGCACCT
ACTGGTGATG ACACTGAGGA TGAGCTGAGA AAGGTCGCGT ACGTTTTTCA GTCACATCCT
TTCATATCTG AGCTCGTATT GACAATTCAT CAAGTCTTCG CCAGATGCTC GAGAAGAACA
AGTTGAACCA ACAGCGAAGA TTAATGAGGA AGAAGTATCA GTCAAAAACT TTGGAAACGG
ACAATATGGG GATCGAGGGC ATAGATTTGG AAGGGGTAAG TTGCTAACCA CTGTTGATTG
TGATTCGAAG TTAATATACT CGCAGAAACG AAAATGTGGT GCTTGCGGTG CTATTGGCCA
TACCAGTACG TTTACGTTGC CTTATACCTG GATGATTGGT TATGATACTG ATTTGAATAC
ACAGAGGCGA ACAGAAATTG TCCTATGTTC GGTGTCACCA CTGGCAACGC CTCTGTCGGC
CTTTCACCTT CCAACACTGC TAGTGGCCAC ACGCCTGGCT ATGGAGGTTT TACACCTATG
ACGCCCATGG ATACAAGCAC CCCTGCTACT CAACAGCCAA CCTCCTTCAA GATCAAGCTT
GGTGGCTTGG GCGGAGGTCA ATAGCGAAAT CTTAATTATT GTATCAATTA ATTGTATGGT
ATAGTTAGCG ACTAAATATA TAGTCGCATT GATCAA
 
Protein sequence
MASEDETLSS LGSLGLGRIL ASAGIDPSSI GSFLGDSAQS SKELTQVELD EDDAKFEDDI 
SDGELPEEGE EERRQREIDQ EARKREQERW MKKGLEMMKK SMEQQQAKDK KGKQKANDGK
TQEERDLEEA RKIWPDFDKG KRLRMSEIFY ETPADVKAFQ AKRKKRKTEM MKETKTFTFT
VAPPPIQSLQ STFLLPSLQP IQLPLPGTPT YNTPIGVFFD KKWIREAKDR RRLEMTKPPE
GLELEDVKKV RFANGAKDLD LVDWEQSIIM NSMEMPGKEI DILAPRNDHL ESGDWITNVI
WDATRISPEL LESDEEDDAQ SEAAKRSATK KGGAVAIVKD TKKLDPFNIS NDPLYEHSRE
SKYRIRQTFG AIEVFHSMPA KILQLPYFKT TLSKSEARAW HRPALQFPTG VSLTFSKLKS
NPSAALNVKK KQMMADPSEK FKTTKDLTLT EQGPFVLLEF SEEYPPIMSN YGMGTTIVNY
YRKIDDKDET VPKLDFGQPS ILNTGDAEPF LLGYVDRGKV TQVIHNNLIR APIFRHKPET
TDFLCIRQTV NGHVSYHLRP ISNIFTVGQT VPNESEVHGP HARKNTNTAK MRLMIIAWLL
INKSKQKRFK IGKLLKYFPD QTELQMRQRL KVKGNEFLMY ARSPGPNQGY WMLNPDYAFP
DDRRQVLEMC PPEHACLYEA MQVGARHLYD AGYKKTAEGG HEDEDEAGLD IEQRLAVWST
THNYKLAEAQ KAWLMVHGEG DPTGRGEGFS FLRANMKNYF LRKGETEQGR RLEAEARAGG
NPVKISNAEQ NRIYEEEKRK VWDLQASALS NPVPPVLTAA EEEAARNAQP PVMPGLAPKI
HRGDSRRAFS RGTSMAATPR GFDSPRDRSP SVFSMDGGES HYSGNPLAGK VLRIKRMVKG
KQQIEIVRDP AVIASYLRRV EEKKIEYYME HPDELAPTGD DTEDELRKVA LRQMLEKNKL
NQQRRLMRKK YQSKTLETDN MGIEGIDLEG KRKCGACGAI GHTKANRNCP MFGVTTGNAS
VGLSPSNTAS GHTPGYGGFT PMTPMDTSTP ATQQPTSFKI KLGGLGGGQ