Gene CNH01090 details

Gene Information

Locus tag: CNH01090
Symbol:
ID: 3259098
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 877201
End bp: 878775
Gene Length: 1575 bp
Protein Length: 232 aa
Translation table:
GC content: 48%
IMG OID: 638258374
Product: general RNA polymerase II transcription factor, putative
Protein accession: XP_572300
Protein GI: 58270288
COG category: [K] Transcription
COG ID: [COG2101] TATA-box binding protein (TBP), component of TFIID and TFIIIB
TIGRFAM ID:
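
The start, end, and length fields above agree with a 1-based, inclusive coordinate interval. The following is a minimal sketch of that arithmetic, assuming inclusive coordinates; the function name `inclusive_span_length` is hypothetical and is not part of any IMG tool. The coding-length line assumes one stop codon is counted. The gap between that figure and the gene length would be consistent with the record's "spliced" flag and untranslated sequence, though the record itself does not give an intron or UTR breakdown.

```python
def inclusive_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = inclusive_span_length(877201, 878775)

# Bases needed to encode 232 residues plus one stop codon (an assumption).
coding_bp = 232 * 3 + 3

print(gene_length)  # 1575
print(coding_bp)    # 699
```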


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCTCCCTTGC TAGCTTTCGT CCTACATCCA TCTTCTTTCA TCCTCTTATC CACATAATTA 
CGCAGCACTC GCTCAATCAG CGACATCGTT AATAGACACG CATAGCGTAC CCTAGAAAAT
TTAGCCCTCC GAATCAAAGG ATTGACCACT CAAAAGTAGC CGCCAAATAT CATCCTCTTA
TCTCTCGCCA CCAACGTGCA ACATGTCCGG TCTCGCCCTT CCCAAAGCGT CCAATGCGGG
CCCGTCCAGC TCAAGTAGCA AGCCAGAAGG AATTCAAACA CTTGATGGAG AAGGATCTCT
CGCCACAAGA CCACCGCAAA AGGCTATTGC TAGTGTTCCG GAGATTACAG CGGTGGATGG
TCTAGTGCCA ACCTTACAGT GGGTATTACA ACGTTTGTGG CTGGGATCTT TGCATATAGG
CCATGGTGTA TGAGTGAAAT AATTTGCATG GGATGGAACG AGAGACGAAG GAAGAAGGAA
GGCTGCGAAA GGTGACGGCG ATGAACCACC CTGTGACGGA AGTGTTGGGC CATCCTAGCC
AATGCCTTCG TTGCGAATGG ACTCGCCGCT GCACTGTTGC TACCTGGTGG CGCCCGCTGT
TTGTCTCACG TATGAGAAAG TTCAGTGGGT GCTGACTGCG TGTCTTTTGC AGAAACATTG
TCGCCACCGT CAACCTCGAC TGTCGTTTAG ACTTGAAGAC TATTGCCCTC CACGCTCGAA
ATGCGGAATA CAACCCAAGA GTATGCTTTT CTCTGTCCTG GAAGAAGCAT CAAACTGATG
TATTGTTTAG CGTTTCGCTG CCGTTGTCAT GCGTATTCGT GACCCAAGAA CAACGGCCTT
GATTTTCGCT TCTGGAAAGA TGGTCGTCAC TGGTGCCAAG TCTGAAGACG ACTCTCGGTT
AGCATCTCGT AAATACGCCC GAATAATCCA AAAGCTTGGT TTTGAGGCCA AGTTCGCCGA
GTTCAAGATT CAGAACATGG TCGGCAGTTG CGATGTCAAG TTCCCTATCA GATTAGAAGG
TCTGGCATTC AGCCACGGTG CATTCAGCAG TTACGAACCG GAGGTGAATC CTTCTCATAA
TTGTCTGAAT TGTTTTCTCA CATACTTTCA GCTGTTCCCT GGTTTGATTT ATCGTATGCT
CAAACCCAAA GTCGTCATCC TCATCTTCGT GTCCGGCAAA ATTGTCCTCA CCGGTGCAAA
AGTCAGAGAA GAAATTTACA TGGCGTTCAA CCAGATCTAC TCTGTGTTGC TCGGTAAATT
GACCTTTCCC TCATATGTGA AGGCATATGC TAATATCTTC CTGTGCAGAA TTCCGAAAGA
CGACATAAAG TCTCGGCTGA CAGGCTTCTC TTTCATTACC ATCCTCTGCA CCGCGACCTG
TACTGCCCCT ACATCGTTGA GCATAACATT TGGGTTTCAA GTTAATCTCT TCTTTTTTGG
GCGCCGCGCC CACCCCGGTC ATCCCGATAT TGTCTATCAC ATTAGTCATA TAACGTACTT
CACGAAAAAA AAGCAATTAT CCATACTGGT GACAACCACA TTGACAGTCC CCTCAGACTA
TACACAGTTT TATCG
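
The sequence above is laid out in space-separated blocks of 10 bases, 60 per row. As a hedged sketch, a GC percentage like the one listed in the record could be computed from this layout by stripping whitespace and counting G and C. The function name `gc_percent` is hypothetical, and the exact method IMG uses (for example, its rounding or treatment of ambiguous bases) is not stated in the record.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Percent of G/C bases in a sequence that may contain spaces and newlines."""
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, used only to show the block layout.
first_row = "GCTCCCTTGC TAGCTTTCGT CCTACATCCA TCTTCTTTCA TCCTCTTATC CACATAATTA"
print(len("".join(first_row.split())))  # 60 bases in one row
print(round(gc_percent(first_row)))
```

Passing the full gene sequence as a single multi-line string works the same way, since `split()` removes both the spaces between blocks and the line breaks.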
 
Protein sequence
MSGLALPKAS NAGPSSSSSK PEGIQTLDGE GSLATRPPQK AIASVPEITA VDGLVPTLQN 
IVATVNLDCR LDLKTIALHA RNAEYNPRRF AAVVMRIRDP RTTALIFASG KMVVTGAKSE
DDSRLASRKY ARIIQKLGFE AKFAEFKIQN MVGSCDVKFP IRLEGLAFSH GAFSSYEPEL
FPGLIYRMLK PKVVILIFVS GKIVLTGAKV REEIYMAFNQ IYSVLLEFRK TT