Gene CNF03680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF03680 
Symbol 
ID3258018 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1074605 
End bp1076721 
Gene Length2117 bp 
Protein Length531 aa 
Translation table 
GC content48% 
IMG OID638257487 
Productrtf1 protein, putative 
Protein accessionXP_571655 
Protein GI58268998 
COG category[K] Transcription 
COG ID[COG5296] Transcription factor involved in TATA site selection and in elongation by RNA polymerase II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.176479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTTATTGCA AATCATCATG TCTGACCTCG AGAACGAGCT TTTGGGTCTC GCAGAGGATG 
ATCCTACCCG CCACAGGAAG CGTCACGGCT CAAATGATAG GAGTAAAAGA AATAGCAAGG
CGTTGTACGT TATCCCCATC TGACTGTTCT TCCCTACAGA CTATAAAAGG TTGACTAGGA
GTGTTTTTTA GTATTGAAGA TTCGGACGAT GATGGAGAGG AGGAAGATAT GGAGATGGAA
TCTGAAGATG ATGAACCAGC TCTTCAGAGA TCAAGAGGGC CGTTGAAGAA CCCTTATCCC
TTGGAGGGTA AATATGTGGA CGAGGCGGAC AGGGAGGCGT GAGTATCCTT TATTTTATTT
ATTTTGTTGA GTGTGATGGC TAATACAAGG TATAGTCTTG AGAACCTCCC GGAAATCGAG
AGAGAAAACA TCTTGGCGTC ACGATTGGAA GAAATGCAAA AGTTCAAAGA CTCTCAAGCG
CTTGATGCGA TGTTCAAGAC TGCTCATGGT GGGGATGATG AGGAAGAAGA TGATTCGAGA
GCGAGAAAGA GACGTGAGCC GAACCATTCA GAACCATATA GAAGCTATGT GCTGATCGAT
CCACAGGCAA GCACACTAGT GTGAGCGAGA AGGCTTCTAG GGCACTCAAC GTTTTGAAGA
ACAAGCGGAA AGCGAAGGAT GAGCGTATGC AGCGCCGGGT AAGACTGCTC TCTATGGCAA
ATCGTTGGCC GACCTGACAA ACGCAAAGGC TGCACGTCGT CGACATTCCC GATCTGCCTC
TGCATCTTCC GAAGAAGAAG GCCAGATCAC CCGCAGATCG CCGTCATACT CCCCTGAACG
ATCGCTTTCC CCTCAACCCA AAAACGTCCA GCCCAAGCTT AGCAAAGAGG AGGAAATGGA
TGCTATCGCG CCCAACAGGG CCGAATTGGA GAGTGCGAGG GTTAGTAGGT ACGAGTTGGT
GGATATGATG CACAAGGATG GCTTCGAGGA CGTTATCACT GGTGAGTGCA CAGCCATAGA
CACCTGGATG GGACTTAAAC AGATGCCGCA GGTGCATACG TGCGAATTAT CTCTCCTGAT
AGGGACGAGC ATGGTAGGCC AAAGTACAGG CTTTACAAAA TTGCGGATGT GGACGAGTCT
GGACAGTTCG GATCGTATTC TATCGAATAC CAGGGTCGAC AAATCCGAGA GACTCGGGCT
TTGCTTGTCA AATACGGTTC AGCATCGAGA CTGTTCAGAA TGGCGGATGT TTCTAATGGT
GTGATTGAAG AAGTAAGTAT AGGATCTTGT CGATTACTGA CTAGAGTTAA TTTTGTTCAG
TCTGAGTTTC AGAGGTTTTC TATGACAAAC CAAGCAGATG GTGTAAAAGC CCCTAAGCGG
TCATTTTTAA AGAAGAAGCA CGATGAAATA AAGGCTCTGA GAGAAAGGCC GATGACAAGC
GTACGTCTTG TAACACTCAT CCACTTATAG AGCTAAATTT CCATCGCAGG CTGAAATTGA
TCGCCGAGTT GACTCTCGTA AATCTCAAGA ATCATCATTT ACTCGAGTTA GCCTCCTCAA
AATACATCAA CTTATGAACA CACGTGACCT CGCCCTCCGC CGAAACGACC ACGTCATGGT
CGAGAAGCTC AACTCCGACA TTATCGCCCT CGGTGGCGAT CCCAACACCG GCAGGCTTGT
TGCAGAAAAG GAAGGGGAGA AGGATGACTA CGATATGAAG ATTCAGAAGA TCAATGAAAA
CAATAAGAGG AAGACAAAGG AGGCAATGAT GAGAGCTCAT GCAGCTGCTG TGGCGAGGAA
GAAGGCTGAA GAGGCCGTTG TTAAGGCGAA GCTGTACGGT TTATCCTCTC CGACTCCAAC
GAGCCTATAC TAACCTTGTA ATCTGATCCA ATAGGGCTGC ATCCCAAAAC CCGTCTACAA
CAAGTACACC AGCAACGGAT GTTCCCAAAC CCGAGGTTCC ACCGCCATCA GGTCAACGCA
AGGGAGAGAC TCCTCAACAA TACGTGGCGA GGACGGTCCA GCTGGATCTA GATTTGGGAG
ATTTCTGATG TGTTCTCTCG ACGAGTGGTG GGTGGAGCTG GGTTCTCTGC TATAGTCAAT
GACGAAGTTG GGATATG
 
Protein sequence
MSDLENELLG LAEDDPTRHR KRHGSNDRSK RNSKAFIEDS DDDGEEEDME MESEDDEPAL 
QRSRGPLKNP YPLEGKYVDE ADREALENLP EIERENILAS RLEEMQKFKD SQALDAMFKT
AHGGDDEEED DSRARKRRKH TSVSEKASRA LNVLKNKRKA KDERMQRRAA RRRHSRSASA
SSEEEGQITR RSPSYSPERS LSPQPKNVQP KLSKEEEMDA IAPNRAELES ARVSRYELVD
MMHKDGFEDV ITGAYVRIIS PDRDEHGRPK YRLYKIADVD ESGQFGSYSI EYQGRQIRET
RALLVKYGSA SRLFRMADVS NGVIEESEFQ RFSMTNQADG VKAPKRSFLK KKHDEIKALR
ERPMTSAEID RRVDSRKSQE SSFTRVSLLK IHQLMNTRDL ALRRNDHVMV EKLNSDIIAL
GGDPNTGRLV AEKEGEKDDY DMKIQKINEN NKRKTKEAMM RAHAAAVARK KAEEAVVKAK
LAASQNPSTT STPATDVPKP EVPPPSGQRK GETPQQYVAR TVQLDLDLGD F