Gene Information |
Locus tag | CNF03680 |
Symbol | |
ID | 3258018 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 1074605 |
End bp | 1076721 |
Gene Length | 2117 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257487 |
Product | rtf1 protein, putative |
Protein accession | XP_571655 |
Protein GI | 58268998 |
COG category | [K] Transcription |
COG ID | [COG5296] Transcription factor involved in TATA site selection and in elongation by RNA polymerase II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.176479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTATTGCA AATCATCATG TCTGACCTCG AGAACGAGCT TTTGGGTCTC GCAGAGGATG ATCCTACCCG CCACAGGAAG CGTCACGGCT CAAATGATAG GAGTAAAAGA AATAGCAAGG CGTTGTACGT TATCCCCATC TGACTGTTCT TCCCTACAGA CTATAAAAGG TTGACTAGGA GTGTTTTTTA GTATTGAAGA TTCGGACGAT GATGGAGAGG AGGAAGATAT GGAGATGGAA TCTGAAGATG ATGAACCAGC TCTTCAGAGA TCAAGAGGGC CGTTGAAGAA CCCTTATCCC TTGGAGGGTA AATATGTGGA CGAGGCGGAC AGGGAGGCGT GAGTATCCTT TATTTTATTT ATTTTGTTGA GTGTGATGGC TAATACAAGG TATAGTCTTG AGAACCTCCC GGAAATCGAG AGAGAAAACA TCTTGGCGTC ACGATTGGAA GAAATGCAAA AGTTCAAAGA CTCTCAAGCG CTTGATGCGA TGTTCAAGAC TGCTCATGGT GGGGATGATG AGGAAGAAGA TGATTCGAGA GCGAGAAAGA GACGTGAGCC GAACCATTCA GAACCATATA GAAGCTATGT GCTGATCGAT CCACAGGCAA GCACACTAGT GTGAGCGAGA AGGCTTCTAG GGCACTCAAC GTTTTGAAGA ACAAGCGGAA AGCGAAGGAT GAGCGTATGC AGCGCCGGGT AAGACTGCTC TCTATGGCAA ATCGTTGGCC GACCTGACAA ACGCAAAGGC TGCACGTCGT CGACATTCCC GATCTGCCTC TGCATCTTCC GAAGAAGAAG GCCAGATCAC CCGCAGATCG CCGTCATACT CCCCTGAACG ATCGCTTTCC CCTCAACCCA AAAACGTCCA GCCCAAGCTT AGCAAAGAGG AGGAAATGGA TGCTATCGCG CCCAACAGGG CCGAATTGGA GAGTGCGAGG GTTAGTAGGT ACGAGTTGGT GGATATGATG CACAAGGATG GCTTCGAGGA CGTTATCACT GGTGAGTGCA CAGCCATAGA CACCTGGATG GGACTTAAAC AGATGCCGCA GGTGCATACG TGCGAATTAT CTCTCCTGAT AGGGACGAGC ATGGTAGGCC AAAGTACAGG CTTTACAAAA TTGCGGATGT GGACGAGTCT GGACAGTTCG GATCGTATTC TATCGAATAC CAGGGTCGAC AAATCCGAGA GACTCGGGCT TTGCTTGTCA AATACGGTTC AGCATCGAGA CTGTTCAGAA TGGCGGATGT TTCTAATGGT GTGATTGAAG AAGTAAGTAT AGGATCTTGT CGATTACTGA CTAGAGTTAA TTTTGTTCAG TCTGAGTTTC AGAGGTTTTC TATGACAAAC CAAGCAGATG GTGTAAAAGC CCCTAAGCGG TCATTTTTAA AGAAGAAGCA CGATGAAATA AAGGCTCTGA GAGAAAGGCC GATGACAAGC GTACGTCTTG TAACACTCAT CCACTTATAG AGCTAAATTT CCATCGCAGG CTGAAATTGA TCGCCGAGTT GACTCTCGTA AATCTCAAGA ATCATCATTT ACTCGAGTTA GCCTCCTCAA AATACATCAA CTTATGAACA CACGTGACCT CGCCCTCCGC CGAAACGACC ACGTCATGGT CGAGAAGCTC AACTCCGACA TTATCGCCCT CGGTGGCGAT CCCAACACCG GCAGGCTTGT TGCAGAAAAG GAAGGGGAGA AGGATGACTA CGATATGAAG ATTCAGAAGA TCAATGAAAA CAATAAGAGG AAGACAAAGG AGGCAATGAT GAGAGCTCAT GCAGCTGCTG TGGCGAGGAA 
GAAGGCTGAA GAGGCCGTTG TTAAGGCGAA GCTGTACGGT TTATCCTCTC CGACTCCAAC GAGCCTATAC TAACCTTGTA ATCTGATCCA ATAGGGCTGC ATCCCAAAAC CCGTCTACAA CAAGTACACC AGCAACGGAT GTTCCCAAAC CCGAGGTTCC ACCGCCATCA GGTCAACGCA AGGGAGAGAC TCCTCAACAA TACGTGGCGA GGACGGTCCA GCTGGATCTA GATTTGGGAG ATTTCTGATG TGTTCTCTCG ACGAGTGGTG GGTGGAGCTG GGTTCTCTGC TATAGTCAAT GACGAAGTTG GGATATG
|
Protein sequence | MSDLENELLG LAEDDPTRHR KRHGSNDRSK RNSKAFIEDS DDDGEEEDME MESEDDEPAL QRSRGPLKNP YPLEGKYVDE ADREALENLP EIERENILAS RLEEMQKFKD SQALDAMFKT AHGGDDEEED DSRARKRRKH TSVSEKASRA LNVLKNKRKA KDERMQRRAA RRRHSRSASA SSEEEGQITR RSPSYSPERS LSPQPKNVQP KLSKEEEMDA IAPNRAELES ARVSRYELVD MMHKDGFEDV ITGAYVRIIS PDRDEHGRPK YRLYKIADVD ESGQFGSYSI EYQGRQIRET RALLVKYGSA SRLFRMADVS NGVIEESEFQ RFSMTNQADG VKAPKRSFLK KKHDEIKALR ERPMTSAEID RRVDSRKSQE SSFTRVSLLK IHQLMNTRDL ALRRNDHVMV EKLNSDIIAL GGDPNTGRLV AEKEGEKDDY DMKIQKINEN NKRKTKEAMM RAHAAAVARK KAEEAVVKAK LAASQNPSTT STPATDVPKP EVPPPSGQRK GETPQQYVAR TVQLDLDLGD F
|
| |
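The derived fields in this record (Gene Length, GC content) can be rechecked from the coordinates and the block-formatted sequence. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence`, `span_length`, and `gc_fraction` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the reported 2117 bp length.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for 10-base grouping; upper-case the result."""
    return "".join(text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(span_length(1074605, 1076721))

    # Only the first two 10-base groups of the gene sequence, as a toy input;
    # paste the full "Gene sequence" field here to recheck the reported GC content.
    fragment = clean_sequence("CTTTATTGCA AATCATCATG")
    print(len(fragment), round(gc_fraction(fragment), 2))
```

Passing the full Gene sequence field through `clean_sequence` before counting matters because the record stores the bases in space-separated groups that wrap across lines.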