Gene CNH00600 details


Gene Information

Locus tag: CNH00600
Symbol:
ID: 3259290
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 1012388
End bp: 1015341
Gene Length: 2954 bp
Protein Length: 716 aa
Translation table:
GC content: 50%
IMG OID: 638258424
Product: specific RNA polymerase II transcription factor, putative
Protein accession: XP_572252
Protein GI: 58270192
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTACA GTGCAGAGGC AGGGCATCAG AGAAAGCCCA CGTTCAAAAT CTCCCGTCCC 
CTTTTTCTCC CTCAGACCCT CCTACACGCC TACAGGGTGA GCTTCTGCGT AACTGACCGC
CAGTTCCGAA CAGAGTGGCC CCAAATCTTC CCAATATCTC TAGCGGACGC CCAATCCGGC
ACGAAGCACC GGCAACGGCG AAGAAACGCT TTTTTTTCTT TGCATCATTG ACCAAACTCC
GTCCGAATAC AATCCTAGTT CTATCTCTGT CACTTTATTC TGATTCAAAA AAACTCCTTT
CAAAAGCTGG TTTACGACAT GTACCAGCAA CAGACCATGG CCAGCCACCC ATTCTTCAGC
CACAACAATC CTTGGGCCCG TTCTGCCATT TGCTGTGATC AGCACCACGG CGAATCGACT
TCGGCCGCTA TGGCACGTCT TGCTGGCAAC ATTCCCTTTT CTCATGATGA AAATTTACAC
GACCACCATC TCGGATACCA CGACTCGTCT GGATGCACTT CGGATTGCCC TCTCGACACC
TACTGCTGCA GTGGAGATTA CTGCTGCGAC AAGCATGGTT CTTGCTCCAG TGGAGATGAG
TGTTGCGACG ACCCACGTTG TGAAGAAGCG CACAGACCGG ATAGTCGTGC CAGTCATCAA
AGCCATCACA ACCGTTCTAA ACCCGAGCAA AGCCGTAACC ATAACCACCA GCAGCCTATG
AGTCTGGAAG AATGGGCTGG AACCCAGGAA GGGTGTAACG CCATACAACA GCTGGTAAGT
TGGCCTACAA TTACATGTTT CTACCTTTGT CAGCTTCCCT GTCATTAGCT AATACTTCCA
AAGATTGAAT GCTGTAACCA GCCAGACTGC CATATACCGG TCTGCCCTAC GGACAATTCT
GAAGTCCATC CACTGCCGGC AGACCCCTTG TCAGCCCTAT TTGCATCACT AGATGCACAG
CAGCAGCCTC AACCTATCTC CACTGCCCAG CAGCCTATGG CGCCGGTAAG CTCCGTTGAG
GCTTCTCACA CTTGCCACTG GGGTAATTGC CACCTCGTTT TTGGCTCAAT GCCCGACCTT
TTGGCACATG TGGCGGCAGA TCACCTTAAC GCAGCGGGTA CGGCGCATCA GTCCGATCAA
CTTCTGCAGC AAGCCCAGTC TGCCCAGTCT GCCCAGTCTA CCCCGTTAGC ACTGCTTACT
GAGCGCGCGC TGTCTAGTAT TAGTACGAAT ACGACTGGTT TACAGAGTCA TTTGCCGACT
AACTCTTCGC TTCAAGCCAC ATCTTTGGCC GTCAATGACG CCCTGCTATC TTGCATGTGG
GATGACTGCT TTCCTGTCCC CGAGGTGCCT GCTGCTTCAT CAACGTCTCA CAGCACGTTC
CATCATTACA ACTCTGATAA CTGCCAGGCT CCTCATAATC ACCAGCACGA CCATTCTTAC
GCTGCTGGAG AGCCCTTTAA CCCTGGGACG ATGCTACGAC ATGTTCTGGA AGAGCATTTG
GGTATTCCCC CTGATATCAT TGGCTGGCCG AATGAGGCTG AGCTTCAAGC TCAAGCACAG
GCGATCCTTG AGAAGCACCA TCATCATCAC CATATCGACC CTCGCGAGGC CTTAGTGAAC
CACTCTGAGA ACTGCAATCA TGTGCATCCT CACCCTCACT CTCATTCTCA TGGGAACAGT
GCCGGCACCG GTGCCAATGA CTCACATCCT CATGGCCATG CTCTCGCTCA CTCTCAGTTC
CATCCCAATC TTCATCCTCA TCCTCATTCT CTTCCTCATG AGCGCTCGTA TGCTCATTCT
CATTCCCTCT CTCATTCACG TCCTCTCTCG CACGAACCTC TTCCCACGCC CCCCTCCACG
GTCAAGACCG AAGCCTGCAC CTCCCCTGCC GCTTCCAACG ATTCCGTCGC CAGCACAGTC
CTCACTGCAT CCCAATCCTC TAAAGATCTG ATCTGTCTCT GGCCCGGGTG CACCATCCAC
ACTCCTTTTG CTTCCACTGC TTCCCTCATG GATCATCTGT CCGAAATGCA CATCCCGAAA
GGTAAAGATT GTTATACATG CCATTGGGGT GGGTGCGGTG GTGAAGAAGG GAGGGTGTTT
AAGAGTAGGC AAAAGGTGTT GAGGCATTTG CAGAGTCATA TAGGACACAA ACCGTTCGTT
TGTGGGGTGT GTAATCAGGC TTTCTCGGAA GCGGCGCCTT TGACGGCGCA TATGAGGAGA
CATGCGCAAG AAAGTCAGTT AGATCGCGTA TTCCATTTTT TTCTTTTTTT TGGAATGCTG
ATAGTGGTTT GTAAGAACCT TTCAAGTGCG AACATCCAGG ATGTGGCAAA TCGTTTGCAA
TCTCTTCGTC TTTAACAATC CACATGGTAA GATTTGACCT GCTATAAGCA TCACAGCAAT
GGCTAATTTT TAATCAACAG CGCACACATA ACGGTGAAAA GCCATTTGTC TGCCCGTATT
GTCAGAAGTA AGTCCGCTCT TTTTAGGAAC AGTCGATATA GGGATTCTAA CGCAACGCTA
CAGGGGGTTT GTAGAAGCGT CCAACTTGAC CAAACACGTA AGTCATTCCT CCCCAAAAAA
AACCCCGAAA ATGAAACCCA TTGACAAAAC TTCGAACCAG ATCCGAACGC ATACGGGCGA
ACGGCCATTT GCGTGCTCTC ATCCTGGATG CGGCAAGAAA TTCTCGCGTC CTGATCAGCT
GAAGAGGCAC ATGACTATTC ATAACAAGCC ACCTGGGGAG AAAAGGCGAG GAAGTGGTGT
CCCTGCGAAG TAAAAGACTT TTCGTTTCAC GGCAAGTGTA ATGAAACTGA AAAAAGCGCT
CGAGGTCGGT GCATAAGCAA AAGCGTGAAA CGGTCGAGAA TCGAGACTTT TTGTGCAATC
TGTATTTGTA CCATGATGTT GGTGTACGGC ACTCCTTGAA AACTACTACG TGAAAATAGT
CTGCAGAGTA TTAT
 
Protein sequence
MYQQQTMASH PFFSHNNPWA RSAICCDQHH GESTSAAMAR LAGNIPFSHD ENLHDHHLGY 
HDSSGCTSDC PLDTYCCSGD YCCDKHGSCS SGDECCDDPR CEEAHRPDSR ASHQSHHNRS
KPEQSRNHNH QQPMSLEEWA GTQEGCNAIQ QLIECCNQPD CHIPVCPTDN SEVHPLPADP
LSALFASLDA QQQPQPISTA QQPMAPVSSV EASHTCHWGN CHLVFGSMPD LLAHVAADHL
NAAGTAHQSD QLLQQAQSAQ SAQSTPLALL TERALSSIST NTTGLQSHLP TNSSLQATSL
AVNDALLSCM WDDCFPVPEV PAASSTSHST FHHYNSDNCQ APHNHQHDHS YAAGEPFNPG
TMLRHVLEEH LGIPPDIIGW PNEAELQAQA QAILEKHHHH HHIDPREALV NHSENCNHVH
PHPHSHSHGN SAGTGANDSH PHGHALAHSQ FHPNLHPHPH SLPHERSYAH SHSLSHSRPL
SHEPLPTPPS TVKTEACTSP AASNDSVAST VLTASQSSKD LICLWPGCTI HTPFASTASL
MDHLSEMHIP KGKDCYTCHW GGCGGEEGRV FKSRQKVLRH LQSHIGHKPF VCGVCNQAFS
EAAPLTAHMR RHAQEKPFKC EHPGCGKSFA ISSSLTIHMR THNGEKPFVC PYCQKGFVEA
SNLTKHIRTH TGERPFACSH PGCGKKFSRP DQLKRHMTIH NKPPGEKRRG SGVPAK
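
The listed gene length agrees with the start and end coordinates if the interval is read as 1-based and inclusive. That convention is an assumption, since the record does not state it. The sketch below shows this check, along with one way to turn a sequence listing printed in 10-base blocks into a plain string and compute its GC fraction. The helper names (`span_length`, `clean_sequence`, `gc_fraction`) are hypothetical and not part of any database tool, and the sequence in the usage lines is a short toy string rather than the full gene.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end_bp - start_bp + 1


def clean_sequence(blocked: str) -> str:
    """Join a blocked listing (spaces and line breaks) into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in a sequence; 0.0 for empty input."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates and length taken from the Gene Information fields.
assert span_length(1012388, 1015341) == 2954

# Toy input only (an assumption for illustration), formatted like the listing.
toy = clean_sequence("ATGAATTACA GTGC\nAGAG")
print(len(toy), gc_fraction(toy))
```

To check the reported GC content, pass the full text under "Gene sequence" to `clean_sequence` and then to `gc_fraction`.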