Gene CNN00060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00060 
Symbol 
ID3255392 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp17778 
End bp21590 
Gene Length3813 bp 
Protein Length919 aa 
Translation table 
GC content50% 
IMG OID638254422 
ProductRNA polymerase II transcription factor, putative 
Protein accessionXP_568515 
Protein GI58262210 
COG category[K] Transcription 
COG ID[COG5068] Regulator of arginine metabolism and related MADS box-containing transcription factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0968509 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCCTGTTCG CGAGTCACAA CACAGAGGAA ATCATAGGCT CACCACATCC ACCCTATTCC 
AATTGTCATT GGCTGGCCCT ACGCGGGAGC TGAAGTAGTA ACTCAGATCC GAATCAGCTG
AAGGTGCAAC CAAACATCCA ACATCTCCTA TCTCCACTCC TTGCCTTGTT GTATTGCTTT
AAGCATCTCA TGCTGACCAC GTGATAGTGC CACAAATATA TCAGTAACGA GACAATCACT
GTATCCAACA TCCTCCAGTC TGGCGCAGGG GAACTGTAAC ATCTAGTTCG CTTCTGGTTT
CACCTCCCCC GGAGTATCCC ATCCAACCGA CTCGACCAGA AAATGGAGGA TTCCTATTCA
CCCCGCATTC GTACGCCTCA TATGAATCAT CTTCACCCTT CTCATCCTGC CCACCAGCAT
TCACATTCGC CGAGAACTGT GTCGATAACG ATGGAGGACT ACATTGCGGA CGATAATGAT
AGAGGAGATG GATTGGGTAT GGGTGAAATG GGTATGGGAA GTGGTTTTGG AGAAGGAGAT
TTCGGAGAAG GAATGGGAAT TGATATGGAT TTTGCCAGTA AGTCAGGTTG CGGTTTACTT
GCGGAGCGAA GCGACATGGG GAGGAGACCG TTCTTGGAAC TGGTTGAGAG GCGGATGTCG
CATACTCCTG TATGGCGACA AGCGAGAAAC AGGTGTCGTG ATGGAAGAAA TTAGCAAAAA
GTACCAGCTA ACTCATCTTT CACAGCGTCC GCTCTTACTC CCACTTTATC ACAACACTCG
CTCTCTCGCC CTCCGACTAG TCACAGTCTT GCCGATGCTC CTTCAAGCGG TGGTCTCAAT
TCATCCGCTT TACCTTCCAG TGGGGGGATG AGTGGCACTA TGCCATCGAT CGGATACGGC
GCTCAGCAAC ATCAGCATCG GCATCCTCAG GCGCAGCAAC AACATCAACA TCACGCTCAT
CAGTCTCAGC AGCAACAGCA TCCTTCCCAA CCTCATCCCA CCTCCCAATC AAACCAACAA
AGCCACCCAC ACGCGCATCA TCAAGCGTAT TCTCAGCCAC ACGCCCACAC CCATACCGAC
CATATGCAGT CCATGCACCC TAATAGTGGA ATGGACATGT TCCAGTCAGA GACGGGTTTA
GAAGATGAAG ACGATGAAGA AGGTATGTAT CTATCTCGAA TTTTATTGTG TTGGCTGAAC
CTTGACTTTC GGATTACAGA TACTCGAGCC AAACGTCCCC GCCCAAACAC TTCCCAACCC
TTCCATGCCT CCCGCGCACC TTCTCATCCT CACTCTAATT CCAATTCCCA TTCTCACGAC
CCCGATGCGG AAAACGATAA TGATAACGAC AACGATGTTT CCGACAAGGA CGAACCCCAA
CGACGTAAAA TCAAGATTGA ATACATACCT GACAAGTCTC GTCGACATAT CACGTTTTCG
AAGCGGAAGG CGGGTATCAT GAAGAAGGCG TATGAGCTGT CGATTTTGAC AGGAACGCAA
GTGTTGTTAT TGGTGGTATC TGAGACTGGG CTAGTCTACA CATTCACGAC GAACAAGTTG
CAGCCTTTGG TGCAGAAGTC GGAGGGCAAG AATCTTATCC AAGTACGTTC TTTCATCTTT
TTCCCTCTTT GTCGCAATAC CTTTTATTGA CATGGTCCTT TTCTTCTCCA ATAGGCATGT
CTCAACGCTC CTGATGGTTT CGGTCCCGAC GGCGAACCTG TCGGACCCAT GTCCGCTACG
AAGTCGAAGA ATGGTGGCTT GGCGATACGA CCTCACAAGT TACCTGCTGG AGCAAGCGCG
GCAATGGCGA AGAGTCAGGC CTTGTCAGCC GAGCAGAATG CCGCCCAGCT TCAAGCTCAT
GCCCAAACAC AAGCTCAGGC GGCACAAGCC CATGCCCAAG CTCAAGCGCA GGTGCAAATG
CAGGTGTCAG CCGCAGCACA GGCTCAAGCA ACTGCTCAAG TTCAAGCTCA CGCGGCGGCT
AAAGCAGCCC AAGCCCAAAG ACAACAAGCT CAGATACAAT CTCATGTCCA GAATCAGAAC
TCTTCTCCGA ATCCGAACCA GCCTGAGCTC AGTCGTCGAG AGCAACAACT GAAATCACTC
TCAGCGCTCG GTATATCCGC GACACCCCCT ACGACTTTGT CTACTCTTCC TCCATCCTCG
GCTTCGATAC CTTCGTTACC TTCTTTACCT GGATCGGGCT CAATGGGTTC TGTAACGCAA
GTTCCGCCTA TGACTTCGTC TGGGCAAGGA GGAGGTATGG GGAGGACCCC AACGCCAAGC
CAACTGCATA CGCCTACAAT GGGACATGGG CCACTCGGTC AAGGTATTTC AACGCCTGTT
ACACCTCGCG CCAACGCCAA TATGGCCGCA AACTCCAACA AAAACAACTT GGCCAATGCC
AACCACTCGG TATCCGTATC ATCCCCCGCT AATCGACCAA AGAAGCGGCT CCCTTCCCGC
CGACGTCAAG CTTCTACCTC TTCAGCAGTC GGCGAGATGG GCATAAAGTT GGAGCAAGGA
ATGGATATAC CGCCCGTGCC GAATTTGCCT GCTGAATATG TAGGATCCGG CCAGGCGAAT
GGGTCGTCAC CTGGTGCTAG CGGATCATCT ACTGGTAAAG GCCTCAGCGG AAAAGGAAAG
GCCCAAGGTT TGGGTATAGG TATTGGTGTA AGCTCCCCTA GGATCGCGAG CTCTTCCGGT
GCAGCGAGCC CCAAACTGGG ATCAGGGGTA GGGGTGATGA CCGGCACAAT TGCGCAGAGG
AGAGCGGAAG CGTCCGCGGC AAGGCTGGAG AGGATGGGTG GGGAGATCGA TTCAGAGATG
GGCGCTCAAC ATAGTCAGCA TGGTCATCAA CATCAACAAC ACCAGTATCA TCACCAGCAA
AATTCGCCTT TATCCCCTAT GTCAGTCTCG ACATTCCATT ATCCTCCTGA ATACCGCCAT
CCCTCTTCTC ATGCGCATAG CTACGCCCGC ACTCTTCACC ATTATCCTCA TGCTGAGCAC
GGTCATCTTC AACCCCAGGC AGTCTCTGCT ACCCAAGCAC AAGCTCAAGC AGACGTGGAG
AATGCCTACT ACTACGGCAC TCGTTCACCA AATTCGCAAT CGCTCTCTGC AGCTGGTCTG
GGCGACATGC GCCACATAAG TGACATGAAC GATATGGGAG AGATGGGTGA TTTGCGAGAT
ATGAGGGAGA TACGAGATAT CCGCAATATG GGAGATATGG GCGATATGGG TGACATGGGT
AATCTGGGTA GTATAAGCGA CATGGGTGAT ATGGGTGATA TGGGCGATAT GGGCCACTTG
CGTGAGATGG ATGGCATGGA TATGGAGATA GGAATGTTTG GCTCAGGTGA AGAGAGGGCA
GGGAGGGAAG GAGGGAGGTT GATGGGTATT GGGATGTAAA TCGTGGTGTC GAAAGAAGAG
AACCGGGCGA ACGCCACGAA GAATCGGTTG TTTCCCCTTT CGTGAGTACT TTAGGGTCCG
CTCAATCTTA CTCTGTTCTG CAACATCATC TAGCTAACTG GTCTCCTATG ATAGCATCTT
TTTGGCAAGG CTATTCTATC CGTTGGACAA TAACAACTTT CTTCCTTTGC GTCTTTTTTC
TTTTTTATCT TTGTGCTCTT TCTTTATATT CTCGCTATCA CTTTAATGTG TTTGACTATC
TCAACTTTTC GAGTTCATCT AACTGCAGTT ATCAAGACAG TACCATGTAG AGGTTGTTCG
AAAGAGTCCT TGGTGCGGGG AAAGTGGGAA GACAAATCAG TGTATTACTG TTCCTTATGA
GTCTCATTAT TTTTATATGA TATGTTTGCT TAA
 
Protein sequence
MEDSYSPRIR TPHMNHLHPS HPAHQHSHSP RTVSITMEDY IADDNDRGDG LGMGEMGMGS 
GFGEGDFGEG MGIDMDFATS ALTPTLSQHS LSRPPTSHSL ADAPSSGGLN SSALPSSGGM
SGTMPSIGYG AQQHQHRHPQ AQQQHQHHAH QSQQQQHPSQ PHPTSQSNQQ SHPHAHHQAY
SQPHAHTHTD HMQSMHPNSG MDMFQSETGL EDEDDEEDTR AKRPRPNTSQ PFHASRAPSH
PHSNSNSHSH DPDAENDNDN DNDVSDKDEP QRRKIKIEYI PDKSRRHITF SKRKAGIMKK
AYELSILTGT QVLLLVVSET GLVYTFTTNK LQPLVQKSEG KNLIQACLNA PDGFGPDGEP
VGPMSATKSK NGGLAIRPHK LPAGASAAMA KSQALSAEQN AAQLQAHAQT QAQAAQAHAQ
AQAQVQMQVS AAAQAQATAQ VQAHAAAKAA QAQRQQAQIQ SHVQNQNSSP NPNQPELSRR
EQQLKSLSAL GISATPPTTL STLPPSSASI PSLPSLPGSG SMGSVTQVPP MTSSGQGGGM
GRTPTPSQLH TPTMGHGPLG QGISTPVTPR ANANMAANSN KNNLANANHS VSVSSPANRP
KKRLPSRRRQ ASTSSAVGEM GIKLEQGMDI PPVPNLPAEY VGSGQANGSS PGASGSSTGK
GLSGKGKAQG LGIGIGVSSP RIASSSGAAS PKLGSGVGVM TGTIAQRRAE ASAARLERMG
GEIDSEMGAQ HSQHGHQHQQ HQYHHQQNSP LSPMSVSTFH YPPEYRHPSS HAHSYARTLH
HYPHAEHGHL QPQAVSATQA QAQADVENAY YYGTRSPNSQ SLSAAGLGDM RHISDMNDMG
EMGDLRDMRE IRDIRNMGDM GDMGDMGNLG SISDMGDMGD MGDMGHLREM DGMDMEIGMF
GSGEERAGRE GGRLMGIGM