Gene CNK01590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK01590 
Symbol 
ID3254565 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp468080 
End bp471120 
Gene Length3041 bp 
Protein Length838 aa 
Translation table 
GC content50% 
IMG OID638253648 
Productconserved hypothetical protein 
Protein accessionXP_567833 
Protein GI58260846 
COG category[A] RNA processing and modification
[D] Cell cycle control, cell division, chromosome partitioning
[K] Transcription 
COG ID[COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.196806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCTCTTTCTT CCATCCGGAA AGGATTGTTC AGACCGAAGC ACACTGCGTA CTGCACCACA 
AGTCTCAAGG AAATAGCACT GCCAAAATCT TAGATTAGGA CGGGGGATTG GAGAAGGTAT
ACTCTCACAT TACAAGACCT TAAATTATTG TCATTTCATA ATGGTCAGTG TACATTGTGT
TTCGTTGTAT AGATGAAAGC TAAATAGTTC ACAACAGCGA GTTATTGTCA AGGGCGGTGT
TTGGAGAAAC ACCGAGGACG AAATTCTCAA GGCCGCCATT TCAAAATACG GTAAGAATGT
GAGTATAACC CGCACATATA TGGGAATGAA GCTGACGAGC TTGGTTTCAC AGCAATGGGC
TCGTATCTCA TCACTTTTGG TTCGAAAAAC CCCCAAGCAA TGTAAAGCGA GGTGGTACGA
ATGGTTAGAC CCTTCAATCA AGAAGGTGGA ATGGTCAAAG GCAAGCGCAT TCGGTGTTAT
GCAATGGCTA ACGATGGTTT CAGACTGAAG ACGAAAAGCT TCTTCATCTT GCCAAGCTCA
TGCCTACTCA ATGGCGAACC ATTGCTCCTA TCGTCGGTCG TACAGCTACG CAGTGTCTTG
AGCGATACCA GAAGCTCTTG GATGACGCGG AGGCGAGGGA CAACGAAGAG CTGGGATTAG
GAGCAGGAGA GGATGAATCG AGCAAACCAG CTACCGATGC TAGGGGTCTC AGGCCGGGAG
AAATTGACAC AGACCCTGAG ACAAGACCCG CGAGGCCTGA CCCCATCGAT ATGGATGATG
ACGGTACGTG TATCATAATG TTTGGTTGCT CTGTGCGATT CATTGATATC CTATTAGAGA
AGGAAATGTT GTCGGAAGCG CGAGCGAGAT TGGCCAACAC ACAAGGCAAA AAAGCCAAAC
GCAAGGCCCG AGAAAGGCAA TTGGAAGAGG CCAGGCGATT AGCTTTTCTA CAAAAGAAGC
GTGAATTAAA AGCCGCTGGT ATTAACCTTC GTGCCAAGCC GAAGAAGAAG GGTATGGATT
ACAACGCCGA CATACCCTTT GAAAAGCAGC CTGCTCCTGG TTTCTACGAC GTCACGGAGG
AGCAGGCCAA GGTTCACGCT GCTCCTGTCG GTTCAACACT TCGGGCTCTC GAGGGAAAAC
GCAAGCAGGA GTTGGACGAA ATCGAGGAAA GGAAGAAGCG ACAGAAGAAG GGAGATGGCA
AGTCTAACCA AACGCAGCAA TTTGTGGCGG CCCGAGAAGC GCAGATCAAG AAACTTAAGG
AGCAAGAACA GATTATTAGG AGGAGGAAGC TAAATTTGCC GATACCTCAG GTTGGAGAGC
GAGAACTGGA AGATATTGTC AAGATCGGGC AAGCAGGAGA ATTAGCAAGG GAGCTGGTTG
GTGACGGCAA CAAGGCGACA GAGGGTTTGC TAGGCGAGTA TGAGGCTTTG GGTCAGGCTA
AGATGGCGAG GACACCAAGA ACAGCTCCTC AACGTAAGAT CATTATCTGT TTTTCAGTCG
TTGTGCGGTG CTAATGTTGA GTCAGAGGAC AATGTTATGG CCGAGGCCCG AAATCTCCGA
AATATGATGG CAGCACAAAC TCCTCTTTTG GGAGAAGAGA ACACTCCTCT ACACGGCCCT
TCTGTAGGCA CTGGATTCGA AGGGGCCACA CCTCGACACG ATGTTGCCGC GACCCCCAAT
CCTCTTGCAA CTTCGGCTCG AGGTGGTGTA CTCACTTCAA CTCGAACAGT CCCTGGTGTT
GGTACCACTC CCCTGCGGAC CCCTTTCAGA GATGATTTGA ACATCAACGA CGATGCGTCC
GTGTACGGCG AAACTCCCAT GAACGACAGG CGCCGCCTTG CCGAGTCTCG CCGAGCTTTG
AAGGCTGGCT TTGCGGCCTT GCCCAAACCT GAAAATAATT TTGAGCTTGC TGAGACAGAA
GAGGATGAAG AGGAGGCGGA AGAAGCGGAG CCTCTAACAG AGGAAGATGC TGCCGAGAGG
GATGCGAGAT TAAAGGCTGC TAGAGAGGAA GAGGAACGAC GCGAGCTTGA GAGGAGAAGT
ACTGTTATAA AGAAGGGTTT GCCTCGACCC GTCAACGTTA ACACATACAA GCTTCTCGAC
GATCTCAACT CTGCTATAGT TGAGCAGACC GACGAGGAGA TGGCCGCAGC GTTCAAGCTC
GTCAATCTTG AAGTCGCCAT GCTCATGAAG CACGACTCCA TCGCTCACCC TCTGCCTGGA
ACTTCTACCC CTGGTGGCCT GGCTTCTGAA TATGATATGC CAGAGGATGA CTTTGTTGCT
GAGGCCAAGA ATGCTATCCA CACAGAATTG GCTAACGCAT TGGGCTTGCC GGGTGCGAGC
GATGAACATT TACGCTTGGC AATTGGCGCA GCCGCCGAGG AAAACGAAGC TGCCTTTGCA
GAAGCGTGGG CCGAGGAACG CGAAGGTCTT GTCTACTCCC CTTCAACTCG AACTTGGGTT
GATAAAACCT CTCTTTCCCC AGAGGAGCTA TCCGCATGCT ACGCTGCGAT GATCAACGCT
TCTCGAGATC GCGTTATTGC CGAGGCTACC AAAGCCGCCA AAGCAGAGAA GAAGCTCGGT
AAGCAGCTGG GTGGTTACCA GACGCTCAAT GAGAAGGCAA AGAAAGCCAT TGTGGACGTC
ATGGAGGAGA TTCACCAGAC CAAACGGGAT ATGGAGACAT TCCTTATGCT TAAGGGCATA
GAAGAGGCTG CAGCCCCGGC CAGGTTGGAG AAGATTAGGG AAGAGGTTGC TGTTTTGAAG
AAGAGAGAGA GAGATCTGCA GGCTAGATAT GCAGAGTTGA ACGACAGGAG GAGGGAGAAC
CTCGCAGCTA TTGAACAGGT ACGTCAATGT CACTTCTTGT CGAAATTACT TAGACCAATG
ATATTTCACA GCTCGAGGAA GACAAGATCG TTCTCGCTGC TCAAGTGGCA TTGGAAGCTC
AAGAAGGAGA GGTTGCAGAT GGTGATGTTG ATATGAACGG GGCTTAGAAG TGCAAACATA
TTATTGGTAT ATCAATGCAT TGTTGTCATA TTGGTTGTGT G
 
Protein sequence
MRVIVKGGVW RNTEDEILKA AISKYGKNQW ARISSLLVRK TPKQCKARWY EWLDPSIKKV 
EWSKTEDEKL LHLAKLMPTQ WRTIAPIVGR TATQCLERYQ KLLDDAEARD NEELGLGAGE
DESSKPATDA RGLRPGEIDT DPETRPARPD PIDMDDDEKE MLSEARARLA NTQGKKAKRK
ARERQLEEAR RLAFLQKKRE LKAAGINLRA KPKKKGMDYN ADIPFEKQPA PGFYDVTEEQ
AKVHAAPVGS TLRALEGKRK QELDEIEERK KRQKKGDGKS NQTQQFVAAR EAQIKKLKEQ
EQIIRRRKLN LPIPQVGERE LEDIVKIGQA GELARELVGD GNKATEGLLG EYEALGQAKM
ARTPRTAPQQ DNVMAEARNL RNMMAAQTPL LGEENTPLHG PSVGTGFEGA TPRHDVAATP
NPLATSARGG VLTSTRTVPG VGTTPLRTPF RDDLNINDDA SVYGETPMND RRRLAESRRA
LKAGFAALPK PENNFELAET EEDEEEAEEA EPLTEEDAAE RDARLKAARE EEERRELERR
STVIKKGLPR PVNVNTYKLL DDLNSAIVEQ TDEEMAAAFK LVNLEVAMLM KHDSIAHPLP
GTSTPGGLAS EYDMPEDDFV AEAKNAIHTE LANALGLPGA SDEHLRLAIG AAAEENEAAF
AEAWAEEREG LVYSPSTRTW VDKTSLSPEE LSACYAAMIN ASRDRVIAEA TKAAKAEKKL
GKQLGGYQTL NEKAKKAIVD VMEEIHQTKR DMETFLMLKG IEEAAAPARL EKIREEVAVL
KKRERDLQAR YAELNDRRRE NLAAIEQLEE DKIVLAAQVA LEAQEGEVAD GDVDMNGA