Gene CNL04500 details

Gene Information

Locus tag: CNL04500
Symbol:
ID: 3254753
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 249294
End bp: 251510
Gene Length: 2217 bp
Protein Length: 702 aa
Translation table:
GC content: 54%
IMG OID: 638253921
Product: hypothetical protein
Protein accession: XP_567999
Protein GI: 58261178
COG category: [K] Transcription
COG ID: [COG5576] Homeodomain-containing transcription factor
TIGRFAM ID:
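
The gene length in this record follows from the start and end coordinates if both are read as 1-based and inclusive, which is an assumption here since the record does not state its coordinate convention. A minimal sketch of that arithmetic (the function name `span_length` is hypothetical, chosen for illustration):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(249294, 251510)
print(gene_length)  # 2217, matching the listed gene length

# 702 amino acids need 702 * 3 = 2106 coding bases, fewer than the
# 2217 bp span, so the span also contains non-coding sequence.
coding_bases = 702 * 3
print(gene_length - coding_bases)  # 111
```

The surplus over the coding bases is consistent with the record marking the gene as spliced, although this sketch does not locate introns or the stop codon.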


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCATT TCAGTCACGC CCAGCACATT CACGTCGTCT TGTTTTTTAT TATTATTATT 
GCCTTCATCT GTTTTCCTCA TAATACCATT CTTCCTGAGC CCACCATGTC CGTCTGCGGG
AACTCGCCCA CTCCCCCGCC ACCTCCGCCC CCCAACGCCA TCCACATGCA CCCGCCACAG
GTCACCCCAT CCGCCGCCAG CCACACGTCC CACCCCACAC ACCACACACC AGACGCCAGA
GCCCAACAGT ACTCTATACG TCGCGCGTAC TCAACCCCGT CCATTGCATT CCCGCTCCCA
CACCAAGCAC CTCCATCCTC TGCGCTTACA CATGCATCTT ACACGTCGAA CATGTCCGAG
AACATTACAC GTTACACCCC GGGTGGTACG CCCCAGTCAA GCGCGTCGCG TCCAGGCAAT
GGCAACAGCA AGTTTCCACC TAGCTTTGAC TCGCCCACAC CACATGGCGC GAGATACAAC
TCTGCAGTCA GGATGGATGG CATTGAGGGC GCTGAGATGC TCGAGTCTGG CGATCTTGCA
GGAATTCAGC CCCCAGCGAC CTTTCCCTTG CCCGAATACC CTGCATCTTT TATGCGCGCG
GAGCCGGTGC CCACGGAGGA AACGGAAGAG CTCCAAAGGT TTCTTCCCAA TGATGGTAGT
GTGAGAAAGT ATCGATATGG TGGCGCGTCC TGCAACCCAT GGGACTACAT GCTTGGTGAT
GTCCCTGATG CCGACTACGA CCACCCTTTG TCTTCACGAC CAGCCAGGTA CGGACCTGAC
GCTAAGCACG GATGTAAGGT TAGAAGACGG TTTACAAAGA GAGAGCTAGA GGCTCTCGAA
GTGCTCTGGA GTATTGCGAA AAGCCCCAGC AAGTATGAGA GGCAGAGGTT GGGCGCGTGG
TTGGGTGTAA AGACGAAACA TATCACGGTC TGGGTAAGTA GACATGTCCT ACCATACACA
TATTTTCATT GCTGACTATC TCACAGTTCC AGAACCGAAG GCAGGAAGAA AAGAGGTATT
CACGCGACGG TCATCATGAC GCTCCTCCTC CATCTCGATC CAACCGTGGC ACCTTTGACC
CTGTCACCGG TAAATGGCGT CCCGTACCCG CATCCTGCAT CTCCGGCCTC CAACCCCCAC
CCGATGATAA GATTGCAGTC GTCCGTGCCA TTAGTCTCGG TGACGTGACA CGTGATATGT
GGTTAAATAA GTATCCTTCG TCGTCTGGAC GCGGCATGAC TGCGTCGGCC AGGGTCAGTC
CCACACCCAT GAGCCTCGCG GCCACCTCAA GAGGAAGGAC TATCAAAAAT GCGCACACCA
CCCCTTTGCT GCCTCGTAAT CAATCAAGAT CTCTCGATCA AGTGCTCCAG GCGCGCGAGA
GCAGTTTTGG GACAGGTGCG CAGAAGCGAT ATCGCAGAGG CAGCGGTGAA TTTGTTATCA
AGGGAGAAGG TCAAGATAGG ATCAAAGAGA TCCTGTCGCT CATGCCAAGT GATCCACCCA
GCATGGGGCT TGCAGAGTCG GATGTCGAGG AAAGCGATGA GGATGATGGT GGGATTGATG
AGGATGTCGA GAAGAGGAAA AGGGCAAAGC AAGCCAAGGC ATCTAGCACC TTGGCAGGTC
TCGGCAGGGC AACACCTTAC GATGTCCTGG CCTCCAGCTC CAGGGCTAAG CTACTCGCAA
AACCCATCAG TGAGCACTCC AGCCGCAACC CTGTTCTTAG CCAACTCAAC CCAAATCTCT
CCCACTTTGC CCCGCCCTCC AACCTTCGAA AACACACCCT TGAATCTGTA GCTACAAACC
AACCTACAAA GCGACATCGT TCACGCGTCA CCGGTCATCC GAATCCCAAC TTTCATGCGG
GTTCTAGGAC CAAGGACTTT AACAGATCGG TTAGCACCTC TGCGCTCCCT CGTTCGTCGC
GAGTGGCGGA AGGCCAAGGA TCATCATCAT CATCATATCT TCCCTCACAA CTCAAAACAC
CGAACCTTGG GTACACGCGT TCCCATTCTG TCTCTTCCAG CAGTTCCCGG GTGATTACTC
CCGAAGATGT CAAGGATAAG CGAGAAGGCC CGCAAATGCG GGGGCAGGCT AGGGGGCAGG
AGAAGGATCA AGAGGTCATT GGTGCTGCGG AAATGTTGTT ACAGTTGTTT GGCGGGTCAT
AGATTGTTCA CGATAATAGT TATCACTGCT TATTTCTTAT ATGATCTTAA GGGAACC
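
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before the bases can be counted or a GC percentage computed. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGAATCATT TCAGTCACGC CCAGCACATT CACGTCGTCT TGTTTTTTAT TATTATTATT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60 bases on a full line
print(round(gc_percent(seq), 1))  # GC percentage of this line only
```

Passing the whole listing to `clean_sequence` in the same way gives a string whose length can be checked against the gene length in the record, and whose GC percentage can be compared with the listed GC content.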
 
Protein sequence
MNHFSHAQHI HVVLFFIIII AFICFPHNTI LPEPTMSVCG NSPTPPPPPP PNAIHMHPPQ 
VTPSAASHTS HPTHHTPDAR AQQYSIRRAY STPSIAFPLP HQAPPSSALT HASYTSNMSE
NITRYTPGGT PQSSASRPGN GNSKFPPSFD SPTPHGARYN SAVRMDGIEG AEMLESGDLA
GIQPPATFPL PEYPASFMRA EPVPTEETEE LQRFLPNDGS VRKYRYGGAS CNPWDYMLGD
VPDADYDHPL SSRPARYGPD AKHGCKVRRR FTKRELEALE VLWSIAKSPS KYERQRLGAW
LGVKTKHITV WFQNRRQEEK RYSRDGHHDA PPPSRSNRGT FDPVTGKWRP VPASCISGLQ
PPPDDKIAVV RAISLGDVTR DMWLNKYPSS SGRGMTASAR VSPTPMSLAA TSRGRTIKNA
HTTPLLPRNQ SRSLDQVLQA RESSFGTGAQ KRYRRGSGEF VIKGEGQDRI KEILSLMPSD
PPSMGLAESD VEESDEDDGG IDEDVEKRKR AKQAKASSTL AGLGRATPYD VLASSSRAKL
LAKPISEHSS RNPVLSQLNP NLSHFAPPSN LRKHTLESVA TNQPTKRHRS RVTGHPNPNF
HAGSRTKDFN RSVSTSALPR SSRVAEGQGS SSSSYLPSQL KTPNLGYTRS HSVSSSSSRV
ITPEDVKDKR EGPQMRGQAR GQEKDQEVIG AAEMLLQLFG GS