Gene CNI03130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI03130 
Symbol 
ID3259781 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp846868 
End bp848997 
Gene Length2130 bp 
Protein Length494 aa 
Translation table 
GC content47% 
IMG OID638258805 
Productconserved hypothetical protein 
Protein accessionXP_572639 
Protein GI58270966 
COG category[K] Transcription 
COG ID[COG5169] Heat shock transcription factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGAAG ATGAGGACAT CGCTAAGACA GGATTAATCT ATTGGTCTGC CAATGGCACA 
ACCTTTACGT GCCCTAATCC CACCGAGTTC TCAAAGTACG TCAATAGACT ATTGGAGTTC
ATTTTTACTA CCGAGATGCC TTGGATACAG CTGACATAAT TTTTTTCTTT TTTGCAGAGT
GGTTTTGCCT CGATATTTCA AGCACAATAA TTGGCAAAGC TTTGTACGCC AATTAAACAT
GTACTCGTAT GTATTACCAC ATCGCGATAA CCTTGTCGGT GGCTGCATAT GCTTATCATG
ATATTCCTTA CGGCAGGTTT AATAAAGTTC GTCTTTTTCT CCGTTAGCGT CCGATTTCGC
ATCTTTTACC TTTCGCCGTG CATCTAATCC TAAGCCTATT AGGTCAACGA TATATATTCA
ACTTCCACCG ATCCTCAAGC TTGGGAATTC AGACACAGCC TTTTTCGCCG TGGCGAAGCG
CATCTGCTTC CCAGCATTAA GAGGAAATCG TCCCGACCCA GTGCCCCCGA TGGCTCTAAT
ACTGTCACGT CACCCACTGA TGAACTGCCT CCTAGTACTT CTAACCCGAT TAAACCTGTG
GCAGGCTGGA TGAGAGATGC TGTACCAGTC CCTTATCGCA TGCCATCTCC TCCTCATATC
CATGGCCAAC CTCAAAGGTC AGCCACCTAT CCTTACAGCG ATGGGTTTGC TACTCGCAAA
GACGATGGTC GCTCTCCAAC GCGTGGTATG GCTTGGGATC CACTTCCCGC AGTTCAACGC
ATGCCCCCGC CTCCAGATAA TCAGATCCCA ATACGTTATC ATCCAGATCC TAATCGTCCG
GTTCTTACAA CACAGCGCTT CTATCACCCT GGATATCCAG AATCACCTCT ATATGGGCCG
GCGCATTCGC CCAGTGCAGA AACTCTTCTC AACCAAATGT CTGTCTTGGA AGACAAGGTG
CAAAAGCTCA CAGACGTGCT GAATAATGAT CGTATTGAGC ATGTGCGAAA CAACCTCGAC
TTCACGAGCT ACCTCTTACA GATGATTGGA TGGGCTGCAG GTGATCAGCG TAAGTCCTCT
GTGGCAATGT ATCCCGGAAT GCACATGTTA ACGAGAGTAT GCGATCAGAT GCTTCACCCG
AACTGCGGGC GCTACAAGAT ACTCTGAGTC GCCAAAACGC CGATATGCGC CATAAGTATG
AAGCGTTCAT GGCTTCTGAC GCGTTAGCTA TCATGGCGAG TGGTGGGGGA CGAGAGCGCT
CCGATAGTAG GGACAGTACT CGTGAGCGAA ACGCTCGCCT GGGCTGTAAG TCTCCCATCA
TTTCTATCAC CTTGTCCCAT CTGATGTGCC GTTGTAGTTG AGATACCGCC TTTTCCAGGA
CATCCTCACC CTTCTATTAC GGACCTTCGC TTGCCTCAAA CCGCCCAGTC TAGTTCATCA
GCTATCTTAT CCCAACGTGC AGCTCCTCGA ACTTCTCCTC GAAACTCTAC GTCCTCTGAT
CTTCTATTGA CACGGCCTTC AACTAGTGAG TCTATAAGAG AAAGGGAGAT ATACCCAACC
TACTTCCCTC CTCAAACCTC TCATGGTATT GGACCCGCTT CGTCAATCCC TTCGAGAGGT
CCGGAAACTA TAACTCCATC TCTATATGGT GGCGGACCAT CAGTACCTCC TCCGCTCTAC
AGGCCGGCAC CCATCGTTGA AAAGCACCGA GAGATTGAGA AAGGAGAGGA ACACAAAGAT
ATGAATGGGC CTTCCACGGT GGAGCTAGGT AGAGAAGAGC GGGATAACGA AGACTCGCGT
ATGACGATGG CAGGGGAAGA AGCAGAGAGC AAAACGGGAT TACGAAACCT CCTCAATTGA
CGTTATCTGC CTGGTGAACA GCGCTCCCAA GAATAATAAT TTCAGAAAGG AACATTGTGT
CGAATAGCGA AGATGGGATG AAGGAAGGAA AGAAGGTTAC ATCTAAAAAG GAAGATCTTG
GCCAGCTCGT TTGTGTGTGC TCTTGATAGT TGCGAAACGG GCATTTATAT AAACCATATC
TTATCAGGCA TCTTGAGTCA TTTTACTTGC CATTCACCAC AGCTAATTCA AATAAGTTTA
CGTCAACTGC ATTGTTATTA TTATTATAGT
 
Protein sequence
MLEDEDIAKT GLIYWSANGT TFTCPNPTEF SKVVLPRYFK HNNWQSFVRQ LNMYSYVNDI 
YSTSTDPQAW EFRHSLFRRG EAHLLPSIKR KSSRPSAPDG SNTVTSPTDE LPPSTSNPIK
PVAGWMRDAV PVPYRMPSPP HIHGQPQRSA TYPYSDGFAT RKDDGRSPTR GMAWDPLPAV
QRMPPPPDNQ IPIRYHPDPN RPVLTTQRFY HPGYPESPLY GPAHSPSAET LLNQMSVLED
KVQKLTDVLN NDRIEHVRNN LDFTSYLLQM IGWAAGDQHT LSRQNADMRH KYEAFMASDA
LAIMASGGGR ERSDSRDSTR ERNARLGFEI PPFPGHPHPS ITDLRLPQTA QSSSSAILSQ
RAAPRTSPRN STSSDLLLTR PSTSESIRER EIYPTYFPPQ TSHGIGPASS IPSRGPETIT
PSLYGGGPSV PPPLYRPAPI VEKHREIEKG EEHKDMNGPS TVELGREERD NEDSRMTMAG
EEAESKTGLR NLLN