Gene CNL05100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL05100 
Symbol 
ID3254999 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp430547 
End bp431802 
Gene Length1256 bp 
Protein Length286 aa 
Translation table 
GC content47% 
IMG OID638253983 
Productconserved hypothetical protein 
Protein accessionXP_568227 
Protein GI58261634 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTATATGAG CGCAAGTAGT ATCCAATGGG ATCCCTCAAA CGGAGGGCAG TGGACTACTG 
ATAGCAATGG GTTTATAATG CAGTAAGTTC TGGGGGTCAC TTGGTCCCTC AAAACTCCAG
TCCTTGCTAA TTCAAGCTCC TACCCTTTTT ACAGGATTGT CTGCTTGATT ATCGGCCCCA
CATTCTTTTC GGCAGCCAAC TATATCATTC TCGGAAGGTA TGTGCTTTCT TTCTGTGTGC
TTGGAGCCTT GGAGGGCTGA TGATGGAGAA TAACCAGAGT GGTACGCAGA ACTGGGTAAG
AATCTTAAAC CTATAAGTAT GATTGTGCCC TAACCACTTT TGCCCGATAC TCTGCAGGTC
CAATTACTCT TCCATCACAC CCTCGTCTTT CTCCGTCATG TTTACGATCA CGGACTTCTT
TTGTCTGTTG GTTCAATGTG CAGGAGGCGG CCTGGTTGGA ACCGCCGATA CCGACTCTGG
CATGCAAAAC GGGTTGTATG TGATGGCCGC TGGTGTGCTC GCTCAGCGTA AGGACCTTTT
CCTTTGCACT GCGCGGGCCT TTGAGCAGCG TACTGATGAT GACTATAGTT GCTGTCACTT
TGGCTTATAT CTTCATGCTC AGCGAATTTA TCTATCGCCA TGCCAGATCC AAGAAAGCAT
CTCGGCAGTA CGACTTACTT GCATGGGCCA AGATCTGCTG TTGTTGCTGC AGTAAACGCA
AGCGCCAAAG TGTCGACTCG AGCCACCGAA TGGACGATGC GAATAAGACA GAGGCGGGGT
TTACAGCTTC GAATGAGGGT GAACAAGAAG GTGAAGATAG GAAATTTGTC AATTTGGTCC
TCTGCACCTT GATAGTGGCT ACAGTCCTCA TCGTCGTCCG GTACGTACTT TGCCAATTTG
CTCTCGAGAC GCTCTTACAT TTGTTTTTCA CCCGCAGATC TGTCTACCGT TGCATTGAGA
TGCTCAACTT TCAGCCTAAT CACCCTGGTG CTTATGGGGA CCAAACACTT TTCTTAGTTT
TGGATTCTGC GTTCATGGTG AGTAGATGGA ATTAGCGAAA AGGGAAAGAT TGCGGCTGAT
GGGCTTCTTG AGCTTGCTCT TCTTATTGTG TACGCTTTAA TACACCCTGG GTGGATTTTG
GCTAGGACCT TAGGATTAAG CCGCCCTGTT GTATAGTGCT GGGATAGTTT GCGCAGGATT
AAAAACTGGT CACAGGTTTG ATGAAATTAG GATGGATTTA GACGCATTAA GCTAGA
 
Protein sequence
MSASSIQWDP SNGGQWTTDS NGFIMQIVCL IIGPTFFSAA NYIILGRVVR RTGSNYSSIT 
PSSFSVMFTI TDFFCLLVQC AGGGLVGTAD TDSGMQNGLY VMAAGVLAQL AVTLAYIFML
SEFIYRHARS KKASRQYDLL AWAKICCCCC SKRKRQSVDS SHRMDDANKT EAGFTASNEG
EQEGEDRKFV NLVLCTLIVA TVLIVVRYVL CQFALETLLH LFFTRRSVYR CIEMLNFQPN
HPGAYGDQTL FLVLDSAFML ALLIVYALIH PGWILARTLG LSRPVV