Gene CNL04200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04200 
Symbol 
ID3254732 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp159696 
End bp160910 
Gene Length1215 bp 
Protein Length366 aa 
Translation table 
GC content52% 
IMG OID638253891 
Producthypothetical protein 
Protein accessionXP_567973 
Protein GI58261126 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.290426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCCTA AAAGGCAGAA CGAGAACGAG AATCAAGCGT CTGAGGCGAC TCAGAAGAAG 
CAGCGAAACT CCAACAAGCA ATGGGCGACA GATACCAACA GCGATGGTGT CTCGGCAGAG
GAGCACGTTG CTGTATGGTT GTCAACCATT CCTGAGAATG GAAGGCTCCC CAACTTTCAT
AACTGGAAAA CTGGTCATGA GAAGAAGGAT TTTTGGTCGA GAAAGTGTCT ACAGTACCTT
ACTGACAATG GCTGCGACAG TCGACGCCAG TTTCAAAGTG TTGCTCTCAA AGTAAATATG
TCTTTTTCGT TTTTGGTATG CTCATGAGCG CTGATATGAT GAAAGATCAA TCAAATTGTC
GAGAGCTTCA CTAAGGCATC CCAGATAGGG ACCGGGACTG GTGCTGGGGC GATGGAGATC
GACGACGAGA GTCTCCTTGG TGTGTAAAAG TACTTTGGCA GGCAGCATGG GCACGTCGTG
GCTAACTGAA CATTAATAGC TCAACGAAAG AAGGTTTGCC CCTTCTACGA GATCCTTTTA
CCTGTCCTTG GCGACAGGGC CTCCGTCACT GCCCACCACG CCTCTTCGAC CCTCCACGCC
TCTCTCAATC GTCCCGACAG AGATCTCGCT GCCCTCGATG GTTTGATAGA GCGCCAAAGG
GGCGAAATGG CTGCTGATGA CTCGGAAGAT GAGTTGTCAG GAGAGGATGG TGGTCTCTTT
GGGGACGGTG CAGGTGAGGC TAGCGAGACC GAGTCAGAGG CAGATGCTCC AATCATTGCA
GCTATCCGCA GGGAGAGCCG CTCTTCGTCT CAACCTGCAC GGTCCTCATC TGTGCTGGGC
TCAGTGTCTA CACCAATCCG AGCCCAAACC CAGAGCCAGA GAGCATCCAT AAGGTCTTCC
AAGGCAAAGG CTGTCTCGGC AGATGACAAG ATGGATGAGC TGGTCATGAG GCAGGAGGGC
AACGACGATC GGCGGCATCA GGAGCTATTG GCCGTTCAAG AACGGAAAAT CTCCGTGCAG
GAGAAACATC ATGCAGACAT GATGAATATC GCGCAGGAGA ATGTGACGAT AGCACGGGAG
AATGCGGCGA CAGAAAAGAT GAAGATGTTG GCGGAGAGTT GGAACAGGAA GATGGAGATG
CTGATGAGGT CTGGGAAAAG TTGGGAGGAG GCGAAAGTTA TGGTGGGGCC TGAGCCTGGA
GCTCCCTCTC TATAA
 
Protein sequence
MPPKRQNENE NQASEATQKK QRNSNKQWAT DTNSDGVSAE EHVAVWLSTI PENGRLPNFH 
NWKTGHEKKD FWSRKCLQYL TDNGCDSRRQ FQSVALKINQ IVESFTKASQ IGTGTGAGAM
EIDDESLLAQ RKKVCPFYEI LLPVLGDRAS VTAHHASSTL HASLNRPDRD LAALDGLIER
QRGEMAADDS EDELSGEDGG LFGDGAGEAS ETESEADAPI IAAIRRESRS SSQPARSSSV
LGSVSTPIRA QTQSQRASIR SSKAKAVSAD DKMDELVMRQ EGNDDRRHQE LLAVQERKIS
VQEKHHADMM NIAQENVTIA RENAATEKMK MLAESWNRKM EMLMRSGKSW EEAKVMVGPE
PGAPSL