Gene CNL04100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04100 
Symbol 
ID3254756 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp119522 
End bp121099 
Gene Length1578 bp 
Protein Length350 aa 
Translation table 
GC content49% 
IMG OID638253882 
Productconserved hypothetical protein 
Protein accessionXP_567964 
Protein GI58261108 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATATA AAGGAAGAGC TTTGATTGAG AATGGCGACA TTCGCTTCGA CATAAAGCTC 
GCGTCGCCCG GCATGGAGCT GAGTTGGTTC TCATCCCAAC TCCTCATGAC GTAAGTACTT
TTCGTACTTA GCACAAGTAT AGTAGTATAC ATCTGACGCT CGTACGTATA ACTATTCGTA
TGGTCAGCTT CAAGGCCAAA CCACTCCTAA ACTCAACATG GTAAGATTCG CTTTTGCCCC
ATCCATTGTG CAATAGCTCG CTTACTATTG CGCTCACTCT AGTCCTTGGC GTATCTTGAA
CAAGATACAA CGCAAGGTAT CTTTTCTTGT CCTCGTTTAT ACTCTCTGCA GGCAACTAAC
TCCTCATTAG CTCTCGCTGA TTCCATATGG AACGTCAGCT GGAACACGGC CAACCAGCTC
GTTACAGCGT CTGCTGATGG CCATATCCGA GTGTGGGATG TTGAAGAGTT GAGACAACCG
GTCCACGATA TTGATTCACA TCCTCTTGCG ATTACGTCTT TGTCAGTGGC CGATAAAAAG
GCTTTAGCTT CCAGTCTGGA TGGCACTATA GTACTGGTAG ATACCGTGAA TGGGGAACAA
CTAGGCAAAG TTAATTCAGG ACGGGTGAAA GTATCGGCGG AAGGACCAGG TGAGCCACGT
TCTGGTTGCA GTGAGCACGC GTTTTTTGAG CTCATACATG CCGATAGAAA TCCCTGCTTT
TGCCTGTTCT CTGCATCCTC AAAGTGCTTG CTGGGCGTGG TCAGGGCGCT CCTCTAAAGT
TGTCATCCGA ACGATGGTCT CCGACGACGC CGACCCCACC ACACAAGGAC CACTTGGCGG
GGAAAGTAGC ATTATGGACG GTGGAAAAGG GAAATTTGGT ATGGACTTGC AATTTGTAAG
GTTCCAGGGC CTTATGTACC CCTGACATAC AGTGGATAAC CTTCCTGTAG TCCCCTGATG
GTCGCTCACT TGCCCTTGCT ACCGATCAAG GTCAGGTTGT GGTTTTCGAC ACCGAAACGC
GAGCTACTAT TGCAACCTAC ACTTCACATA ACAAGGCCGT CAGAACCATT GGCTGGTCTC
CTGATTCCCA AGTAGGTGGA TATGATAAAT ATACAATTCA TACAAGAATA AAAGTGACGC
TTGGATCTTC TAGTGGCTAT ACTCTGGTTC TGACGACCAT CTGATTGTCC TCTATGACGC
CCGAGCTGGC TCCCAATCCG GAGCTGGTGG CAAGGGTGAA GGGGCAGTGG CTATGATGCA
AGGTCATCAA AGCTGGGTAC TCAGCGTTGC GCCATCACCT GATGGTAGAC TCTTGGGTAG
CGGGTGAGTA AAAAAAAGCC GGTCAGTAAA ACAAAAGTGT TATACACTGA TGCGATCGTC
CCTAGTGGCG CGGATCATCT TATCAAACTT TGGGATATTG GACAGCGAAC CTGTGTTTCG
ACGTCATCGA GCAATGCGGA CGTCTGGGGA TTTGCGTGGC AGCCTGAAGG AGCCGGCACC
CTTCCCCCTG GGAAGCAATT TGCAGTGGCT GGAGATGATA AGGTCGTTAC GCTTTATAGA
GCTGCAGGAG CTGTATAG
 
Protein sequence
MGYKGRALIE NGDIRFDIKL ASPGMELSWF SSQLLMTWNT ANQLVTASAD GHIRVWDVEE 
LRQPVHDIDS HPLAITSLSV ADKKALASSL DGTIVLVDTV NGEQLGKVNS GRVKVSAEGP
EIPAFACSLH PQSACWAWSG RSSKVVIRTM VSDDADPTTQ GPLGGESSIM DGGKGKFGMD
LQFSPDGRSL ALATDQGQVV VFDTETRATI ATYTSHNKAV RTIGWSPDSQ WLYSGSDDHL
IVLYDARAGS QSGAGGKGEG AVAMMQGHQS WVLSVAPSPD GRLLGSGGAD HLIKLWDIGQ
RTCVSTSSSN ADVWGFAWQP EGAGTLPPGK QFAVAGDDKV VTLYRAAGAV