Gene CNL04810 details

Gene Information

Locus tag: CNL04810
Symbol:
ID: 3254863
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 345709
End bp: 347942
Gene Length: 2234 bp
Protein Length: 628 aa
Translation table:
GC content: 50%
IMG OID: 638253952
Product: hypothetical protein
Protein accession: XP_568022
Protein GI: 58261224
COG category:
COG ID:
TIGRFAM ID:
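
The Gene Length value above is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the check, using a hypothetical helper name `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Gene Information fields of this record.
start_bp, end_bp = 345709, 347942
print(gene_length(start_bp, end_bp))  # 2234, matching the Gene Length field
```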


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATCCGTTTCA TTTCTGTCCA TATCCTCAGC TCCGACACTT TGATGCCTGC ATAGGCCGGC 
CTTCTCAACA CACTGAAATC CATTCAGTAG TGTTTTGTGT CGTTGCACCG CAAGATCCTT
ATTATTTGGC TTGACGATAG CAGCTATGCG CGGTATCCCA GTCTTTGTTG CCATTTTGAT
TGGCCTGGCT TGTTCTTTCG TGCAATCGCT GGGTGAGTAA GAAGAGATAC TGGGAAATGC
TCTCAAGTAG GCTAAACTTC CCGATTTGCT TCAGGCCTTA CTATTCAGCG TAAATCTCAC
ATTCAGGAAG ATCTTTTACC CTTGCCTGCT CGTCGACCAG CAATACGGCG CCCGCTATGG
CTTATCGGCT TCATCATATA TATGACTTCC AATGTCTTTG CCACGCTCTT CCAGCTGGAC
GCGCTACCCA TTGTTATCCT GGCACCCTTG GGGGCCATCT CGCTGGTCTT CAATGCCCTA
TTTGCCCATG TGCTTTTGGG CGACAAGTTT GGTATGAGTT GGGTGGTCGG CACAGCGTTA
GTAGCTGGGG GGGCGGTTAT GATTGCTGTA TTTGGCGTAG TTCCTGATGA GGAACACGGC
CTGGACGAGC TACTGTTACT GCTCAAAAGA GGCCCATTTG TAGCTTTCTT TACAATCGTT
TTGATTGCGG TCGCAGCGGT CTTGGCAGCG GTAAGTAAGG GCATTGGTTC AGTGTCATGG
TTCTGACTAT TGACTCCAGG CCCACATTGC AAGCTGGCAC GTCTACCGGC ATCAGTCGAA
CCAGATCTCC TTGTCCGGTG CATCAAGTCC TGTCTCCGTA CCAAGCAATT ACGCTTCTCC
CCGCTCGACT ATTGCTATTC CTTTCCGCTC CACAAACCCT CGACATCGAT CAACTTCCAA
CTCTGATGAC CATGAAGATG AATACGCGAA GCCTAACCCT CGCCTTACTT TGAAAAGCCC
GTCTGTGCAG AAACCCCTTT CACTTTCTAT TTCACCTCGC ATTCAAGAGG TGTCGCCCAA
ATCCACCGCC GAATCTTCCC TCACTCACAC CCTTACCCTC TGCGGCCTTG CTTTTGCCGC
TGCGGCAGGT ACACTCTCCG GATTATGCCT TGTTCTGGCG AAGGCAACAG TTGAACTATT
CATGAAAACC GTCGATCATT GGCGAACAGG CGTCGGTCGT AATGAATTTG CAAGACCTCA
AACATGGGCC CTGATGTTGG GGATGAGCGT CATTGCCGTC GCGCAGATCT GGTATCTGCA
TCACAGCTTG AAATTTACAG GGCCGGCGTT AGTGTGTCCG TTGGCATTCT GTTTCTTCAA
TTTATCCTCG ATTTTTGGTA AGTGTTGTAA CCTCAAGACG TGCAGGGCAA AAATTAATAT
GGAGCAGATG GACTGATCTT TTATAACCAA TTCGGGCAAC TCGCAACTTA TCAAATTCTT
CTTGTCTCTT TTGGCACTGC CATTCTTCTG CTCGGTGTAT GGATTGTCTC GGCTATTCAA
CCTGAAGGCA GTGTAGAAGT GGGTACATGG GCTGAGGAAG ACGTGATGTC GGATAGCACT
TCCATCTCGT CACATGAGAG CAGAGACGAA GTGGAAGCTG GGGAAGGAGC GACTTTGCTG
GGCCGAGGGA CTACAGAAGA ACCAGAAGAC GATGAAAGTC TCTGGCCGCT TCCCCATCAT
GACATAGGGA CTGTCTCTGT ACCCCCCACA CCAACTGGGC ACTATCAACC AGTCTTTGCT
CCATCGCCAT CTTCTCCTCA CTCGCCCATG TCTCCTCGTT CCTTCCATAC CCATTCTCAT
TCACTTTCAC ATGCTCATCA TCACCACCAT AGAGGCCCCA GATATGGGTC TCTTCTCCCC
GACATTGGTC CTCATGCTGC GCCCATGGGC TTCTCCATTG GCTTAGGAGC GGCGAGTCCA
GGATTCGCCT TGCGATCGGG AAGTATGAGT GAGCATCATC ATGGGGAAAG GGAAAGAAAT
CGAAGAAGCC GAAGTGACGG GCCTCTCGGT CTCGGAGCGA TCATACGTGG GGAGGATAAT
AGCTCTTTGC AGACCGATGT GGAAGAAGGT CGTGGCGAGA ACGCAACAGA AGTTGCTTTG
AGAGATTGGG ACGTCGGTGA GAGAAGAAGG AACAACTGGT GGGATGTGAG ACGCTTCTTT
GAGAGTCAAG GGAATATTCG TTTGACAAAA TAGAAAATTT AAAGTGTACT ATTATTATTA
TCATTACATG TAGT
 
Protein sequence
MRGIPVFVAI LIGLACSFVQ SLGLTIQRKS HIQEDLLPLP ARRPAIRRPL WLIGFIIYMT 
SNVFATLFQL DALPIVILAP LGAISLVFNA LFAHVLLGDK FGMSWVVGTA LVAGGAVMIA
VFGVVPDEEH GLDELLLLLK RGPFVAFFTI VLIAVAAVLA AAHIASWHVY RHQSNQISLS
GASSPVSVPS NYASPRSTIA IPFRSTNPRH RSTSNSDDHE DEYAKPNPRL TLKSPSVQKP
LSLSISPRIQ EVSPKSTAES SLTHTLTLCG LAFAAAAGTL SGLCLVLAKA TVELFMKTVD
HWRTGVGRNE FARPQTWALM LGMSVIAVAQ IWYLHHSLKF TGPALVCPLA FCFFNLSSIF
DGLIFYNQFG QLATYQILLV SFGTAILLLG VWIVSAIQPE GSVEVGTWAE EDVMSDSTSI
SSHESRDEVE AGEGATLLGR GTTEEPEDDE SLWPLPHHDI GTVSVPPTPT GHYQPVFAPS
PSSPHSPMSP RSFHTHSHSL SHAHHHHHRG PRYGSLLPDI GPHAAPMGFS IGLGAASPGF
ALRSGSMSEH HHGERERNRR SRSDGPLGLG AIIRGEDNSS LQTDVEEGRG ENATEVALRD
WDVGERRRNN WWDVRRFFES QGNIRLTK
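
Both sequences are laid out in space-separated blocks of ten characters, six blocks per line, so the whitespace has to be stripped before the text is used in other tools. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last protein lines are embedded here to keep the example short.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string with no whitespace."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C characters in a DNA string (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the protein sequence, copied from this record.
first_protein_line = "MRGIPVFVAI LIGLACSFVQ SLGLTIQRKS HIQEDLLPLP ARRPAIRRPL WLIGFIIYMT "
last_protein_line = "WDVGERRRNN WWDVRRFFES QGNIRLTK"

print(len(clean_sequence(first_protein_line)))  # 60
print(len(clean_sequence(last_protein_line)))   # 28
```

The protein sequence occupies ten full lines of 60 residues plus a final line of 28, which gives 628 residues and agrees with the Protein Length field. Passing the full gene sequence to `gc_percent` would give a value to compare against the rounded GC content listed in the Gene Information fields.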