Gene CNL04360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04360 
Symbol 
ID3254862 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp209346 
End bp211713 
Gene Length2368 bp 
Protein Length718 aa 
Translation table 
GC content57% 
IMG OID638253907 
Producthypothetical protein 
Protein accessionXP_567987 
Protein GI58261154 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0138131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGCAAGTACA ACTATGCCTC CCCCGACTGC TCCGCCCGCA TCCACTCGTC CTCCCCGCAG 
ACACAGCACG CCGCGTCCCT GCTCCACAAG TCCCGCGACA GGTACATGCT CACCCCCTGC
AGAGCCGACG AGCACTGGGT CGTCGTCGAG CTCTGTGACG AGATCCGCAT AGAGGCGGTC
GAGATTGCCG TCTGGGAGTT TTTCAGTGGC GTCGTCCGGG AAGTGCGGGT GAGCGTCGGC
GGCGAGGACG AGGAGGACGA TGCGGAAGAG CCCGGCCAGG ATGATGTCGC TGGTAGAGGG
CACAGATGGA AACAGGTCGG ATCGTTTATT GGCAAGAACG TCAGAGGGTC CCAGGTGAGC
TCCTGTTGAC TTTTGCAACC CCGCTGAACG CTTCGATCAG ACGTTTAGCC TCTCACAACC
GACCTCGTTC CACCGATTCA TCCGCCTCGA CTTTCCCTCG TACTTTGGCT CGGAATACTA
TTGCCCCGTA TCGTCCCTCA AGGTATACGG CATGAACCAG ATGGAAGCGT TCAAATGGGA
ACAGAAACAG TTGAGCGCCG TGGCCAAGGA CAGGGACAGG ACTGGGAATA GAGAACACGA
AGAAGAAGAG CGCCGTGCAA AGGAAAGGCG GGAAAGGGAA AAGAAGGAGA GGGATGAGAG
GGATAAGCAA GAGCAGAGGG AGCGCGAGCT CGATGAACTG GAAAAGCTTT TGCATGAGCA
AGCGGGAAGA CTTGTACCCG AACTCTTGAC TGAATCCGGC TTGTTTTCCA GTATCGACGA
GACTGCGCCT ACCAACGTAC CCACTGTCGT CTCCAAGCGT GATGGGGATT CCGATTCGCC
ACCGACCAAC GAGTCCATGG CTACATCCTT GATCGAATCC ACCTCGATCG AATCCACCTC
GATCGAATCG CCCACCTCGA TCGAATCGCC CTCCACATCA TACACCCGTG CCGTACCGCC
TCGCTCCGAC TCCTCTGAAT CCATCTACGC ATTCATCATC CGCCGACTCA ACGCTTTGGA
AGGCAACTCT TCCTTGGTAG CACGTTATAT CGAGGAACAA GCCAAAGTCA TGCGATCCAT
GCTGAAACAG GTACAAGTTG GGTGGGACGA ATGGAAGGGT GAGTGGGAAG ACGAAGATCG
TGGGAGGTGG CAGCAAGAGG TGCGTCCTGC TAGCCTACTT ATGTGAAGTC ATTTTGCTCA
ACTCTTGACT TTTGATTAGC GGATGAGGCA AGAAGACCGG CTGGGACGGG TACTCTCCCA
ACTGGAACAG CAGCGCATCG CTTTCGACGC TGAACGAAAA GCCATCGAGA CGCAGCTCCG
TGTCCTCGCC GACCAGCTCG GCTACGAACG CCGCCGCGGG ATCGCCCAGC TCATCATCAT
GGTCGTCATC ATCCTCCTCG GCGCCGCCAG CCGCAGCAGC ACCATGGACG CCATCCTTAC
CCCGCTCTTG AAGGAAGCAC GCCGGCGCCG AAGCGATTAT TATCATCGGA AAAGTCTATC
GGGCCCCCTT GCCGGTTTAC ATATCGACAT GGGCGCGGGG AGACCCCCGG CCATCATCGG
CCAAGCGCGT CCGACATCCA CCACACCCAG CGCCCATCCC CATCGCCATT CCTCGTCGAC
ACCCACCCCG CGCTTGAAAA CATCCCTATC CCGAGCTGGA TCAGGCCACC GGTCCAATAC
TTCACTCAAA CGACGTGGTA TCGTACCCCA GGTCCCTCCC TCTTACCGCT CCGTCTCTTC
TTCCGAATTC ACGTTTTCCC CACTCTCACA CCTTCCCCCG ACGTCCTCCC CCTCCCCGGC
CAATATCCCA AACCCAAACC CAAACCCAAG AAACGTGAGA GTATCTTTCC CGCCTCCGAG
GCAAACACCA CCACCACCAT CCGTCTCGTC CCGTAAACTC GCACAGAGTG CCCATTTGCA
CCACTTGCAT ACAACGGCGG CGGCGGCGGC GGCGCGGGAG GATACGGAGC GTGGCATCAC
AGCAAGTATG AGGCGTCGAC GGATGAGGTC GTCATTGGTG AACGATGACA ATGAACAGCA
GACCACTGTT TCTGGGCTCG GATCTGGAAA GGCGGATGCT GGTGGCGGCG GCGGTGGCGG
CGGTGGTGAA GAGGCGGAGA GGGTGGTGGG AGCGGAAGAT AATAGTCAAG GAGAGTGGGG
GACGGATGAT TTCGATACAG AGGCGGATGA TTTCGATACA GAGGCAGAGG CAGAAGCAGA
AGTGTCAAAA GTTGAAGATC AAGTGAGGGA TAAGAAAGAT TCAGAAACGG ATAGGAAAGA
ACAAGATCAG TTGGGGGAGA CAGAGCAGCA GCCTGTGCGA GAAAAGCGGG GTGTACAAGG
AGAACATGTA GGATTAGCAA GAGCATAA
 
Protein sequence
MLTPCRADEH WVVVELCDEI RIEAVEIAVW EFFSGVVREV RVSVGGEDEE DDAEEPGQDD 
VAGRGHRWKQ VGSFIGKNVR GSQTFSLSQP TSFHRFIRLD FPSYFGSEYY CPVSSLKVYG
MNQMEAFKWE QKQLSAVAKD RDRTGNREHE EEERRAKERR EREKKERDER DKQEQREREL
DELEKLLHEQ AGRLVPELLT ESGLFSSIDE TAPTNVPTVV SKRDGDSDSP PTNESMATSL
IESTSIESTS IESPTSIESP STSYTRAVPP RSDSSESIYA FIIRRLNALE GNSSLVARYI
EEQAKVMRSM LKQVQVGWDE WKGEWEDEDR GRWQQERMRQ EDRLGRVLSQ LEQQRIAFDA
ERKAIETQLR VLADQLGYER RRGIAQLIIM VVIILLGAAS RSSTMDAILT PLLKEARRRR
SDYYHRKSLS GPLAGLHIDM GAGRPPAIIG QARPTSTTPS AHPHRHSSST PTPRLKTSLS
RAGSGHRSNT SLKRRGIVPQ VPPSYRSVSS SEFTFSPLSH LPPTSSPSPA NIPNPNPNPR
NVRVSFPPPR QTPPPPSVSS RKLAQSAHLH HLHTTAAAAA AREDTERGIT ASMRRRRMRS
SLVNDDNEQQ TTVSGLGSGK ADAGGGGGGG GGEEAERVVG AEDNSQGEWG TDDFDTEADD
FDTEAEAEAE VSKVEDQVRD KKDSETDRKE QDQLGETEQQ PVREKRGVQG EHVGLARA