Gene CNC00010 details


Gene Information

Locus tag: CNC00010
Symbol:
ID: 3256733
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1984
End bp: 3809
Gene Length: 1826 bp
Protein Length: 570 aa
Translation table:
GC content: 52%
IMG OID: 638255221
Product: hypothetical protein
Protein accession: XP_569349
Protein GI: 58264386
COG category:
COG ID:
TIGRFAM ID:
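
The listed gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record; the function name `gene_length` is illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
length = gene_length(1984, 3809)
print(length)  # 1826

# A 570 aa protein needs 570 * 3 coding bases plus a 3-base stop codon.
coding_bp = 570 * 3 + 3
print(length - coding_bp)  # 113 bp not accounted for by the coding sequence
```

The 113 bp difference is consistent with the record marking the gene as spliced, since the gene span would then include intronic sequence.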


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGGTG ACGAGCAGAC GTTCGACGAC AGAATTTTTC GTCGCTATCT TGATCGCTTC 
ATTCAAGAGA ACTGCAACTA CATGGCGATG GCTACAGAGT TCTCACATCA AGCAAATTTG
AAAACGGTCT TCTACGACAG ATATGACAAA GAGTGGGCCA AGCTAGATCT GGTGGATATG
TACAGACGGC GGACGGGGAA AATTCTGGGA GCCGTGAGTC ATGGCTTGTG CAAAAGGTGA
CACAATGCTT ATTACGCATT CAACATAGCC CTCTCAATCG CAAGGCACCT CCAAATCAAA
AAAGGATGTT GCCACTACTC CGGACGCCTC TCTCATCGGC AGCGTCAACG ATCCTAGCTC
TCCATCGGGC CGGAAACGGA TGCTTTACGC AGTCATCGAA CTGAAGTGGA TGAACTTAGC
GGCTCTACTT ACCGGCGAGG CTAAAAGTCA AACCGACGAG AAAGCTCTTA GTTACCTCTG
CCAGGAAGGC GTGTTTCAAA CCATGTGGTA TGTCATCTTG GGCTACGCCA TTTCACGCTG
CATCTTCGGC CTCTCCATAG TCAACGAATA TTTCTATAGA ATTGTGTATC TCTATCAAGA
CTCGGCCTCA GACAGTCCCG TGCTTGCCCT GGAGGCAAGC AAAGAGTTCT TGGAGAACGC
CGGGCGACAT TTTGGATATC CGCGGGATGG TTACTCAGTC GAAGAACTTG CAGAGCTGCA
AGACTTTTGG TCGTCGCCTC CCAATTGTCT GATCAGCGAC CGTGCCAACG CCACTTTGAA
TAAAGAGACA AGGTATCACC TCGATGCGAC CATCCTCTTG TTCCTCGCTC ATGCAGCGGC
ACTTCCAACG CAACGCTTCC TCAACGACCT GCCCCTCCCT TTTGCTCATC ATGTTCCTGT
TGATGCGACC GCTCATTCAG CCACCGACAT GAGATTGAAA GGATTAGAAG TTGGGCGCAG
GCGACACAGT CTTCGTTCGA CCAAGAGGAA CAAGCGCACA TTGGCGGATT TGTATGATGA
AGAGAAAGGT GAAGAGGACA AACCAGGGGA CCAAGAGGAC AAGTCAGGGG ACCGCAAGCC
ACCTGGGAAG GATAATCGCA ACAATGATCC TAGGAACGAT GGTTCACAAG GGGGAAACTC
TGGCTCCGGA GGCGATAACT CTCGTGGCGG AGGCTCTGGT CCTGGGGGCG ATAACTTACA
TGACGGAGGC TCTGGTTCTG GCCGAGGACA TGAAGGCTCC GACTCTCGAG ATGTCGGCGG
ATATGAAGGT TCTGAGTCTA GGGATGACGG CAGAGACCCT GTTCGAGATG GCGCCGGAGG
CCCCTCTACT CATCGCGCTG AGGCTGCCGA CATTCGAACG CCTACGACAC GACAAGAGTT
CTTGAGAGGC CTCCAGAAGC TATCCACTCC TGGAAATATG TTAGACATGA AATCATCCAT
TATTGCCTCC CTCATCTCCA ATCCGCGTGA GCACCTTGTA ATCTTATGTT ACAGCCTGAA
GTTACGCTTA CGAATTTATT GTAGATACCA GTACTTCTGT GGCTCCCTCC TCTCCCGGAG
TTAACTCCAC CACGTCGTCG GAGGATGTTC TTTTCCCGGA TCTGTCATTG GACTCCAATG
GCAGAACTCT CGTCCATTGT GATTTGTCCA CCGATCCCCT CCCAAAAGTT CACAAGTCCA
CCCCTCTTGA TCTCGATCTC GAGGACATCG ACCCGGAAAC GGGGGAGCTT ACGTTGGCGG
CCTATAGGGA CCGCCTCGCA ATGCTGGGGG TACGGGTGAA GTTGGTAACA AGGGAGGAGA
TGGACGTCTT GCTGGCGCGG GGATGA
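
The gene sequence above is laid out in space-separated blocks of ten bases. A minimal sketch of how such a listing might be joined into one string and summarised (length and GC fraction) follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used here only as a short example.
example = "ATGATCGGTG ACGAGCAGAC"
seq = clean_sequence(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Applied to the full listing, the same two helpers could be used to compare the result with the Gene Length and GC content fields reported in the record.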
 
Protein sequence
MIGDEQTFDD RIFRRYLDRF IQENCNYMAM ATEFSHQANL KTVFYDRYDK EWAKLDLVDM 
YRRRTGKILG APSQSQGTSK SKKDVATTPD ASLIGSVNDP SSPSGRKRML YAVIELKWMN
LAALLTGEAK SQTDEKALSY LCQEGVFQTM WYVILGYAIS RCIFGLSIVN EYFYRIVYLY
QDSASDSPVL ALEASKEFLE NAGRHFGYPR DGYSVEELAE LQDFWSSPPN CLISDRANAT
LNKETRYHLD ATILLFLAHA AALPTQRFLN DLPLPFAHHV PVDATAHSAT DMRLKGLEVG
RRRHSLRSTK RNKRTLADLY DEEKGEEDKP GDQEDKSGDR KPPGKDNRNN DPRNDGSQGG
NSGSGGDNSR GGGSGPGGDN LHDGGSGSGR GHEGSDSRDV GGYEGSESRD DGRDPVRDGA
GGPSTHRAEA ADIRTPTTRQ EFLRGLQKLS TPGNMLDMKS SIIASLISNP HTSTSVAPSS
PGVNSTTSSE DVLFPDLSLD SNGRTLVHCD LSTDPLPKVH KSTPLDLDLE DIDPETGELT
LAAYRDRLAM LGVRVKLVTR EEMDVLLARG