Gene CNE03970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE03970 
Symbol 
ID3257627 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp1120958 
End bp1123863 
Gene Length2906 bp 
Protein Length512 aa 
Translation table 
GC content50% 
IMG OID638256980 
Productconserved hypothetical protein 
Protein accessionXP_571195 
Protein GI58268078 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGCTAACAT CCATATCCCT CCATACTCTT TCTTCCAACC TCTTTTATCC GCCACCGCTG 
CTTCTTCCTT CCTCACCGCT GCTTTCATAT ATAGACCGCC GCAACCCCTC GCCCACCCTC
CCCGCAGTTA GCCCCGTACC TTTCGCGGCC CCGGCCCTTA CAAATCGCAC TCGCACACTC
GCCCCATTTC GACATTGCGC CCGCTCTCCG CCGCCACGTC TCTCTATCCA CTGTCGCCTA
ACAGTAAACA TCTGGCCCAG TCTGCCTCGC ACAGGCCGCC AGTTTATATC AGAACCCATC
TTCTTGGCGT TCTTTTTCTT GGCACCACTG TAGCTCCGCA AATCGACATT CACGCGAGTA
CACCACGGCA AAGATGGGTC TTTTGAGTCA GCGAATGGAT TCTCTCTCCA TCAGAGGGAA
GACTTCCAGA CGCAGCTCCG TCGCGTCCCC TGCCGGCACC CCAGCCAGAG ACTCGGGCGA
GACTCAGATC GACGATGGCG ACCCCAACGA TACAAGCACT CTCGACCAAG AAGAAGGCAA
CATCCTGTTG TCCATTATCT CTCAATGTAA GCATTCGCAG CTACATATAC AGTGGTTTGC
TGACAAGCTG TATGTCAGTG CGACCTGGAA TGGACCTGTC CAAGATTGCC CTTCCCACTT
TCGTGCTTGA GCCCAGGAGT CTTTTGGAAA GGATCACTGA CTTTTTCTCA CATCCGGAGT
TGATCTTTGG GTGATTTCTT CGTATTCCAA CTTTTATACA TTTCGATACT GACGCTGTGG
GTTTTTCAGG GCCGGCAATG ACCCCGATGC AAAACAGAGA TACATCCGTG TGATGACATT
TTACCTGAGT GGCTGGCACA TTAAACCCAA GGGGTGAGTT GGATTCATTC ACATTTTTAC
CCATTTTTTG CTGATGATTG GTGTCTCTGC TAGCGTCAAA AAACCGTTCG TTATGTCCTC
CATTGTTACT CTCAGTCTTA CTGATCGTCT TGCAGGTACA ACCCTGTTCT TGGGGAATTT
TTCCGTTGCA CCTACGTTTA CCCTGATGGA TCGGAAGGAT TCTACATTGC CGAACAGGTC
TCACACCATC CTGTAAGTAC ATTGTCACCT TAGACACAAT TTTGTATCTA ACAAAAAGAA
CGTAGCCTGT ATCTGCATTC TTCTACATTA GTCCTAAGAA TGGGTTGCTT GTAACCGGAG
AGCTTAAGCC AAAAGTTTGT ATCTGTTAAC CGTCTTTAAT AGGAGATGAT GCTGACTGGG
CGGGTGACAG AGCAAGTTCC TTGGTAACAG TGCCGCTACT ATTATGGAGG GTGAAGACCG
GATACGGCTG TTGGACCGAC CCGAGGATGG GGAGTATTCC ATCACTGTAA GCTGTGGTCA
ATTCAATCAG CTGTAGCTCC ACGAGCTAAC ATTTGTCCTA GATGCCCAAC ACCTACGCAC
GAGGTATTCT CTTCGGCAAG ATGCTTTTGG AGCTCTGTGA CGTGTCTAGC ATTGCCAATG
CCAAGAATGA CTTCCACTGT GATGTCGACT TCAAGGCAAA GGTAAGGACG TCCTCGCCTG
ATCTACCAGT TTGGCTAATC CGCTGGATTC TTAGGGATGG ATATCGGGTG GTTACAACGT
AATTTCTGGC AAGGTCGTTG GTCCGGGCAG GTCGGACATT GGGGAGATTA GTGGCCACTG
GTCATCTGCT ATTGAGTTCA AAGACAAGGA CACCAAGGAG AAAAAGGTTC TTTTCGACCC
TTCTACTTCA CGTGTTGCGC CAAAGAAGGT TTTGCCCGAA TCTGAGCAGG AGGAGTATGA
GTCTAGGAGG TGTGTTTTCC TTCCCTTTTT TTTCTGCAAG AGAAGGCCTG CGGCTGATTT
CCGTCCTTGC TTAGGTTGTG GACTAAGCTC ACTGATGCTA TCCGAGCTGC CGACATGCAC
GGTGCAACCG CCGCCAAGTC CGCTGTAGAA GACCGCCAGC GAGAGCTTGC CAAGAAGCGA
GAATCCGCAG GCGAACCCGC TCACGAGCAA AGATTTTTCA AGCACGTTGC CGGGGACAAG
TGGATGCCCA AGTTGGACGT TGACAAGTGA GCCGACGCGT TCCTGTCCAT CTTTTCCAAG
TCGATAAGCT AAATGTCACC TTGATAGTTT GCCCAAGGAC CGAAATGAGA TGGAGGATGT
AGTTCGTAAA TGGATCTTTG GCGACAAGAA TCCCTTGGAC TACGAGAGTG TCAAGACGAC
TCCTCGAGGT TCCCAATCCA CGGAGTCTGG AGCTGCCCCT GTCGCCGCGG GTGAAGCTAC
TCCCGTCGCT GCGGGTGGAG CTGTTGCTGG CGGAGCTGCT GGAGCTGCCG GTGTCGCTGG
CGCTGATGTT AGTTCTTCCA ACACTAGTGT AGCCGAATCT ACAAGAAGTA CCTCTACAGA
GCCTCGTAAG TTTTATAGGC AAACCAACCC CTTTAGATGT ATAAAAACTG ACATGAGAAT
CAAAGCTGCA GTACCCACCG CACCAGGTCC TCCTGTCGGT CAACCAGTTG ACCATCCCGT
ACCTCCTAAA GCCTAAACCC TTCCCTAGAA TCTCATCAAA CTCCTATCTT TTTTCCCGCA
CGATACCCAA CACGCGCGCA AGTCTTACCT GGTGATGATG CCTGTAGAGT CGGTCCCCGC
GAGTAGAAAG CCGAGAAGTA ATTTTTTCAA AGAGGGTGGG TTAGTAATGA GGACTATATG
AATGACGCGG ACGAGCAGAA GGAAGACAGT TATTCAACCC CCAACTGTAG ACTATCAACT
TTTTAAACCT GCTTCATATC CAGTCTCCTT TTGCAGGTTT TGCCTTTATC TTTTGCTCTT
GTTTCTAGTA TTTTATTGCT TGGATTCATA AGTGTACGGC TGGGCTCTGA CGTCGAGAAC
GAGTTAACTG TGCTTTAATG ATACAC
 
Protein sequence
MGLLSQRMDS LSIRGKTSRR SSVASPAGTP ARDSGETQID DGDPNDTSTL DQEEGNILLS 
IISQLRPGMD LSKIALPTFV LEPRSLLERI TDFFSHPELI FGAGNDPDAK QRYIRVMTFY
LSGWHIKPKG VKKPYNPVLG EFFRCTYVYP DGSEGFYIAE QVSHHPPVSA FFYISPKNGL
LVTGELKPKS KFLGNSAATI MEGEDRIRLL DRPEDGEYSI TMPNTYARGI LFGKMLLELC
DVSSIANAKN DFHCDVDFKA KGWISGGYNV ISGKVVGPGR SDIGEISGHW SSAIEFKDKD
TKEKKVLFDP STSRVAPKKV LPESEQEEYE SRRLWTKLTD AIRAADMHGA TAAKSAVEDR
QRELAKKRES AGEPAHEQRF FKHVAGDKWM PKLDVDNLPK DRNEMEDVVR KWIFGDKNPL
DYESVKTTPR GSQSTESGAA PVAAGEATPV AAGGAVAGGA AGAAGVAGAD VSSSNTSVAE
STRSTSTEPP AVPTAPGPPV GQPVDHPVPP KA