Gene CNE00050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE00050 
Symbol 
ID3257894 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp6398 
End bp9319 
Gene Length2922 bp 
Protein Length666 aa 
Translation table 
GC content49% 
IMG OID638256586 
Productexpressed protein 
Protein accessionXP_571077 
Protein GI58267842 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAGATACTTA TTGCAACACT CAATCATCAA CAACGCTTCC ACGATCTCAG TTATTGGTCC 
GACTGCTAGA AATTTCGCCA TACTCTAATC TTTATCCTAT TCTCAAATTT CGTTTTGAAG
ATGACCCCCG CATACGAACG AGAAGAAACA ATGAACTCAC CCTATTCGTC GTCCCACTAT
CTCGGTGTGA GCGAGGGTGG AAGTAGCGGG GATTCACAAA TAAACAACGT CAGGTCTGAT
GCCGCTCAGA CAAGAAATCC ACTTCCTCGA CAAGAGACCA ATGCGACCTT GCCTCCTTAC
GCTAGCTCGA CCGCAATCCC TGCCTACATC TCCAACAGCT GCCAAGAAGC ATGTTCAGTG
GGCCGCGCAA AAGGCTATTT GCTCTTGCTT TCGGCTTTTT CTAAAGTACT CGATCAAGTC
CGCAGGACGA GCTCGTGTAT TAGCTTGGAA CATTCCGTGA GCGAGGTTTC ACCCGTACAA
GCTGTAAACG GCTCTACATC ATGGTCGTCA GATTGCGAAG GACCGTTGTC TGAAGAATGC
AGGGTGGCTT TGTTTGTGGA AATGGCAGTC CGAAGGTTTG GCTACTGGCA GGAGCGGCTG
CTAGCACTGA AAAGTACAGC ATTACCCCCT CTGGATGTCC TACTAGTGTG GGCGACTTAT
TTGGGTTCCC CCATTTGGTA AGTAGTCTTA TCTTATGTTT CTTTCCTTAT GATTTTGACC
CTGCCCCTCT TCAGGTATCA TGATGACAGC GTTCAGGGGG GTACCACCCA TCCTCAATCT
GCATTCCCTG ATTTTCCACT TGAATCAGTT GTAAGTGAAC TCAGTCTATA CGCAGGTGCT
CACACATCAA TTTGGATAAC AGATTGCATG CATCGATCCT GACACCCTAG ATTATCAACC
GGTGGATCCT GGAGAGTGGA AAGCGGCTAC TGGAACACCC TTCGATCCGA TTGTTCACTT
TGAATCATCA ACCAGTGCGG CCGTAAATTG TCCCGGGTGC CGAACAAAAT TTTCGTGGCG
TGAGTAGGGC TGCAAAGTCG CGATAAACAC CGCAACTAAC AGGTTCAACC AACAGCTTGG
ATCTTAGAGG GAGGGAAGGG ATATGCACAA TGCCTTTTTG TCGCCGAGTG TCCCACAACA
AACTGTCGAC TACGAATTGA CAAAGAGGCT ATGGGAGTTG GACGTCTTGC ACGTCGGATC
GCAGACATGT ATGATAGGTC TAATTGCCAA CTTGCGTGAG TTCTCTGCCA GATATTGAAC
CCAACTCACC ATTGACGGGC CAGCACTCGG ATACTCACCG GTGGGCTGTT AAATCATCCG
CTAGATATAC CTTTTAAGTC CATACAAACG TCAGAAGACA AGTCTAAAAG TTTATGTAAA
CACTGGCGGT GGCGTCGATC GTACGCATTT GCATTCTTAG TGGAACAGCT ACCTCGCGTA
GACAAGAGAG AAATCAGTAG ATTGTCAAGC GCCTTTGTGG GAGCTGAGGA AGCGAGCATT
GATCTTGCAC CCGCGGTGAG ATCTGGCATT CCTCGAAATA ACAGTAGAAA GGACAGTAGC
TGATAAAACT CGCCGGTGCC AGACCTTTCG CCATATCGGT TTCATCGATA CGCTCAAGCA
CCTTGGATGG TTGAATCCTT CAACATGGGT TGATAGGTCA GAGGCTTTGT ACACGGCAAG
GTTGCGTTAC GCCAAGTGAG TCAAGTCAAC CTGGAATCTG CAACGCATTA TCATGAAATA
ACATTGATCC TCCAGATTTA TGAACCTCGC TCGATCCCCA AATCTCATAC CAGTCCCGAC
ACTAGGCATC GAGATCATCT GGCAAACCCA CATGCTAGCC TCCACTACCT ACCGGTCAGT
GAACGCATTA TATGCGCGTT GCTGGAGAGG CTGATCATAT TACATAGTCG GGAAACGCAA
ATGCTTGTCG GCCGCGTTGT CGATCGTGAT GAAGCCGTAG AAGAGTCCGT CTTGGCGGAA
GCTTTCACAG AAACCGCTGA AAAATGGAAG GTGAGGGTTT TGTCTGATGG CCATAGCTAA
ATCTACCCAG GCCTCTTTTC ACGTCCCTTA TACCACATCC GGCATGCTCT TACCTCTGCC
GCTATCAGGC CTTGCCGCGA AAGTAGCGTT GAAGCTGCAT TGGCGACCTC GTCCTCGGCG
ACTATCCCTT GATAGGAAGG TCGATTATGA CAATCCCATC GTCTATGCAG TTGTCGATTC
TACACTCTCT CTGCCTTGTG AAAGGAATTC AATTGCGTTA ATTGACGACG ATAAGGCGCG
TACGAAGAGG GCACTCAGAC GGGAGGAATA CGAATTGTAC AGAGTCGCAG CGCTTCAATG
GGCTGATAAG GGCAAGGTCG ACCCGGCATT GGCGGAAGGT CTAAGAAATC TCACGCCAGC
ATTTATGACG ACCACGCTCG AGGATGCAAA GTGCGGGCCT GGAGGATTTC CTTGTTCGCC
ATCATATTTG ACAATTTCGG GCAATCATAT TAGCGGCAAA CTTATTTCGT CCACCTCAAA
TGGCATGGGT ATTGTAGGGT GAGTATGGTT TCTGTTCTTT CCCTACATGC AGGATCGCGA
AGCTAATAGT GATAAACTGC AGCGCGGGGT TGTGGAGAGG GAACGATGCT GAGCTGGCCA
AAATGCAACC ATCAACGTCA TTGTAGGGGT ACCCAAACAG GCTGGAAGAC TGATCATCCA
CATCCGCCTA ACACGGACGC TTCGTTCATA TCAGCTTGAA GAATAGTAGC ATGGTCCCGG
CTCATGAGTA CATATTGGTT GCACGTCTGC GGATTTTACT TTTAGATTTT CTTAAGGGTC
GGTTGCAGAG GTTGTGGAAA AGAAACTACA GGAGTGTTTG CACGGTTTTC TTCAAAGCCT
GGGTTTCAGG TATCCCAAGG GGTTCTGCAG CTTTTCTCCA GC
 
Protein sequence
MTPAYEREET MNSPYSSSHY LGVSEGGSSG DSQINNVRSD AAQTRNPLPR QETNATLPPY 
ASSTAIPAYI SNSCQEACSV GRAKGYLLLL SAFSKVLDQV RRTSSCISLE HSVSEVSPVQ
AVNGSTSWSS DCEGPLSEEC RVALFVEMAV RRFGYWQERL LALKSTALPP LDVLLVWATY
LGSPIWYHDD SVQGGTTHPQ SAFPDFPLES VIACIDPDTL DYQPVDPGEW KAATGTPFDP
IVHFESSTSA AVNCPGCRTK FSWPWILEGG KGYAQCLFVA ECPTTNCRLR IDKEAMGVGR
LARRIADMYD RSNCQLATRI LTGGLLNHPL DIPFKSIQTS EDKSKSLCKH WRWHKREISR
LSSAFVGAEE ASIDLAPATF RHIGFIDTLK HLGWLNPSTW VDRSEALYTA RLRYAKFMNL
ARSPNLIPVP TLGIEIIWQT HMLASTTYRR ETQMLVGRVV DRDEAVEESV LAEAFTETAE
KWKASFHVPY TTSGMLLPLP LSGLAAKVAL KLHWRPRPRR LSLDRKVDYD NPIVYAVVDS
TLSLPCERNS IALIDDDKAR TKRALRREEY ELYRVAALQW ADKGKVDPAL AEGLRNLTPA
FMTTTLEDAK CGPGGFPCSP SYLTISGNHI SGKLISSTSN GMGIVGAGLW RGNDAELAKM
QPSTSL