Gene CNL03950 details


Gene Information

Locus tag: CNL03950
Symbol:
ID: 3254758
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 82145
End bp: 85590
Gene Length: 3446 bp
Protein Length: 1075 aa
Translation table:
GC content: 55%
IMG OID: 638253867
Product: hypothetical protein
Protein accession: XP_568210
Protein GI: 58261600
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.215848
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAGTC CGAACACTGC AAAGCGGGCT GCTGCTGCCC TTGACAGGTG AGTCCCTGCT 
CGTGCATGCA CCGTGAAAGA GAAGAGGGGC GCTTATGCAT GGAAACAGCG AGGCCAAGTC
AGAGTCAGAC CGCCTTCGTA ATGTCCTCGG GTCGCAGCCG CCATGGAGTA AAGAAGTAGA
GCTCCATCGC CAGCAGTCAG TCTTTGCCTC TTGCAACCAA ACCCATCCAG CTAACATGCA
TGCGCTGGAA GATGTCGTAA CGCCCATCTC ACGCTCCTCT TCTCCCACCC ATTCTCACCG
TATTCGCAAT CCCTCGACTC GCTGTGGCTT CACACCACAT ACGTCCTTAT CCAGGCTTAC
CGCGACCTCA TCTCCCGGTT AGAACGTTTC CCACCTGCTG CCTCGAATAG CAACGGGGGC
AGCAAGGGTC GACGAGGAGG AGGAGGCGGA GGTAACTCGG AGCTGAAGAA GGCGCTTACG
AGGTTCAGAC AGGTGCTCGC TAGTGAAGAG ACGTTCTATC GTTCGCTCGC CGCTCGGCTC
GTACGTTTCT ACAATCTAGG TGAAATTTCA GGCGTAGATG AGACTCTCAA GGCCGTCAAG
CTCCCAACAG AGTACATGCC CACTAATGAC AGCGGCGAAG ATGAATCGTC GTCATACGCC
GCGCGATTTG CCTCGTTGCA AGAGAAGAAG GACAAGATTC CCTTACTCTA CAAGGGCCTG
ATCTGCCTAG GTGACTTGGA ACGGTACAAG AAGCAATACA AACAGCCTGT CAACAATAAC
CGCCATGCGC AGGAGCGAGA GAGGCAAGCG GATAAATTTG AAGTCGCTGA GAAGTATTAC
TTGGCAGCTT GGCGCTTGAT GCCTGACGAT GGTTTGTTGC TTCCTCTTTA CAACTTGGTT
CATGAACTGA CAATGTATAA AAGGTGCGGC ATGGAATCAG CTCGCAGTCA TCTCGACATA
CGTCCACAAC GACTATTCCA CCACGTACTA CTACTATCGT GCTCTCGCAG TCAAGAATGC
TTTCCAAGGC GCCGACGGAA TCTTGCAGAG ATTCTTCGGT AGAATCTTTG ACAAGTGGCG
AGCAAAGAGA AAAGACGGTG ACGGCGAAGA AGGTGGTGAA GTGGGAGATG GTGTGGAAAA
GTGGAAAGAG GAGATGGTTG TGCTCATGGC AATCCTCTAT CTTAAAGCTG GGTGAGCCGC
CTGTTTCCGT TTCGTTGGAC ACATGTTGAC AAGATTACAG ATTCACATAC ATCCCGACTA
TTCTCCCACC TCTCCTCACT TCGCTGAAAG ATTTTTTATC TGAACGACGG CTCCCCACCG
AGTCTATTGT TCAGCTCACT TCCATCCTCC TCGGCTCTCA CTTCCGCGCC CGCTCCACCT
CTGGCCTGGA GCAAGACCCA AATTTACTCA AACGTTCCTT CGAAGCCGAA GGCAAGACGC
TCGAGGTCGC GTTAGGAGTA TGGAAGATCT ATCTGGAAAT CGCGCGCGAA GAGATCGACG
AGGCTAGAGC GAGCTTGCGA AGAGGTTTGG AAGATAGTGC ATTGTTGGAT GATGAAGATG
AAGAAGAACT GGAAACGGAT GAGATGCCGC AACTGATTTC CGCCGTCTTG CGACGTATCC
TTCCTTCCCT GCGTATCATC TCCAAATGGC TCAAGCTAAA CACTACCTAC CTCTCTGGTC
TCGCATCGCC ATCAAGCAAC GATCAAGTCT CTTCGCCGGA GCTCAGGGCG GCGATATCCT
CTTTCTGGAA CACTTACAAC GCCTTTTTCC AATCCTCTTC CAGACTCTTC ATTCTTGAAC
GTCTTCCATC GATCACGAGG CCTCTGGAAG AAGATATTGA TATGCGAGGA TTCGCGCCTC
TCCAAAAGGG CAAGACAGCT GATATGGACT CTTTTGCAGC TGGTGCGACC TCGGCGACGG
ATTCAGCGGA TGCAGAGGGA GATGTGGATG GTGGACAAGA GGGGAGGGAG GTACACCCGA
ATGAGGAGCA TCTGATGAGG TTGGGGGATA TACAAGTGGA CGGGTTGTTG ATCGGTCAGA
GTTTGGGGTT TACTGATCCT CTTCATACCT TTACCACCGG AACCTTTGCC ATCCCCGAAG
CAACAGAGTT ACCAAACGTG CCGCGGGTGG AAGAACATGA AAGGGAACAT GATCTGCAGT
CAATCTCGAC CAACACTGAG GATGATCCGG TCAACCTCGC TATGCGTGCA AGTCTTGGCG
TTGAGAGTGT TGGTGAGGAA GACGAGGATG ACGAGGAGGT TATTGTGTGG AGCCGTGGAA
GGGAGAATGC GGTGGATACC AGCGTATACC AGCAGGTCCA AACCATTCCT GCTCAGCTCC
CGGCCCCAAT GGTCCCTCCT CCCAAGCACC GCCTTCAACA TCAGCAAAGC CAAACAGCCA
TGGATCTTTT ACAAGATCTC CTACAGTCTA CCCCGCCCCT GGCGCACTCG TCGGGCAACA
ATTCGCCTGC TCTTGTAGCT TCTTCACCGT ATGCCCATCA GTCCCCGTCT TATACTCAGA
CTGGATTTCC CGGCATGAAC AGAAACATGA GCGGACCTAG TGCTGGTATT CCTTCTCCCC
ATAATACGCA AATGCCCCCC GGCCTGCCTA TACTCGGCCC AGCCCCTGTT CCGCAATCTG
TACGGCCTGT ACAGCATTCT GGACAAGCCA AGAATCCCGG CCAGGTACCT TCTCTCTTCA
TGGGACCTCA GGGGAATAGT ATTTGGACTA TGACTCGTGA AGAAAGCCAG CAAGGTCGCG
CAAGGAGAGC TGGTAGTGGC GCTGGCACAG GCCCAGGTAT GGGTTGGGTA GGGGGTGATG
AAGGTCAGCA AGGTTCCGTT GCTTCCCAAG GCCCTCCTCC TCCTCCTGGT TTCGGAATCC
CTCAAGCACC TCAGATTGCT CAACCTTTGG CGACGCAGAT GCATCAGCCT CAACCTCAAG
CGCCTGTGCC GATCCAGCAC ATGGGGTTAT CCACTGCTTC TGCTACTCTC CAGGCCCAAG
CCACTTTCCC CCGCTCTTCG CCTGGAGCCG TTGCGCCTCC AAACCTGACA CACCGGCCCC
CCTCTATCCC CAAGCTCCAC CATGAATCGC CAGCGCGACA GACCCCCGTC CAAGTCCAAC
ACCCTACGCG TCAACCGCGC GGACCGTCGG CCAACCCTTC GGGGACTTGG GGGGGCTTGT
CCAACATGAA TATGGGGGGT ACAGGCATGG GCATGGGCAT GTCCAATGTC CCGTTGCCTG
TACCCAAGAG TGAAGGGGAG GCAGCAGTAC CGTATTATAT GCGGCCTGGA GTGTTTGGTG
GAGGTGCAGC GACGCCCGCG GGGGCGGGTG GGATCAGTGG AGTGGGTGGA GTAGTACCCG
CAAGTGGAAG GGATGGGGCA GGGCAAGGGA TATGGGATGG CACGAGGGGT TTCGGGCAGG
GACAAGGCTG GGTCAGCGGC TCGTAA
 
Protein sequence
MQSPNTAKRA AAALDSEAKS ESDRLRNVLG SQPPWSKEVE LHRQQCRNAH LTLLFSHPFS 
PYSQSLDSLW LHTTYVLIQA YRDLISRLER FPPAASNSNG GSKGRRGGGG GGNSELKKAL
TRFRQVLASE ETFYRSLAAR LVRFYNLGEI SGVDETLKAV KLPTEYMPTN DSGEDESSSY
AARFASLQEK KDKIPLLYKG LICLGDLERY KKQYKQPVNN NRHAQERERQ ADKFEVAEKY
YLAAWRLMPD DGAAWNQLAV ISTYVHNDYS TTYYYYRALA VKNAFQGADG ILQRFFGRIF
DKWRAKRKDG DGEEGGEVGD GVEKWKEEMV VLMAILYLKA GFTYIPTILP PLLTSLKDFL
SERRLPTESI VQLTSILLGS HFRARSTSGL EQDPNLLKRS FEAEGKTLEV ALGVWKIYLE
IAREEIDEAR ASLRRGLEDS ALLDDEDEEE LETDEMPQLI SAVLRRILPS LRIISKWLKL
NTTYLSGLAS PSSNDQVSSP ELRAAISSFW NTYNAFFQSS SRLFILERLP SITRPLEEDI
DMRGFAPLQK GKTADMDSFA AGATSATDSA DAEGDVDGGQ EGREVHPNEE HLMRLGDIQV
DGLLIGQSLG FTDPLHTFTT GTFAIPEATE LPNVPRVEEH EREHDLQSIS TNTEDDPVNL
AMRASLGVES VGEEDEDDEE VIVWSRGREN AVDTSVYQQV QTIPAQLPAP MVPPPKHRLQ
HQQSQTAMDL LQDLLQSTPP LAHSSGNNSP ALVASSPYAH QSPSYTQTGF PGMNRNMSGP
SAGIPSPHNT QMPPGLPILG PAPVPQSVRP VQHSGQAKNP GQVPSLFMGP QGNSIWTMTR
EESQQGRARR AGSGAGTGPG MGWVGGDEGQ QGSVASQGPP PPPGFGIPQA PQIAQPLATQ
MHQPQPQAPV PIQHMGLSTA SATLQAQATF PRSSPGAVAP PNLTHRPPSI PKLHHESPAR
QTPVQVQHPT RQPRGPSANP SGTWGGLSNM NMGGTGMGMG MSNVPLPVPK SEGEAAVPYY
MRPGVFGGGA ATPAGAGGIS GVGGVVPASG RDGAGQGIWD GTRGFGQGQG WVSGS