Gene CNN01080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN01080 
Symbol 
ID3255540 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp333659 
End bp335762 
Gene Length2104 bp 
Protein Length603 aa 
Translation table 
GC content47% 
IMG OID638254524 
Producthypothetical protein 
Protein accessionXP_568745 
Protein GI58262670 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTCCA TTAAGCGGCA GTCTTCTTTA AGCTCCGAAG ATCCTTCGTC AGAACCACCT 
TCGGCACTGC CTCCACGCCT CAAAACCAAG CAAAATGTGC CAGCTTCTTG GTCACCAAAT
ACGACAGCTT CAAATCGCCG AAAGCTCGGA GGGCTCTTTG CTGATTTGGG TATGCCGACA
AGCGGCTCCC CCTTGAAAGC AACTCCAACT AGAGTGGAAT CCACTCCCAC TAAATCCGGG
CAAAAGTCCG GTTGGATGGG CACATTATCG TCTCCTATAA CATCTTTCGC CGACCGACTC
GCTGTACTTT CGATGGTCGC CAACGGTGAA GCCGATCCAG GGAAGTCTCC TGTCAAACGA
AAGGCCAAAT CAAAGCAGGT CAAGGATTGC AAGGAGTATT TCACAGACAA TGAGGGTGTC
ACTTGTGAAA GGAAGCGAGA TATTGAAGAA GGCTTAAAGT TTGAACATTT GCCTGATGAC
CTGTACGTTC TACGTTTCGA CCCTGTCCTT CTTTTTCTGC TGCTCATGCC TATATTTTCA
GTATCTTGGA AGTTCTCCTC CACCTACCAC CAACACCACA AGCACTTTCG TCGGTGTCTA
GTCTATCAAA ACGCTTCTAT AACCTCTCCC GCGCACCTAT CTTATGGGCA CGAATATTCA
ATGCTGCCGG TTATACATCC CAGTTATCCA AGGAGGTACT GGAGCGAGGT TTAGGAGTAT
GGGAGGGGCC TAGAGGACAA TGGGACGGAT TAACATGGGT GACGGAAACT CAAGATACTA
TCGATGAGAT TGAAGAAAAG GTGCCTCCCG AGTACATCCC AATACACTAT CCTACTCTTC
ATCGTACCGC CTGCACACTT CCTCAACTCA TTCGATCACT ATTACCAGCT CACCGAGCAT
CATTTTCAAC ACTATCAAGT CATACGGAAA GTGTCTACTG CGTTCAGTCA GTGGGGAACT
GGCTCATCAC AGGTTCAAGG GATCGAAGCA TCAAAGTATG GAGGTTGCCG CCTGTCAACA
GTGATGAAGA AGCGAGACTT GTTACCACAA TACCCAATGC GCATAACGGA AGTGTCTTGG
GTCTCTGCTT TGAACTCGAT GATAAAGAGA GAGGATTACT GGTTACTTCC TCCTCCGATT
GTACCGCTTC CATCTGGTCT CTGGATTTAT CACCCTACCC TCAAAGAAAA TCAGTAGCAG
TGACCAAGTT GCAAAACCTT TTGCATCCTC TGGCGGTCCT TGATGTTGCC TTGACATCTT
CATCCATCGT CACCGCTTCC AAGGATTGTC ATGTCCGTGT CTACTCTCGA GACTCGTTTG
AGCTTGTCCA CCTACTCACA GGACATCGTG GTCCAGTGAA CTGCGTCACG CCGCGCAAGG
TCGATTGGAC CAGCCGGGAA AAGGGTGAAC AGAGGGAAGA GGTCGTGTCC GCTAGTGGGG
ATGGGAGCTG GATAGTGTGG GATATAAAAA ATGGATGCCA GCTGAAAAAG GGTGCTGATG
TCGGGAGAGG TCTTGCTTGT GTTGCATGGG AGGTTCGTAC GAGCTTTCGT GAAATCCCTT
GAAAGCGAAT ATTACTAACA CGTGATAGGA TGATTACATT CTTACGGGAG ACAATGAATG
CCTTGTCAAG TTGTATGACG CCGAGACATG TAAACTTCTC AAAGTATTCC AAGGACATAG
TAATCTTGTA AGAGCCGTAG CTCTGAGGGT AAGGGATGGG ATGGCGATTA GTGGCAGCTA
CGACGAAAGT GTCATGGTGA GGTTGGGTCT TTTATCGAAG GTCTCTGGAT GACTGATCAG
TGATTCTGCT ATGTGGCTCA ACAGATATGG GACTTACATA CCGGTCATCT AATCAAGCGC
CCAACGCTTG GGCATCACTC CCTCATTTTC GACCTTGAGA TGAGCTGTAA ACGATTGATC
CTGTGGGTGA AATAATCTAT TGAAAGATTA AGACTTCTGC TTATTACTTA TCTACGACCC
TGATCATCTT CGTTTAGAGT TGGTCATGGG CATTCTGTGC AAGTCTTGAC TTGGGGCAAA
GGCCTGCCTT ATGTAGATTT CTTTGTCTGA GGAGGCTCAT AAATATATGT TATACATAGA
TATT
 
Protein sequence
MLSIKRQSSL SSEDPSSEPP SALPPRLKTK QNVPASWSPN TTASNRRKLG GLFADLGMPT 
SGSPLKATPT RVESTPTKSG QKSGWMGTLS SPITSFADRL AVLSMVANGE ADPGKSPVKR
KAKSKQVKDC KEYFTDNEGV TCERKRDIEE GLKFEHLPDD LILEVLLHLP PTPQALSSVS
SLSKRFYNLS RAPILWARIF NAAGYTSQLS KEVLERGLGV WEGPRGQWDG LTWVTETQDT
IDEIEEKVPP EYIPIHYPTL HRTACTLPQL IRSLLPAHRA SFSTLSSHTE SVYCVQSVGN
WLITGSRDRS IKVWRLPPVN SDEEARLVTT IPNAHNGSVL GLCFELDDKE RGLLVTSSSD
CTASIWSLDL SPYPQRKSVA VTKLQNLLHP LAVLDVALTS SSIVTASKDC HVRVYSRDSF
ELVHLLTGHR GPVNCVTPRK VDWTSREKGE QREEVVSASG DGSWIVWDIK NGCQLKKGAD
VGRGLACVAW EDDYILTGDN ECLVKLYDAE TCKLLKVFQG HSNLVRAVAL RVRDGMAISG
SYDESVMIWD LHTGHLIKRP TLGHHSLIFD LEMSCKRLIL VGHGHSVQVL TWGKGLPYVD
FFV