Gene CNH03080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH03080 
Symbol 
ID3259317 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp239966 
End bp241096 
Gene Length1131 bp 
Protein Length359 aa 
Translation table 
GC content49% 
IMG OID638258177 
Producthypothetical protein 
Protein accessionXP_572478 
Protein GI58270644 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.410223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCCACT TGCTCCGGGC TTTAGCAGTC TGTGGTACCA GATTTGCCAG AGTCTGGACA 
CTTTCTGATC ATCCTGAAGG TAGGACCTTC GCAGTTGAAA CCAGTCGTCC TCTTGGCAAT
AGCGGCCCTA TGGTTGCTGC CGACTTTTTT AGCCAAGTCA ATCTCGACCT CTTTCCGCAC
AATCTCATCA TTCCATCACC TTCCATGGAC AGTTTTGCCG TCGACTACCG GAAATGTGAA
CTTCTCGAGC GCACTTTATC TCGTGTTTGC GACAGTATCC TTTCGCATAC TATCGAATCA
GCTCTCCAAC CCCGCTATGA GGGCACTTTT GACCATGGCA CAGAGGCTGA TCCTTCTGCC
TTCACAACCC AATTACTTCG CGATGTTGAA GAGTTTGCTT GCCAAGCTCG TCCCACACTC
AAGAAGAGGA GAACGGGGAA AGAGGAAGTA GCCCAAGGCA GAACATTTCA ACGTAGGGAT
AGATTCGGTG ATGGTTCAAA CCAAGGCAGA GACGGTGATA ATGGAAAAGG TCCAAGGAGT
GGTGGTGGAA AGCAGGCGAG ACGGGGTGGC TCTGGCCAAA ATAGCCAAAC TAAGGGCCGA
GTTCAAGGTG GCTCTGGCCA AAGTGGTCCA AACGAGGACC AAGTTGGTAC AAAAGACAAT
GGTGGGTGGG AAGATGGTGT GGGAGAGAAC GAGAGGAGTT TGGATCCATT GCTCAAGGTC
GTCGAGAAGG AGGATCAGAA ACAATGTGAG TATATTACAT CTACTTGTGG CTTAAATGTT
AACTTTCATT CTGTAGATTT CGCCAAGTGG CAAAGCCAAC TGCCTTCATC GCCTTCCTCT
TACTATCCCT TGCCATCCGA TGATGACCTG AACAAAGACA AGGAGGCTAT ATTTATCGAT
TATACCAATA CTATAGGCCG TTTCAACTTA TACTATCAGG TCTCATTTAT CACCAGGGCT
GTAGAGATGC TCAAGATGGA GATTATCCCT ATATCAACTG CAAGGATGGA CAACCTTTAT
TCTGGACGGA TTACCTCCCA GCATGTCCTC GCGGATCCTG GCTGGGACGC TGCCATCAGC
GAACCGTTTT CCTCAAGGGC TCTGCAGACC CTACACTGTA TACCAGCCTA G
 
Protein sequence
MFHLLRALAV CGTRFARVWT LSDHPEGRTF AVETSRPLGN SGPMVAADFF SQVNLDLFPH 
NLIIPSPSMD SFAVDYRKCE LLERTLSRVC DSILSHTIES ALQPRYEGTF DHGTEADPSA
FTTQLLRDVE EFACQARPTL KKRRTGKEEV AQGRTFQRRD RFGDGSNQGR DGDNGKGPRS
GGGKQARRGG SGQNSQTKGR VQGGSGQSGP NEDQVGTKDN GGWEDGVGEN ERSLDPLLKV
VEKEDQKQYF AKWQSQLPSS PSSYYPLPSD DDLNKDKEAI FIDYTNTIGR FNLYYQVSFI
TRAVEMLKME IIPISTARMD NLYSGRITSQ HVLADPGWDA AISEPFSSRA LQTLHCIPA