Gene CNH03040 details


Gene Information

Locus tag: CNH03040
Symbol:
ID: 3259122
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 249746
End bp: 251782
Gene Length: 2037 bp
Protein Length: 610 aa
Translation table:
GC content: 53%
IMG OID: 638258181
Product: hypothetical protein
Protein accession: XP_572474
Protein GI: 58270636
COG category:
COG ID:
TIGRFAM ID:
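
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the listed values are consistent with that reading); the function name `gene_length` is hypothetical and only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields.
start_bp, end_bp = 249746, 251782
protein_length_aa = 610

length_bp = gene_length(start_bp, end_bp)
codons = length_bp // 3  # codons the genomic span would hold if it were all coding

print(length_bp)  # 2037
print(codons)     # 679
```

The genomic span could hold 679 codons, more than the 610 residues of the protein plus a stop codon. That is what one would expect for a gene flagged as spliced, since the listed gene sequence would then include intron sequence.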


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTCGC CCGATACACA GGAACGAGAA TGCGCGTCGA CACGAGCTGC GGCACCAGCT 
CAACTTACTG CAGACTCGCA CCTGACTTCA TTATCCCCCC CTTCTGCTTC CGACTCACCC
TTATCCTCGC TCCCTTCATC ATATCAGACA CTCCCTGAAC ATCCGAAATC CCCTCGTGAC
GATCCCATTG CATACGGCGG CCACCCCAAA AACAAAGACC GACCATATCC TCTCTCTCTC
GTCCCCTTCG ACCTCAGCGA ACCTTATGAA GCCTCCGAAT ACCTCACTAT CCGCTCCTAC
GATGAGACAT GCCAAGGCGA CTATCTGGGC CAACCCTGGG CCAATGAGTA CAGGGACGGG
GTGATCGATG ACGAGCAGAT GTTCGATGAC AGTGTTTTTC GTCGCTATCT TGATCGGTTC
ATCCAAGAGA ACTACAACCA CATGACGAGG GCTACCAAAT TCTTTCATCA ATCAAATTTG
AAATCGACCT TCTACGACAG ACATGACAAA GAGTGGGCCA AGCTAGATCT GGTGGATGTG
TACAGACGCC GGACGGGGAA AGTCCTGGGA GTCGTGAGTC ATGGCTTGTG CAAAAGGTGA
CACAATGCTT ATTACGCGTT CAACATAGCC CTCTCAATCG CAAGACACCT CCAAATCAAA
AAAGGATGGT GCCACTACTC CTGATGCCTC TCTTATCGGT GTTGTCAAAG ATCTCAGCTG
CTCATCGGGC TGGAAACGAA CGCTTTATGC GGTCATCGAA TTGAAGTGGA TGAAACTAGC
GACTTTTCTT ACCGGCGAGG CTAAAAGACA GAACGACGAG AAATCTCTCA GATACCTCTG
CCAGGAAGGC GTGTTTCAGA CTATGTGGTA TGTCATATTA GGCTACGCCA TCTCGGGTTG
TATTTTCGGT CTCTCTATAG TCAACGAATA CTTTTATCGG GTTGTGTATC TCTCTCGAGA
CTCGACCCCA GACATTCCCG TACTTGCCCT AGAGGCAGAT AGCGAGTTTT TGGAGAAATC
CAGACGACAT TTTGGATATC TGCCAGATGA TTACTCTGTC GAGGAACTCG CAGAGCTGCA
AGACTTTTGG TCGTCGCCTC CCAATTGTCT GATCAGCGAC CGTGCCAACG CCACTTTGAA
TAAAGAGGCA AGGTATCACC TCGATGCGAC CATTCTCTTG TTCCTCGCTC ATGCAGCGGC
ACTTCCAACG CAACGCTTCC TCAACGACCT GCCCCTCCCT TTTGCTCATC ATGTTCCTGT
TGATGCGACC GCTGATTCAG CCACCGACAT GAGATTGAAA GGATTAGAAG TTGGGCGCAG
GCGACACAGT CGTCGTTCGA CCAAGAGGAA CAAGCGCACA TTGGCGGATT TGTATGATGA
AGAGAAAGAT GAAGAGGACA AGCCAGGGGA CGACAAGCCA CCTGGCAAGG ATAATGATGG
CTCGCACGGC GGAAACTCTG GCCCTGGAGG CGATAACTCA CGTGGCGGAG GGTCGCGTCT
TGGTGGAGGG TCTCGTCCTG GCGGTAGGGG TGCAGGCGGA GGCTCCTCTT CTCGTCGTGC
TGAGGCGTTT GACAGTCGAA CGTCCACTGC ACCGCAAGAG TTCAGGAGAG GCCTGGAGAG
GCTATCCGCT CCTAAGGAAA TGTTCCACAT GAAGACGTCC ATCATGGCCT CCCTCCTCTC
CAATGACCGT ATGCACCTTT TAATCTTCCA CGTTGAAGCT ACGCTTACGA ATCGACTGTA
GGTGCCAGAT GCTCTAGGGC TCCCTCCTCC GTCGACAGTG ACTCCTCTGG ACAGTTGGAC
TCGTCGTTTG ACACGTCCTT CGGCTCCAAT AAGGCAGGTC TTATCCTTGA CGATCTCCGC
CACGATCCCC CCCCAATAGT CAACAAGCCC GACCCTGTCG ATATCGACCT TGAAGATATC
GACCCAGAGT CGGGCGAGCT TACGTTGGCG GCCTTTAAGG ACCGCCTAAC GATGCTCGGG
GTGCGGGTGA AGCTGGTCAC TCGGGACCAG ATGGGCGTCT TGTTGGCCCG GGGATGA
 
Protein sequence
MTSPDTQERE CASTRAAAPA QLTADSHLTS LSPPSASDSP LSSLPSSYQT LPEHPKSPRD 
DPIAYGGHPK NKDRPYPLSL VPFDLSEPYE ASEYLTIRSY DETCQGDYLG QPWANEYRDG
VIDDEQMFDD SVFRRYLDRF IQENYNHMTR ATKFFHQSNL KSTFYDRHDK EWAKLDLVDV
YRRRTGKVLG VPSQSQDTSK SKKDGATTPD ASLIGVVKDL SCSSGWKRTL YAVIELKWMK
LATFLTGEAK RQNDEKSLRY LCQEGVFQTM WYVILGYAIS GCIFGLSIVN EYFYRVVYLS
RDSTPDIPVL ALEADSEFLE KSRRHFGYLP DDYSVEELAE LQDFWSSPPN CLISDRANAT
LNKEARYHLD ATILLFLAHA AALPTQRFLN DLPLPFAHHV PVDATADSAT DMRLKGLEVG
RRRHSRRSTK RNKRTLADLY DEEKDEEDKP GDDKPPGKDN DGSHGGNSGP GGDNSRGGGS
RLGGGSRPGG RGAGGGSSSR RAEAFDSRTS TAPQEFRRGL ERLSAPKEMF HMKTSIMASL
LSNDRLILDD LRHDPPPIVN KPDPVDIDLE DIDPESGELT LAAFKDRLTM LGVRVKLVTR
DQMGVLLARG
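
Both sequences are printed in blocks of ten characters, so the spaces and line breaks have to be removed before lengths or base composition can be computed. The following is a minimal sketch of that cleanup, using only the first line of the gene sequence as sample input; the helper names `clean_sequence` and `gc_fraction` are hypothetical, not part of any database tooling.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (block formatting allowed)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above, used only as a sample.
first_line = "ATGACCTCGC CCGATACACA GGAACGAGAA TGCGCGTCGA CACGAGCTGC GGCACCAGCT"
seq = clean_sequence(first_line)

print(len(seq))
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence through the same helpers gives values that can be compared with the Gene Length and GC content fields; `clean_sequence` works equally on the protein sequence for a length check.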