Gene CNH03020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH03020 
Symbol 
ID3258995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp255808 
End bp257517 
Gene Length1710 bp 
Protein Length526 aa 
Translation table 
GC content54% 
IMG OID638258183 
Producthypothetical protein 
Protein accessionXP_572472 
Protein GI58270632 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.18451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCTG CTGTCAAGAC CCATGCGCCG ACGCTTTGGA GGACTTGCAA ATGGGCCTAT 
GGCGACTCGT CCGACCTTGT GTGTGGCGAC AAAATCCTTC AATCCTCTCA AGGTGTTCGA
CAGGGTGACC CCTTTGGCCC TCTCTTCTTC TCAATCACCC TCCGACCAAC CTTGAATGCC
CTCAGTCAAT CGCTAGGTCC GTCTACGCAA GCGCTCGCTT ATCTCGATGA CATCTACCTC
TTCTCAAACG ACTCGCAAGT CCTCAGCAAA ACTACCCAAT TCCTCGCCGA CAAGCAGCAC
ATCATCAAGC TCAACGAAAA GAAATGCAAG TTAATCAGCT TCGATGAGAT CAGGCAGGAT
GGCTTCAAGA TGCTAGGGAC GATGGTAGGA GGTAAGGAGA AGCGAGCGGA GTTTCTGGAA
GGCAGGATTC GGAAGGAAAT GGCAAAGGTG GGCAAGCTCA AGGATCTTCC ACATCAACAC
GCGCTCCTTC TATTACGCTT CTGCATTCAG CAAAATCTAC GACACCTGCA GAGAAGCCTG
CGCTCGGACG ACCTTGTAGA CCTATGGGAG AGGCTGGACA CGATGCTATG GGAGGAGGTG
AAAAGGATGA GGATGAGGCA GCGAGAGGAT ACAGCGGAAG AGGAGGCTCT CGGGAGATCG
TTGACGAAGC TACCAGCGCG ACTGGGCGGA CTAGGTCTAC TTTCCTTCAA AGATGTAGCC
CCCCTTGCTT ACCGCTCGGC AGCCGAGGCC TCCGACACTC TCCTCGATAA CCTAGGTCTC
CTTTCTTCGC CTGAGGAACC TCCAACTCCG GTCCCCCAAC GAACTCGATG CGCAGAACTC
TGGGAATCGC AACAGGAAGC CATCCTACAT AATCTCGGCG ACACTGAACG CAAGCGACTC
ACCGAGAATG CCTCCAGACT CGGCCGAAGT TGGTTATCAG TCATCCCTTA CCTTCAGCCC
CTGCGCCTTT CCAACGTCGA GATTGCCTCG GGTCTCCACG ACCGCACCCT GGTCGGCTCC
TCGATACCTG TCTGTCGCTT CTGTGGGTCG GACTCACCTT TGGGTCACGA CGAGCTTTGC
CGCGCCCGCA ACCCCTGGAC CCAGCGCCGG CACAATGCCA TCAACCGCGT CATCTATCAA
CACCTCAAAC AAATCCAAGG TGCCACGGTT GAGATTGAGC CCCACACGCT GTCGGGACAA
AGGAGAAACG ACCTTCGGGT CAGAGGTTCC AGCGCGCTGG CCTTCACTGA CTACGACCTG
AAGGTTTACT CCCTCGGGGA CCGAGACGCG AGGAGCACAG CCACCCCCAG CACCCCCAAC
AGCAAACTGG CCGAATTCTG CTTGGACCGG TGCGTGAACT GGCTCGACAA GGTGGGTCAG
GTCGTCTCCA AGAACGCTCC GAAAGTCACC GGTGGGGTCT TTAAACCGAT CATCCTTTCC
ACTGGTGGCC TGATGAGCAG GAGCACAGCA GACGAATGGA AGGAGTGGAG GGAGGCGATG
CCGGTGGGGG GGTTCGAGAA AATGGAGAAA CGGATTGGTG TCGAGTTAGT AAAGGCAAGG
GCGAGGACGC TGGTCTTATG AGGAAGAGGA GGTTGGATTA TTTTTTCTTT TCTTTAAAAA
GTTGTTTATT TAAGTAGTTT CTTTCATTCG GGTAACACAC ACGACAACCC AATAAATTAA
ACAACGAAAA AATGCAACCT CTATAACCCC
 
Protein sequence
MAAAVKTHAP TLWRTCKWAY GDSSDLVCGD KILQSSQGVR QGDPFGPLFF SITLRPTLNA 
LSQSLGPSTQ ALAYLDDIYL FSNDSQVLSK TTQFLADKQH IIKLNEKKCK LISFDEIRQD
GFKMLGTMVG GKEKRAEFLE GRIRKEMAKV GKLKDLPHQH ALLLLRFCIQ QNLRHLQRSL
RSDDLVDLWE RLDTMLWEEV KRMRMRQRED TAEEEALGRS LTKLPARLGG LGLLSFKDVA
PLAYRSAAEA SDTLLDNLGL LSSPEEPPTP VPQRTRCAEL WESQQEAILH NLGDTERKRL
TENASRLGRS WLSVIPYLQP LRLSNVEIAS GLHDRTLVGS SIPVCRFCGS DSPLGHDELC
RARNPWTQRR HNAINRVIYQ HLKQIQGATV EIEPHTLSGQ RRNDLRVRGS SALAFTDYDL
KVYSLGDRDA RSTATPSTPN SKLAEFCLDR CVNWLDKVGQ VVSKNAPKVT GGVFKPIILS
TGGLMSRSTA DEWKEWREAM PVGGFEKMEK RIGVELVKAR ARTLVL