Gene CNH02210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH02210 
Symbol 
ID3259091 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp511648 
End bp513073 
Gene Length1426 bp 
Protein Length346 aa 
Translation table 
GC content49% 
IMG OID638258266 
Productconserved hypothetical protein 
Protein accessionXP_572564 
Protein GI58270816 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGACAATCAA CCTAGCCTGC TGGCTATATC CAGGCTGGGA CAGACATTAA ATACAACCAC 
GGCTGCAACC TTTGGCACCA CAATGTCATG TCCCCTCCTT CCTCCAAGCA GCCCTTAACA
CCTTCGTTAC AAATAAATCG AAAACCTTTG CCCAAATGGG AAAAAGTCTT GTGGAGAAGT
CAGCCTTATC CTGACAACTA TGTGCCCCCT GACTTTTTGT CAGAGCTTAA TGATATACGT
GAGTCTGTTT TTTTACGATA GATCTCAAGG CTACATAACT GAAATCCACG GTAGTACCCC
GCCCACGCCC GCCTTTTTAC GCTCTGCTGT TAGCATGTCT CCCTATTTCA CAGCATATCT
CTATTATTGC CATATTCCTT GCAATATTTG CTGCACTTTT AGAAGAAAGA GTTACTCCAG
AAGCTGTGGG CTGGGGGTGC GTACTGGGTG GCATTAGTGG ATGGGCAATA TGGACGTGGG
GTTGGGGCAG ATGGGGTCCT AAAGAGCCTC AAGGTCCATT GCTTTTTTTT CCTATGCAAT
GATCAGTGCT GATAGTGCAG TAGATTCATT AATACCCACA CCAACTCCAC TTCGCACCCT
TATACTACCC CCTCTCCTTC TTTCTTTGCT TTCCCCCGTG CTTGGAACCT TGACATCCGC
GACAACTTCA GATTCAATCT GGCCTTTAGC CGGCGGCCTT GGGTTTGTGC ACCTCTTACT
GGTGGATTTC AGGACAGGAG AAGATGTGAG GGTTGTGAGG AGACGTGAAA GGTTGAGAAA
GCGACGGGGT AGTGTGGGCT TGAAGGAAAT CGGAGAGGAG AAAAGGTATG CATCACGATG
CTCTGGGCCT GAGCGATGAG GCGCTGACAT TGAGGTAGCT TGACATCGTC GCTGTCACTA
ACCTCGGCAC TTTCAGCATC TGTTGTGCTT GCTTCTCGTC TACCCTCAAC AGCCCATGTC
TTTTCGTTGG TCCTGCTTGC CGTGTTGCTA TTTGCTGGCT GGCCAGTCAT AACAAAAAGT
GTGCGCGTAA GTTTACCGCC GATCTCTCTT CAAGAGATGT AAGCTGAGAT CTTGATAGGA
GACTGGTAGG GCATACTCTT TCGTACTGAC TGTATCAACA ACGACTCTCG CCCTATCACT
TTTTCCTCCA ACCCCTTCTA CCTTTTCCGG CATCTACTTC GGGTACCTTC CGTCAACACC
AACGCTAGTG TTTCTGTTAA TTCTTTTTCT CGTCAATTTC ATTGGACCTG CCATGCTCTG
GTATGCTTGG CGATGGAAAG TCAGGCGAGG CGGCGGCTGG GATGTTGCGA CAGTTCGAAT
TCGCCAGAGT CGGCCATGAT GGTGAGCGTT GTTGCTGCCG AGCTTTGCCC AGATGAATGC
ATTGTTGATA GTGACATATG AGAAGTGAAT ATGCATGCGT ACAACC
 
Protein sequence
MSPPSSKQPL TPSLQINRKP LPKWEKVLWR SQPYPDNYVP PDFLSELNDI LPRPRPPFYA 
LLLACLPISQ HISIIAIFLA IFAALLEERV TPEAVGWGCV LGGISGWAIW TWGWGRWGPK
EPQDSLIPTP TPLRTLILPP LLLSLLSPVL GTLTSATTSD SIWPLAGGLG FVHLLLVDFR
TGEDVRVVRR RERLRKRRGS VGLKEIGEEK SLTSSLSLTS ALSASVVLAS RLPSTAHVFS
LVLLAVLLFA GWPVITKSVR ETGRAYSFVL TVSTTTLALS LFPPTPSTFS GIYFGYLPST
PTLVFLLILF LVNFIGPAML WYAWRWKVRR GGGWDVATVR IRQSRP