Gene CNK01780 details


Gene Information

Locus tag: CNK01780
Symbol:
ID: 3254658
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 523211
End bp: 524660
Gene length: 1450 bp
Protein length: 337 aa
Translation table:
GC content: 47%
IMG OID: 638253671
Product: conserved hypothetical protein
Protein accession: XP_567657
Protein GI: 58260494
COG category: [R] General function prediction only
COG ID: [COG1741] Pirin-related protein
TIGRFAM ID:
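
The length fields in this record can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive, that the coding sequence consists of the protein's codons plus one stop codon, and that the gene span contains no untranslated regions. Under those assumptions, whatever is left of the gene span after the coding sequence would be intron sequence, which fits a gene flagged as spliced.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 523211
end_bp = 524660
protein_length_aa = 337

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus a stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Assumption: the gene span has no UTRs, so the remainder is intron sequence.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp)  # 1450, matching the reported gene length
print(cds_length_bp)   # 1014
print(non_coding_bp)   # 436
```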


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.293563
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAGGA CCACACTTAC ACGTTTCAAC ACTTCATCAA GTCTAACCAC ACTCAAAGCA 
AACATGTCCA CCACTGCTGC TATCAATACC GCCACCGGCA CTTCTCGAAG TATTACCAAG
ACTGTTTACG CTCACGAGGT CTCTGAAGGT GCCGGTGCCA CTGTCAGGAG GTCTATCGGT
ACTAGGGAGC TTCGGAATCT TACTCCTTTC TTGATGTGAG TCACTTTCAG CTGTTGAGGT
GCAAGACTTG GAATTTGGGA CATCGACACC AAGGAATGGA GACCTTACCC AAACTCACTT
GAGGGTATGC CCGGACAAGT GATCGAGATT CCGATAGGAG TCAACTATCC AGATCCGAGA
AAGACTAGCC CCTTAATGAC TCATTCGTAA ACTTGACCCG ATATTTGTAT ACTGACTTCC
TCTCACAGGC TCGATCATTT TAAAGTCCTT CCCGGTGCTG GTTTCCCCGA TCACCCTCAC
CGCGGTATGC AAACTGTCAC TTACTTGTTC CGAGGTATCT TCAAGCACGA GGACTTCCTT
GGATACTCTG GAACGTAAGT TTTTTTCTGT TTGAAGCTAT TTCTTTGTAA TATTACCTGC
TGACACGATC GCTTTAGTTT AACACCCGGA GATGTTCAGT GGATGACAGC CGGTAAGGGT
ATCGCCCACG CCGAGATGCC TATATTTGAC CCGGACCCAA CTAAGGCTGA GCCTGTTGAG
GGTATGCAAC TTTGGATTGA CCTTCCCCAG AAGGAGAAGT ACATTGAACC AGAATACCAG
GACCGAAAGG CTGAAGAGTA AGTCTGAATG ATTAAATTTT ATAGTAGTGG TTTTGCTGAT
ATATGCGTAG TATTCCTGTT ATCCACCCTA AAGATGGAGT AGAAATCACT GTTCTTTCTG
GTGACTCTCA CGGTACCAAC GGCTCTGTTA CTCCGGTCGG CGGTGCTTGG TACTTGGGTT
TCAAACTTCA GAAGCCCGGG GCGAGCGTGT ATCAGCCTCT TCCTGAAGGT TACAATGCCT
TCATCTATAG TAAGCCTTTT TAATCTCATC GTACTGACCT ACTACTGACA AATACTAAAA
CAGTCGTGAA GGGAAAGCTG CAAATCGGCG ACGACACCAA GACTCATGAC AAGTTCAACT
TGCTTGTTCT TTCTTCCAAG CCTGGAGAGT CTGGCGTGAC CCTTACCAGG CCAGAAGACG
ACACTGATGC TGAAGAAGCA CATTTTGTTG TCATTGCCGG AAAGCCTCTC GACCAGCCTA
TTGTTCAGGT GAGTCAGTTT CTCATGTGTA AGATGATGAT GTGTTGACAG CCCTTTTTCA
GTATGGCCCT TTCGTCACCT GCAGTCAAAG ACAGGCTATG GAAGCAATTA TGGACTATCA
GACAGGGAAG AATGGGTTCG AGCGTGCTGT TGGTTGGAAG AGCAAGATCG CCAAGGATTT
CAGGGGTTAA
 
Protein sequence
MLRTTLTRFN TSSSLTTLKA NMSTTAAINT ATGTSRSITK TVYAHEVSEG AGATVRRSIG 
TRELRNLTPF LMLDHFKVLP GAGFPDHPHR GMQTVTYLFR GIFKHEDFLG YSGTLTPGDV
QWMTAGKGIA HAEMPIFDPD PTKAEPVEGM QLWIDLPQKE KYIEPEYQDR KAEDIPVIHP
KDGVEITVLS GDSHGTNGSV TPVGGAWYLG FKLQKPGASV YQPLPEGYNA FIYIVKGKLQ
IGDDTKTHDK FNLLVLSSKP GESGVTLTRP EDDTDAEEAH FVVIAGKPLD QPIVQYGPFV
TCSQRQAMEA IMDYQTGKNG FERAVGWKSK IAKDFRG
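
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, not part of the source database: `normalize_sequence` and `gc_fraction` are hypothetical helper names, and the two sample strings are the first gene sequence line and the last protein sequence line copied from this record.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence and last line of the protein sequence.
first_gene_line = (
    "ATGTTAAGGA CCACACTTAC ACGTTTCAAC ACTTCATCAA GTCTAACCAC ACTCAAAGCA"
)
last_protein_line = "TCSQRQAMEA IMDYQTGKNG FERAVGWKSK IAKDFRG"

print(len(normalize_sequence(first_gene_line)))    # 60
print(len(normalize_sequence(last_protein_line)))  # 37
print(gc_fraction(normalize_sequence(first_gene_line)))
```

Applying the same helpers to the full gene and protein blocks would allow the reported 1450 bp, 337 aa, and GC content values to be compared with the sequences themselves.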