Gene CNK00920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK00920 
Symbol 
ID3254420 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp288372 
End bp290831 
Gene Length2460 bp 
Protein Length676 aa 
Translation table 
GC content53% 
IMG OID638253582 
Producthypothetical protein 
Protein accessionXP_567655 
Protein GI58260490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCGAGCGTT CATTATTTTG CCATCCCATA ATACTCTTTC CCCAGAAACC ATTGGACCGC 
TCCGGAGAGA CCGTCAGCAA GCCGTTCGAC ACCACTGCAG TCCTCGTTAA TCCCACCGTC
ACGTGCTTTG TGGTCGTTCC TTCCAATCTT CCGCCATATT CCGTTTCTTT AATTGTCCGC
TTCCATTGCC ATCGTCAGAC CATGTCATAC GGGAAACCGC TTCAGAAGAA ACCAGTCTGA
TAGAATCTCT CTACTCTCAA GATGCAGGAC GAACAACCCC CTCAGGGGAA TGCCGGAATG
AACGAAGACA CACCAATGCA CACACCCACG TCTCCACAGT CCAACTCTGC GAGCGGGGCC
AGCCATCATC CACAATCAGC CCTCCCACTG TGTTTGCCCC GCTTCACCAC ATCTACCAAT
GCATTTGGCC TTACTTCCTC TCCATCTTCT TCATCATCGT CTCAACGATC AAATGATTAT
TTCGGCTCGG TGAGCGGATC TAGTATGGTG ATTATACCCG GGATGGCAGC GGCAGCGGCG
AGTGACAGGA AGGTCAGGAG ACCAAGCATG TTGAGTTTGG CACAGAACGC GAGTTTTTTA
AGTGAAGGAT CAGGAGAAGG GGAAAGAACA CCAATGGCAT TAGTCGTAGA CACAGCAGGG
GAAAGGGACA AGAGGGGAGC GATGGATATG GTGGAAGATC CTTCAAGCAG CAAGGCGATA
AGTCAATTCA CAGGCGATGG GGGTCCCCTT CAACCTACGC CTCGTTGGCC TTCGAGTGGC
TCCTTTCAGA GTAGTCTTCT CAGAAGATCG ACATCAACAT CTACAATACC GATGGAGACC
CTCAAAACGT CTACACCCGT TCAAATTGAA TCAGTTGACG AGGAGGCACG CGGAAACAGT
GAGGTTGAAA TGGCTACAGA CTGGCTGAAT AAGTCATCGA ACCGAAAGGG TAAAGCTCGA
GAAGGCTCCC CGTCAAATCA TGACACCAAA CTTCATCCCC GCCCACTCCC CTCTGCCCTC
CTACAAACAC TCATATCCGA ATCAGCACCC CTTGAGCATG AAATCCAATC GGAAGCCCGT
CTTCAGCGGC TACTCAGTTC CCACCCCGCC AAGCTGCCAC TCACCCCACG AGCACCTCGC
GGATCTAGGG GTCGGTTTCC TGACCAAGTC GGCGGTGACG ATGATGATGA TGAACTCGGT
CTTTCGGCCT TCTCTGCCGC GGCCAGGCGG TGGGGTACAG GCAGACCATG GTCGAAGGAG
GATTCAGACT CAGATTCAGA TGATGGTCTA CCTGCGGGGG AGGTGAATGC AGCTTTCGCA
GCAGGGATGG ACATGGATCG ACCAGAGAGT AGTTCAAGTA ATGCGAATAC CATGGGAGGT
GGATGGGGAG CAATGAGCGA ATCTGGCAAA TCGACACCCG GCCAGGGGGA AAGAAGACGG
AGTAACAGTG GAAGCAGGAG TGGCAGTGGC AGTGCGGGCC CTGGAGCTGG AACGGGCGCA
GGGTATCCGT TCCCGAATAC AATGTCAACG CCTGGTATGT CAACCCCTGG TGCTGGATCT
CAAGTACAGA CACAAGGCCA AGGACAGCAA CCAATATCTG CTATTAAAGG TAGTAGCAAA
TTAAGCATGA GTGGGATGGT ACCTTCACCT GGGAATGGTC TTCCCAATGC CTTTGGCGGG
CTGGGGATGG GGACGGGTAC TCCACTGGCC AGTCCAACTG TTGAGCGAAA TGAGGTATGG
CACATTATTC CTTATATTCT TATATTTTTA CATTGTAATC ACTAACGTAA TAATACTTAC
AGCTTGTCGG TTCTCCCAAC GCTCCGATGT CGAGCCCTGG GTTGATGCAA TACCGCGAGT
CTTTGGGCGG AAGTGCATCC CTACGACCTA ATAAACGAAA AGGTAAGGGA AACCCTCACA
AGTTGTAGAT TATCGCTGAT TTGCACTTCT GCAGCTCAAG AAGACAGGTT CGACCCGTAC
AAGCGTCCGC GCGCAGCTTC TCCCTCCCTC CTCTCTTCTC CCGCTTTTCC CCTATCACCG
TCGAGGCCAA CTTCCATTCC CATCCCGGCA TCTCCCTCGC ACGCTCCGTT GTATCCTTCC
GCGTTATCGA CCTCCCATCC ACGCTCCGTC GGCGGTGTAG GCGCGCAAAC ACATCCGAGT
AGCTCAAGAC CGAATCATCC GTACACAAGA CCGATGGCTA GTCGGTCCAG AGCAGCCAGT
CCGGCTTTGA GCGCTGGGAG TGGGGGTACC AGTTTGGGAA GGGAGAGGTA TCTTGGGCAA
CACGTTCAGA ATGGGAACAA TGGACATGGC GGACAGGGGG CCTTGGGAGG GTTGGGTTTG
CTGAGTTTGG CGAACAGAGT GAGAGAGGAG GACGAAAAAG AGGAAAGGGG GGAGAGGATG
GAAGAAGACT AAGGCCGGAA TAGAGTCAAA CAGGGAATGT TGTATACAGT AGTATCTACC
 
Protein sequence
MQDEQPPQGN AGMNEDTPMH TPTSPQSNSA SGASHHPQSA LPLCLPRFTT STNAFGLTSS 
PSSSSSSQRS NDYFGSVSGS SMVIIPGMAA AAASDRKVRR PSMLSLAQNA SFLSEGSGEG
ERTPMALVVD TAGERDKRGA MDMVEDPSSS KAISQFTGDG GPLQPTPRWP SSGSFQSSLL
RRSTSTSTIP METLKTSTPV QIESVDEEAR GNSEVEMATD WLNKSSNRKG KAREGSPSNH
DTKLHPRPLP SALLQTLISE SAPLEHEIQS EARLQRLLSS HPAKLPLTPR APRGSRGRFP
DQVGGDDDDD ELGLSAFSAA ARRWGTGRPW SKEDSDSDSD DGLPAGEVNA AFAAGMDMDR
PESSSSNANT MGGGWGAMSE SGKSTPGQGE RRRSNSGSRS GSGSAGPGAG TGAGYPFPNT
MSTPGMSTPG AGSQVQTQGQ GQQPISAIKG SSKLSMSGMV PSPGNGLPNA FGGLGMGTGT
PLASPTVERN ELVGSPNAPM SSPGLMQYRE SLGGSASLRP NKRKAQEDRF DPYKRPRAAS
PSLLSSPAFP LSPSRPTSIP IPASPSHAPL YPSALSTSHP RSVGGVGAQT HPSSSRPNHP
YTRPMASRSR AASPALSAGS GGTSLGRERY LGQHVQNGNN GHGGQGALGG LGLLSLANRV
REEDEKEERG ERMEED