Gene CNF04940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF04940 
Symbol 
ID3258471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1437068 
End bp1438867 
Gene Length1800 bp 
Protein Length526 aa 
Translation table 
GC content54% 
IMG OID638257612 
Producthypothetical protein 
Protein accessionXP_571462 
Protein GI58268612 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGAGTCTTGG AGGGTTCTGC CGGCGCTGAG TTCTCCTTTT TAGCCTCGCT CGATGCTTCT 
AACGCTTTCA ACCGTGTAGA TAGAGCCGAG ATGGCAGCTG CTGTCAAGAC CCATGCGCCG
ACGCTTTGGA GGACCTGCAA ATGGGCTTAT GGCGACTCGT CCGACCTTGT GTGTGGCGAC
AAAATCCTTC AATCCTCTCA AGGTGTTCGA CAGGGTGACC CCTTTGGCCC TCTCTTCTTC
TCAATCACCC TCCGACCAAC CTTGAATGCC CTCAGTCAAT CGCTAGGTCC GTCTACGCAA
GCGCTCGCTT ATCTCGATGA CATCTACCTC TTCTCAAACG ACTCGCAAGT CCTCAGCAAA
ACTACCCAAT TCCTCGCCGA CAAGCAGCAC ATCATCAAGC TCAACGAAAA GAAATGCAAG
TTAATCAGCT TCGATGAGAT CAGGCAGGAC GGCTTCAAGA TGCTAGGGAC GATGGTAGGA
GGTAAGGAGA AGCGAGCGGA GTTTCTGGAA GGCAGGATTC GGAAGGAAAT GGCAAAGGTG
GGCAAGCTCA AGGATCTTCC GCATCAACAC GCGCTCCTTC TATTACGGTT CTGCATCCAG
CAAAATCTAC GACACCTGCA GAGAAGCCTA CGCTCCGACG ACCTTGTAGA TCTATGGGAA
AGACTGGACA CGATGCTGTG GGAGGAGGTG AAAAGGATGA GGATGAGGCA GCGAGAGGAT
ACGGTGGAAG AGGAGGCTCT AGGGAGATCG TTGACGAAGC TACCAGCGCG ACTGGGCGGA
CTAGGTCTAC TTTCCTTCAA AGATGTAGCC CCCCTTGCTT ACCGCTCGGC AGCCGAGGCC
TCCGACACTC TCCTCGATAA CCTAGGTCTC CTTTCTTCGC CTGAGGAACC TCCAACTCCG
GTCCCCCAAC GAACTCGATG CGCAGAACTC TGGGAATCGC AACAGGAAGC CATTCTACGT
AATCTCGGCG ACACTGAACG CAAGCGACTC ACCGAGAATG CCTCCAGACT CGGCCGAAGT
TGGTTATCAG TCATCCCTTA CCTTCAACCC CTGCGCCTTT CCAATGTCGA GATTGCCTCC
GGTCTCCATG ACCGCACCCT GGTCGGCTCC TCGATACCTG TCTGTCGCTT CTGTGGGTCG
GACTCACCTT TGGGTCACGA CGAGCTTTGC CGCGCCCGCA ACCCCTGGAC CCAGCGCCGG
CACAATGCCA TCAACCGCGT CATCTATCAA CACCTCAAAC AAATCCAAGG TGCCACGGTT
GAGATTGAGC CCCACACGCT GTCGGGACAA AGGAGAAACG ACCTTCGGGT CAGAGGTTCC
AGCGCGCTGG CCTTCACTGA CTACGACCTG AAGGTATACT CCCTCGGAGA CCGAGACGCG
AGGAGCACCG TCACACCCTG TGCCCCCAAC GGCAAGCTGG CCGACTTCTG CTTGGACCGG
TGCGTGAACT GGCTCGACAA GGTGGGTCAG GTCGTCTCGA AGAACGCTCC GAAGGTCACT
GGTGGGGTCT TTAAACCGAT CATCCTTTCC ACTGGTGGCT TGATGAGCAG GAGCACAGCA
GACGAATGGA AGGAGTGGAG GGAGGCGATG CCGGTGGGGG GGTTCGAGAA AATGGAGAAA
CGGATTGGTG TCGAGCTAGT AAAGGCAAGG GCGAGGACGC TGGTCTTATG AGGAAGAGGA
GGTTGGATTA TTTTTTCTTT TCTTTAATAA GTTGTTTATT TAAGTAGTTT CTTTAATTCG
GGCAACCCAC ACGACAACCC AATAAATTAA ACAACGAAAA AATGCAACCT CTATAACCCC
 
Protein sequence
MAAAVKTHAP TLWRTCKWAY GDSSDLVCGD KILQSSQGVR QGDPFGPLFF SITLRPTLNA 
LSQSLGPSTQ ALAYLDDIYL FSNDSQVLSK TTQFLADKQH IIKLNEKKCK LISFDEIRQD
GFKMLGTMVG GKEKRAEFLE GRIRKEMAKV GKLKDLPHQH ALLLLRFCIQ QNLRHLQRSL
RSDDLVDLWE RLDTMLWEEV KRMRMRQRED TVEEEALGRS LTKLPARLGG LGLLSFKDVA
PLAYRSAAEA SDTLLDNLGL LSSPEEPPTP VPQRTRCAEL WESQQEAILR NLGDTERKRL
TENASRLGRS WLSVIPYLQP LRLSNVEIAS GLHDRTLVGS SIPVCRFCGS DSPLGHDELC
RARNPWTQRR HNAINRVIYQ HLKQIQGATV EIEPHTLSGQ RRNDLRVRGS SALAFTDYDL
KVYSLGDRDA RSTVTPCAPN GKLADFCLDR CVNWLDKVGQ VVSKNAPKVT GGVFKPIILS
TGGLMSRSTA DEWKEWREAM PVGGFEKMEK RIGVELVKAR ARTLVL