Gene CNG04710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG04710 
Symbol 
ID3258539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp1344312 
End bp1346099 
Gene Length1788 bp 
Protein Length526 aa 
Translation table 
GC content54% 
IMG OID638258094 
Producthypothetical protein 
Protein accessionXP_572149 
Protein GI58269986 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGGGTTCTGC CGGCACTGAG TTCTCCTTTT TAGCCTCGCT CGATGCTTCT AACGCTTTCA 
ACCGTGTAGA TAGGGCCGAG ATGGCAGCTG CGGTCAAGAC CCATGCGCCG ACGCTTTGGA
GGACATGCAA ATGGGCCTAT GGCGACTCAT CCGACCTCGT GTGTGGCGAC AAAATCCTTC
ACTCCTCTCA AGGTGTTCGA CAGGGTGACC CCTTTGGCCC TCTCTTCTTC TCGATCACCC
TCCGACCAAC CTTGAACGCC CTCAGTCAAT CGCTAGGTCC GTCTACGCAA GCGCTCGCTT
ATCTCGATGA CATCTACCTC TTCTCAAACG ACTCGCAAGT CCTCAGCAAA ACTACCCAAT
TCCTCGCCGA CAAGCAGCAC ATCATCAAGC TCAATGAAAA GAAATGCAAG TTAATCAGCT
TCGATGAGAT CAGACAGGAT GGCTTCAAGA TGCTGGGGAC GATGGTAGGA GGTAAGGAGA
AGCGAGCGGA GTTTCTGGAA GGCAGGATTC GGAAGGAAAT GGCAAAGGTG GGCAAGCTCA
AGGATCTTCC GCATCAACAC GCGCTCCTTC TATTATGTTT CTGCATTCAG CAAAATCTAC
GACACCTGCA GAGAAGCCTG CGCTCCGACG ACCTTGTAGA TCTATGGGAA AGACTGGACA
CGATGCTATG GGAGGAGGTG AAAAGGATGA GGATGAGGCA GCGAGAGGAT ACGGTGGAAG
AGGAGGCTCT AGGGAGATCG TTGACGAAGC TACCAGCGCG ACTGGGCGGA CTAGGTCTAC
TTTCCTTCAA AGATGTAGCC CCCCTTGCTT ACCGCTCGGC AGCCGAGGCC TCCGACACTC
TCCTCGATAA CCTAGGTCTC CTTTCGTCGC CTGAGGAACC TCCAACCCCG GTCCCCCAAC
GAACCCGATG CGCAGAACTT TGGGAATCGC AACAGGAAGC CATTCTACGT AATCTCGGCG
ACACTGAGCG CAAGCGACTC ACCGAGAATG CCTCCAGACT CGGCCGAAGT TGGTTATCAG
TTATCCCTTA CCTTCAACCC CTGCGCCTTT CCAATGTCGA GATTGCCTCC GGTCTCCATG
ACCGCACCCT GGTCGGCTCC TCGATCCCTG TCTGTCGCTT CTGTGGGTCG GACTCACCTT
TGGGTCACGA CGAGCTTTGC CGCGCCCGCA ACCCCTGGAC CCAGCGCCGG CACAATGCCA
TCAACCGCGT CATTTATCAA CACCTCAAAC AAATTCAAGG TGCCACGGTT GAGATTGAGC
CCCACACGCT GTCGGGACAA AGGAGAAACG ACCTTCGGGT CAGAGGTTCC AGCGCTCTGG
CCTTCACTGA CTACGACCTG AAGGTTTACT CCCTCGGGGA CCGAGACGCG AGAAGCACCG
TCACACCCTG CGCCCCCAAC GGCAAGCTGG CCGACTTCTG CTTGGACCGG TGCGTGAACT
GGCTCGACAA GGTGGGTCAG GTCGTCTCTA AGAACGCTCC GAAGGTCACT GGTGGGGTCT
TTAAACCAAT CATCCTTTCC ACTGGTGGCT TGATGAGCAG GAGCACAGCA GACGAATGGA
AGGACTGGAG GGACGCGATG CCGGTGGGGG GGTTCGAGAA AATGGAGAAA CGGATTGGTG
TCGAGTTAGT AAAGGCAAGG GCGAGGACGC TGGTCTTATG AGGAAGAGGA GGTTGGATTA
TTTTTTCTTT TCTTTAATAA GTTGTTTATT TAAGTAGTTT CTTTCATTCG GGCAACCCAC
ACGACAACCC AATAAATTAA ACAACGAAAA ATGCAACCTC TATAACCC
 
Protein sequence
MAAAVKTHAP TLWRTCKWAY GDSSDLVCGD KILHSSQGVR QGDPFGPLFF SITLRPTLNA 
LSQSLGPSTQ ALAYLDDIYL FSNDSQVLSK TTQFLADKQH IIKLNEKKCK LISFDEIRQD
GFKMLGTMVG GKEKRAEFLE GRIRKEMAKV GKLKDLPHQH ALLLLCFCIQ QNLRHLQRSL
RSDDLVDLWE RLDTMLWEEV KRMRMRQRED TVEEEALGRS LTKLPARLGG LGLLSFKDVA
PLAYRSAAEA SDTLLDNLGL LSSPEEPPTP VPQRTRCAEL WESQQEAILR NLGDTERKRL
TENASRLGRS WLSVIPYLQP LRLSNVEIAS GLHDRTLVGS SIPVCRFCGS DSPLGHDELC
RARNPWTQRR HNAINRVIYQ HLKQIQGATV EIEPHTLSGQ RRNDLRVRGS SALAFTDYDL
KVYSLGDRDA RSTVTPCAPN GKLADFCLDR CVNWLDKVGQ VVSKNAPKVT GGVFKPIILS
TGGLMSRSTA DEWKDWRDAM PVGGFEKMEK RIGVELVKAR ARTLVL