Gene CNC02590 details

Gene Information

Locus tag: CNC02590
Symbol:
ID: 3256355
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 739743
End bp: 741668
Gene Length: 1926 bp
Protein Length: 478 aa
Translation table:
GC content: 44%
IMG OID: 638255479
Product: polygalacturonase, putative
Protein accession: XP_569552
Protein GI: 58264792
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
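
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a gene span that runs from the start codon to the stop codon (the record's sequence begins with ATG and ends with TGA), so any base pairs beyond the coding length are attributed to introns.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bp implied by a protein length, counting one stop codon."""
    return (protein_aa + 1) * 3


gene_bp = feature_length(739743, 741668)
cds_bp = coding_length(478)
# Assumption: the gene span runs from start codon to stop codon, so the
# remainder is treated as intronic sequence.
intron_bp = gene_bp - cds_bp

print(gene_bp)    # 1926
print(cds_bp)     # 1437
print(intron_bp)  # 489
```

The first value agrees with the reported gene length of 1926 bp, and the nonzero remainder is consistent with the gene being marked as spliced.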


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCTTCC TCTGCCTCTT CGCCTGCGTT CTATCTAGCG CGGTCGTGCA GGGATCCCTC 
TTCGTGCTCA GTGAATCTCA AGATCAAAAA ATTATTCCTG AAATCCTTGA AAAAGATAAC
AGCATTTGGC CCTATCACCA TGAGGTATTT GAATTCCACC CAGAGCACAA AGACCTCTCC
ACAGGGCTTG TCCGACCCCT TTGTGTCTTA CATACTTTGG GAGAAGGAGC AGACGACTCA
TACAACTTTG AGAAAGCTGT CCATCAGTGT GGTCGCGGCG GCATTGTGAG ATTACCAGAT
GCCAACTAGT ATGTTCAAGT CTTGTTCAAC TTTCTAGTCA TATATCCTAA TCTAACTGGA
TTGTAGCACA ATCAGTCGTC CCCTTGACAT ATATCTTTCC AACTCCGTTC TTGATCTTCA
TGGTTGGCTA TCTTTTTCCG CGAACGTCTC TTCATGGATT GAAAATCGGA TGCCTCTTGG
CTTCCAAAAT CAATCACTTG CATTTGTGGT GAGGGGTAAT GATTACATTC TCGAGGGGAA
TGATAAAGGA GGGATAAATG GAAATGGACA AGCTTGGTAT GATTACGCAA AGGACTACGG
AAATAAGTTC GGGCGGTGAG TCGGTTTCTC AGGCTAGCCT GTATTGTCCT TGTCATCGTT
CTGAACAAGA CTTCATTTCA ACAGGCCCAT GTCATTGGCC ATCAAAAACA GCAAAAACGT
CATTATAAAA AATTTCAGCA TTGTCCAGCC ACAATTTTGG GCTTCCCTCA TATGGGGTTC
GGAAAATGTG TACATAAAAG ATTTCTACGT TAATGCTACT AGCTTCAACC CGGAATCTTC
CAGTGATCAG AAGAACTGGC TTCAGAACAC TGGTGAGTAA ACATAGGATG ATGGGCTTAA
GGAGTGTAGG TGCTGATAAA ATGATAAATT TACAGATGGA AGTGACACAT ACCAGAGTCA
CAATGTCACT TACGGTGCGT TTTCGTTCAA CTGGTAATAA AAATTACTTT CCTGATCCGA
ATGGATATAG AGAATATGAT TTACCAGGGT GGCGACGATT GTGTAGCATT GAAGCCAAAT
AGTACATCCA TCACACTTCG TAATGTGACT TGTTACGGAG GGACGGGTAT TGCCTTTGGA
TCGATTGCTC AGTACGCGGG CGTGGTAAGT GATGTCGGAC CTTAGGATCG CCTTCATAGC
TGACTGTCGA GATATTGATC TCATATAGAA AGATGTGATT GAAGATGTGT TTATGGAGGA
TATTCGACTG TATCCATCTA ATCAGTGCCC AGCCTATCAA GGTGTCTATT TCAAATCTTG
GTTAGGGTAA GTGTCCGACT GACATCAACT AGCCTTGCCC AGAACTGACC ACATTGCAGA
TACTCTATCG GACAGCCACC AAACGGTGGG GGTGGTGGGT ATGGATATTG TCGTAACGTG
ACAGTAAAGG ATGTCTACAT GGAGGATATA TGGCATCCTC TCGTCGTCCA ATCTGAGTGG
GTTTCGCTCT ACTTTACTAA CTTTCTGTAA TAAGGCGGTG ATATATCGTG CTGATGAATG
ATACTCCACA GCTTAACCTA TCTCACTTTA GACCGTGAAA AATTTACAGA TTCCGGTCTC
TTCGAGTAGG TTCTTTTATA AGATTGACAC GTTTCTGGAT CTGATTGATT GTAAAGGTGG
TATGATATCC ACCTGAAAAA TTTCACAGGA AAAGCTTTGG GTAACAGGAT CGCCTGGATG
TCCTGTTCCA AGTTGACACC ATGTCATGAT TGGACATTTG AGGGCATGGA TATCATGCCA
GGTAAACAAG ATCACCCCGA GATCCATTAT ACTTGTAATA ATTTTGTGCT GGGGGGGAAT
GATGGACTTA ATCAGTGCCA TCCCAGCAAC TCAAAGCTTG AAACTGAGAA TGGTGGCACA
CTCTGA
 
Protein sequence
MRFLCLFACV LSSAVVQGSL FVLSESQDQK IIPEILEKDN SIWPYHHEVF EFHPEHKDLS 
TGLVRPLCVL HTLGEGADDS YNFEKAVHQC GRGGIVRLPD ANYTISRPLD IYLSNSVLDL
HGWLSFSANV SSWIENRMPL GFQNQSLAFV VRGNDYILEG NDKGGINGNG QAWYDYAKDY
GNKFGRPMSL AIKNSKNVII KNFSIVQPQF WASLIWGSEN VYIKDFYVNA TSFNPESSSD
QKNWLQNTDG SDTYQSHNVT YENMIYQGGD DCVALKPNST SITLRNVTCY GGTGIAFGSI
AQYAGVKDVI EDVFMEDIRL YPSNQCPAYQ GVYFKSWLGY SIGQPPNGGG GGYGYCRNVT
VKDVYMEDIW HPLVVQSDLT YLTLDREKFT DSGLFEWYDI HLKNFTGKAL GNRIAWMSCS
KLTPCHDWTF EGMDIMPGKQ DHPEIHYTCN NFVLGGNDGL NQCHPSNSKL ETENGGTL
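
The sequences above are printed in space-separated blocks of ten residues. A minimal sketch of how such a block could be joined into one string and its GC fraction computed is given below. The helper names are hypothetical, and only the first line of the gene sequence is used, so the printed fraction is not expected to match the 44% reported for the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGCTTCC TCTGCCTCTT CGCCTGCGTT CTATCTAGCG CGGTCGTGCA GGGATCCCTC"
seq = clean_sequence(first_line)

print(len(seq))                   # 60
print(f"{gc_fraction(seq):.2f}")  # GC fraction of this 60 bp excerpt only
```

Passing the whole gene sequence block through `clean_sequence` in the same way would give a string whose length and GC fraction can be compared with the Gene Length and GC content fields.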