Gene CNL04210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04210 
Symbol 
ID3254941 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp163732 
End bp167381 
Gene Length3650 bp 
Protein Length970 aa 
Translation table 
GC content47% 
IMG OID638253892 
Productconserved hypothetical protein 
Protein accessionXP_567974 
Protein GI58261128 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5309] Exo-beta-1,3-glucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.945618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGACCC TACAACAAGC GCAAGAGGGG GATGCCAGTA ACAATTCCGT CATGCCAGGG 
GAAGCAGAGG GAACTAATGG TGGAGCACAA GGGGACGGAG ATGCAGAAAT GTCTGAGGTG
AGTCTTCGGC CAACGCACAC TGACATTTCC ACACCCCAGA GCAGCATCTT TGTCGACCAT
TCCCCAGCTT CAGTGTCCAA TCCTGCTTTT CAGACATTCA TCAACAACAG TCTTTCCAGA
GTGCAGTGCC AGATTTCAGT GTCGGACAAC GAAGATGATT CAGCAGCGCC ATCTGTTTCT
GGAGAGCATG GCACTGAGTC AGAGGAAGAA TTAAGGGATG CGCAGTCCCA TCTAGACGTG
GGGGGGAGAA GGGAGGATAT GGAATCATGG GATGAGCAGG CTGTCGCCAG AGAAAAGAGG
GAAGCAGGTC CGAGATGTAA GTTCTGTTTT CATAAGGATT TCTATTCACC AACTCATTGT
CTCATTATCC GATAGTGTAT GCAATGAGCG AAGAAGACAT TGATAAGATT GGCCTATATA
GTTGGAAGGT CGATCACAAC ATCTCACGAG CTGCATACAA CAGTCTTCCA GACCATGCCA
ACAATATACC GTTCATCTCC CTCAAAGAGG CACAGTCTCT AATCGACAAT TATTCAGACA
CCCTTTGCGA TTATTATGAC ATGTGCAAGC ACTCTTGTCT TTGCTTCGCC GGCCCTTATG
CTAAAGATCA GAAGTGCTCC AAGTGTGGTG CAGACAGATT CGACCGAAAC GGGAAACCAT
TCCAAAGGTT TTTGTACATC AAGATTATTC CAAGGCTTCA GAAGATGTAC AACCACCCAT
CAACAAAGGA AAGACTCATG TACCGAGACA AGAGCACCCG CGAATACGAT AACTCCAATA
TCCAAGACAT ATTTGATGGG AGTATGTACA GAATACTCTT GGAAAAAGAG ATTGAGGTGG
ATGGAAGAAA GCTCGGAACC AAGTACTTTG CCGATGACAG AGATATTGCT CTCTCATTCT
CATCGGATGG GTTCTGCCCA TTTGACAATG GCTCAGGGAC GTGTTGGCCT CTGGTGCTTT
TCAATCTCAA CCTACCACCA GAAGAAAGGG TCCATACCCG CGAGATACTG TCGGTGGGAC
TCATCCCAGG CCCTAAAAAG CCAGAAGATT TTGATTCTTT CATCTACCCA TTAGTGGAGG
AGCTTTTGCA ACTCACAGTT GGAGTGCATA CATGGGATGC CGATCTGGGA AGTTTTTTTA
CACTCAGGGC ACATCTCCTT TATGTTTCAG GGGATACACC TGCTGTATCA ATGATCATGC
ACATGAAGGG CACTGGAAGC TTCCACCCCT GTCGGATGTG CAATGCAACC TCAGTCTGAC
ATTCCCAAAA CTGTAACCCC AACCAACTGT ACCTCCCCCT CCAATCATCT GCGACCGATC
CGACTGTACC CGACCATGAT CCCTATAACC TACCTTGTCG CACCCATTAC CAGTTCATTC
GCCAAGCCCG AGAGGTGGCT GCTGCATCTA GCGCAGCTGA GAAAGAAAGA CTTGGGCAGA
AACATGGTAT CAAGGGTCTG TCCATCCTTG AAAAACTCCC CACTATCGAC TTTCCTCTTT
CATTCCCTGT CAATGCAATG CACAACCTTT TTGAGAACAT CCTCAAACTT CAAGTTAACC
TTTTCTCAGG GGCTGTCAAG GACTTGGCTG ATGAGGACTT TGTCCTTACT ACTCGTCAGT
GGGCAGCTGT TGGCGAAGCA ACTGCTGCTG CCGGTGCGAC AATCCCCTCA TCATTTGCCG
CTAGACCACC GGACTTCGTC AAGAATAGGC AGGCATGCAC AGCTGACACA TGGTCTTTTT
GGCTGCAGTA TATTGCACCG GTTTTGCTGG ATGGATTACT TGAATCAAGG TTCTACAAGC
ATTTTATGCG GTTTTTGAGC CTTTTCAGAG AATGCCTGGC CTATGAGATG CCGAGGGAAC
GCCTTGCGGG TCTGAAGACA GACTGGATTA AGTGGGGTGT GGACTTTTAC CGGTGAGTTG
CATATTCAGA TTACATTATG AACTCTTCTT AGACATTGAT TTCGTTTAGT CTCTACCATG
AGAACAAGCC TAGCCATCTC CATGTTTGTT CGGCTCAGGT GCATTCAGTC TTTCACTTCG
GTCTAATTAT CGAGATGATG GGGCCCATTT GGGTATCGTG GCAGTTTGCA ATGGAATGTT
ATTGTGGTGA TCTTGGCCGA AATCTAAGGA ATATGCGACA CCCTTATGCT AGCATGTCCC
AGCGGATTTT GGCCAAGAGT CATCTACAAC ACATTGCGAA CAAATACGAC CTTCACTGGG
GAGGTAGAGG AAGGGTGAGC GATGATGAAG CCCCTAAGCG GTATGAGACG GTCTTGCCGT
CATGTAAGTC CAAGTCATAT CATATAGCGC TTGATGTATA ATTGATGCCG GCTCAGATGC
GCATTATGCT TTGTCCTCCC CGACCAATGA CGCCCAAGGC GTCATCCGAC ACAACCTGAT
GAATAAGCTC ATCAACACGA TGACCACCCA ACACGATATC CCACCGACTG TTGAGCTTAG
AGCTTTAGTG TCAAAGGGAA AGTTGAGGGT GTGGAACCGT GTTCGAGTGT TGAATGGTGG
CGATTCCATT CGTGTGTCGA GGATGGATAC AGCTGAGGAG GACAGAAGGT GTGCCTCCTT
TGTCTCGGTG AGTTAGACTC TATTTAGTCT GAAGAGAACT TTAGAAATTG ATGATGAAAT
GTTGGTTGTA GTATATGGTG TCTATCGACT GGAATAGGCG CCATCCAACC AGGCATGTCG
AGTTTGAGGA CCACCTATGT TATGGTGAGC TCCTGAACTT GATCACCATT GACCTCCCCC
ATCACCCGAA ATCCTCCGAC ACCCCTCCTG TTACTCTCGC CTTCGCCCAC ATCAAACCAG
CGGTGTCACA TAAGCACCGT ATTGGCAACC ATGAATGGAG CTATTATAAA GAGTACCAGG
CGGAGGAGAT GGTGGATCTG GCCTCTGTCA ATGAACTGGT TGGAAGGGTT AAGAGTATCT
CAAAACCTGG CACGACCTAT ATAATTGACA GGGGCACGCC CTACAGCCGC GTGTCTTTGG
TGTAAGTGTA AATGGGCGGC TAAAGCGTTA GCTGGGGTTA TTGAGTGGAG GATGGGTGTG
GCAGGCCCCG TGCAGTTGAG TGTCTGTCCT CGGTCCATCG TTCAACTGAG TCTGTCTTTG
TGTCTAGCTT CCATCATCTT CCATCATCTT CATCTTCATC TTCATCTTCA TCTTCATCTT
CATCTTCATC TTCATCTTCA TCTTCATCTT CATCTTCATC TTCATCTTCA TCTTCATCTT
CATCTTCATC TTCATCTTCA TCTTCATCTT CATCTTCATC TTCATCTTCA TCTTCATCTT
CATCTTCATC TTCATCTTCA TCTTCATCTT CACCCTTGTC ACTTATATTC CTACCTACCG
ACTTGCTTTC CTGTAGCTAC CCACCATCCC TTCCCTGTTA CTATTTACTT TACCTCTGTC
ACGGCCACTC TCTTCTTTCG CCTCTGTTCC GATTTACCTC AGTTTTGTCA CGGCTCTTCT
ATATTTTTTC TTCATTTATC GTTCTCTCTT CTAGCCTCTT CTTTGTTTAG
 
Protein sequence
MRTLQQAQEG DASNNSVMPG EAEGTNGGAQ GDGDAEMSEV SLRPTHTDIS TPQSSIFVDH 
SPASVSNPAF QTFINNSLSR VQCQISVSDN EDDSAAPSVS GEHGTESEEE LRDAQSHLDV
GGRREDMESW DEQAVAREKR EAGPRLYAMS EEDIDKIGLY SWKVDHNISR AAYNSLPDHA
NNIPFISLKE AQSLIDNYSD TLCDYYDMCK HSCLCFAGPY AKDQKCSKCG ADRFDRNGKP
FQRFLYIKII PRLQKMYNHP STKERLMYRD KSTREYDNSN IQDIFDGSMY RILLEKEIEY
FADDRDIALS FSSDGFCPFD NGSGTCWPLV LFNLNLPPEE RVHTREILSV GLIPGPKKPE
DFDSFIYPLV EELLQLTVGV HTWDADLGSF FTLRAHLLYV SGDTPANILK LQVNLFSGAV
KDLADEDFVL TTRQWAAVGE ATAAAGATIP SSFAARPPDF VKNRQACTAD TWSFWLQYIA
PVLLDGLLES RFYKHFMRFL SLFRECLAYE MPRERLAGLK TDWIKWGVDF YRLYHENKPS
HLHVCSAQVH SVFHFGLIIE MMGPIWVSWQ FAMECYCGDL GRNLRNMRHP YASMSQRILA
KSHLQHIANK YDLHWGGRGR VSDDEAPKRY ETVLPSYAHY ALSSPTNDAQ GVIRHNLMNK
LINTMTTQHD IPPTVELRAL VSKGKLRVWN RVRVLNGGDS IRVSRMDTAE EDRRCASFVS
YMVSIDWNRR HPTRHVEFED HLCYGELLNL ITIDLPHHPK SSDTPPVTLA FAHIKPAVSH
KHRIGNHEWS YYKEYQAEEM VDLASVNELV GRVKSISKPG TTYIIDRGTP YSRVSLVFHH
LPSSSSSSSS SSSSSSSSSS SSSSSSSSSS SSSSSSSSSS SSSSSSSSSS SSSSSSSSSS
SSSSSPLSLI FLPTDLLSCS YPPSLPCYYL LYLCHGHSLL SPLFRFTSVL SRLFYIFSSF
IVLSSSLFFV