Gene CNA01900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA01900 
Symbol 
ID3253759 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp510433 
End bp512806 
Gene Length2374 bp 
Protein Length724 aa 
Translation table 
GC content47% 
IMG OID638252523 
Productconserved expressed protein 
Protein accessionXP_566565 
Protein GI58258305 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.550521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACTC TGGATAGGGC GCAGGAAAAA TTGAAGAACG GATGGGGTAA AGAAGAAAGG 
GAAGCTCCTC TACCAATGAG ACCGTGGCCA AACCACATAG GAGGCTACTG TGCTCCAGCT
TCTGACCCTA GATGGAAGGG AAAAAGGCGC AAACCAACAC CCGCAGTGAA GGATGATGGA
TTTTGGGAAG GTGTTCTCGG TCGAACGTCG ACTCCTGTCT ATCAAGGGAT AACATCAAAA
GCATATGAAA TTCCCATTTT TCCTCATATC GCTAAACACC CATCTTTTTT ATCCATTCCT
TCCAAGTGCT CTTCATCATC TTCTAATCGC GCTCTCTGGT CTGATCGCTA TCGTCCTCTT
CGAGCATCCG AAGTGATAGG TAACGAAGTT GAAGCGACCT ACCTCCGAGA TTGGTTATCT
ACTCTCGCCG TAGGTGGGCA ACATGCCAAA GGTTCAAAGA TTGTCAGACA GGTTGTAAAG
AAACCGAGGT CGGCCTTGGT TGATGGCTTT ATTGTGGATG ATCTGGGACT CTATGGGGAC
ACACCTAATT CTGAAGAGGA CGGTGAAGAT GAATTTCCGC ACCTTGAAGA TCTCCCGGAT
CCTCCCATCT CTCATGACCT CAACGCCCGT CCCGATAAAT ACCCTTCTTT GGCCTCTCAC
CTTGCCAACA CCATTCTCCT TACTGGTCCA ACGGGTTCGG GCAAGACAGC TGCAGTGTAT
GCTGCGGCTC ATGAGCTAGG TTGGGAGGTA TTTGAAGTTT ATGCGGGAAT GGGCAGACGG
ACCGCTGCGA ATTTGATGAA GTGGGTAGGA GAATTAGGCA AGAATCATAC TGTCCTCCCG
CAGGATGGCA AGTCGCAAGG CACGATAAAC GACAATGAGA AGAAGGGGAA GAGCAGGGGG
AGAGGGAAAG GCCTCTCGTC ATTTTTTGAT AAGGGATCAT TCCAGTCTAG CAAGGTTTCC
TTAAGCCGGG GGATTGCCAG TGATCCGATA GACATTGAGT CTAACGGCGA GAGCGACAAG
ATACCAGTGA CTGAAGCTGC TAATGTTTCT GGAGGAGAAC CAGGGATCAA ATTCAAAGAA
TCATTGCTCT TAATTGATGA AGCGGACATC TTATTTGAAG AGGAAGGGTC GTTCTGGCCA
GCAGTGATCG CTTTGGCATC CGAGTCAAGG AGACCGATAG TATTGACTTG TAATGGTGCG
TACTTCATCT AGCCCGATTT CCTCGCCTCT CAATTAATCG CTGACTTTAT TAAACGGCAG
ACCATCAGCG AATACCAAGG ATTCAACTGC CACTCCAGGC AATTCTGCAA TTCCATCCCA
TTCCGTCATT CATTGCCCTC CCGTATCTCC AGGCTATCTC TTCACAAGAA TCGCAATTGC
GCGGAAAACC TTGTAACCCT TGTGTAGAGA CTATTTTTCG AGGAGCTATC CACCAAACGC
CCGAAAAGGA TGTGCTTAGC GACCAATGCT TGCCGCCTAA TGGACACGAG CGGATACCAT
TCTTTGACCT TCGACAAGCG ATGATGCAAC TGCAATTTGG GTTGACAGAT CAAATACTCC
AGAGAGGCTG CGCAAAGAAA TATGGAACCC CAGATGAGGA TGAGAAAAAG GACGATTTAC
AACTAATGAC GGAGAGGATG GAAGTTATCT CGTTCTCTGA TGCCTTCATC GATATCCGAC
CTAGGGTATT AATGGAAGTG AGCAGCTCTT CTATCTCAAA TAAATATATT ACATCCGCTG
ACGCTTCACT TTATAAGCTT TACGACGTCG ACAAGCTGCA GCCAACTTCA GATGAGGAAC
TGGATGTCGC TGCCCTGTTG AAACCAGAAA TGTACGAGAC GTATCCCATC CTAGCAATGA
TCGATAGATC TTCGGACATT GCAGGTTCAC TCGTCCAGGC CGTTGGTGGA CACCTGCCAC
CTTTCGGTGA TCTTGGCCTT GCAAGGTTAG TTTCTTCTAA TCGCAAAGAT TATGAATTTA
TGTTGACCCC AAATGAAGGG CCAAGTACAT CCGCTCCATG CTCCCCTTAC TTGACCCCCT
CATCCCTCTA TCGGAACCTT TATTGCCCCA TTCAACTCTT TTCCTTTATA CCCTCCCTAC
AATGCAAAGT ATTATTTCCG CTGACGACAT CTTTGAGGCG CTTGAACAGC AAGCTGTCGA
TAGAGGCGAC GAGAAAATTA ATCCGAGAAC GGGGAAGCCT ATGCGCCGGG TAGCAGGATA
TACATATACT AGGTATTGGG ATTTGGAGGG AGCGGAAAAT GAGGCAAGGA GGATTAGCAG
ATTAATTTGG TAGAAGTAAC AGAGGTAACA CGTCTGTAGC GATAATCAGT CAGATTCGCA
GTGCTTATGT GTAGTATCCG ACGACAAATG ATAT
 
Protein sequence
MSTLDRAQEK LKNGWGKEER EAPLPMRPWP NHIGGYCAPA SDPRWKGKRR KPTPAVKDDG 
FWEGVLGRTS TPVYQGITSK AYEIPIFPHI AKHPSFLSIP SKCSSSSSNR ALWSDRYRPL
RASEVIGNEV EATYLRDWLS TLAVGGQHAK GSKIVRQVVK KPRSALVDGF IVDDLGLYGD
TPNSEEDGED EFPHLEDLPD PPISHDLNAR PDKYPSLASH LANTILLTGP TGSGKTAAVY
AAAHELGWEV FEVYAGMGRR TAANLMKWVG ELGKNHTVLP QDGKSQGTIN DNEKKGKSRG
RGKGLSSFFD KGSFQSSKVS LSRGIASDPI DIESNGESDK IPVTEAANVS GGEPGIKFKE
SLLLIDEADI LFEEEGSFWP AVIALASESR RPIVLTCNDH QRIPRIQLPL QAILQFHPIP
SFIALPYLQA ISSQESQLRG KPCNPCVETI FRGAIHQTPE KDVLSDQCLP PNGHERIPFF
DLRQAMMQLQ FGLTDQILQR GCAKKYGTPD EDEKKDDLQL MTERMEVISF SDAFIDIRPR
VLMEVSSSSI SNKYITSADA SLYKLYDVDK LQPTSDEELD VAALLKPEMY ETYPILAMID
RSSDIAGSLV QAVGGHLPPF GDLGLARAKY IRSMLPLLDP LIPLSEPLLP HSTLFLYTLP
TMQSIISADD IFEALEQQAV DRGDEKINPR TGKPMRRVAG YTYTRYWDLE GAENEARRIS
RLIW