Gene CNF04020 details

Gene Information

Locus tag: CNF04020
Symbol:
ID: 3258081
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 1164709
End bp: 1167922
Gene Length: 3214 bp
Protein Length: 788 aa
Translation table:
GC content: 46%
IMG OID: 638257520
Product: conserved hypothetical protein
Protein accession: XP_571363
Protein GI: 58268414
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
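
The gene length above follows from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the 3214 bp reported in this record); the function name gene_length is hypothetical and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed for CNF04020.
length = gene_length(1164709, 1167922)
print(length)  # 3214
```

Under a 0-based, half-open convention the +1 would be dropped, so the convention is worth checking before reusing the coordinates elsewhere.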


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.43338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCGGTCGGTG CGTGATATAA AGGGATGAGC ATAGAGAACT GCATCCCATT TCTACGTTTT 
CCATTCTTAA ATGATTCATC AACAGCCAGT AATGAGAACA ATCCCATTGT TATGCTTCAA
CGACGTTTAC AGAGTCAACC AAAAGTACAA CCCTCAACCC GGAGCTCCCG AAGACAACTC
ACCGGACAGG ACAATCACCG TCTCCCAGTT CGCAGAGTTG CTTTTGAGCG AAAGAAGCAA
ATGGGCCGAT AGACAGAGTG AAGATCAAGA TGATCAAGAA GGTCCGAGCA AAGAAGGATT
GGTGCTTTTC GCTGGTGATG TTTTCAATCC CAGCGTCGAA AGTTCGGTGA CCAGAGGATC
ACATATGGTA AGTTATAACT TGAAAATCCG CCAAGAAGTG CTGCTGAGTT TATCTACAGG
TACCAATCAT GAATGCTCTG AAGGTGGACT ATGCATGTGT AGGTGAGTAT CACTCGCTAT
AACATCAAGT GATTAACTGC TGATGCTCGG TAACCAGGAA ATCATGACTT CGACTTTGGT
AAGCAAGCGG CAATAGAATT GGCCTGCAAG TCTATCGACT GGCACTAATT GAGCTTCGTG
CAGGCTTTCC CCATCTTACA AAACTTGTAG AATCTACCAG CTTCCCGTGG TTACTCTCGA
ACATCGTTGA CACCAACACG GGTCGTCAAC CGGAACCTCT CAAACGATTC ATCGTTACGG
AGCGATGCGG AGTGAAGATT GGCCTCATTG GTTTAGTTGA GAAGTAAGTT GGAGTCGGAA
CTGCAAGTGA CTTCCGTCAC TGATATGGAT TCAACATCTA CAGGGATTGG ATAGCCACAA
TTCCTTCCTG GCCTCATAAT TTCAGATATC GTTCGATGAA AGATACAGCC CTGGAGTTGT
CTCGAGAACT TCGTGATCCC AACGGTGAAC ACCAGGTGGA CATTATTATT GCTTTAACAC
ATTGTCGAGT ACCAAATGTA AGCTATACCG CCTTTCCTTC GTTACGATCA GCGGAGTTAG
CTGACCCGAT CTAGGACATT CGGTTGGCTA TTGAACTTGG AGCAGTGGCA GACAAACCCG
GAGTCGAGAA TGAACATGGT ATAGATCTCA TTGTTGGAGG TCACGACCAC GTAAGCAGCT
TGCCAAAACA TTGATATCGC AGAGCTGATT TTCAAATCTT TTGACAGATA TATTATGTAT
GTTGCGCAGA GTTTTCTTTT GGTCTTTTTT TGTTTTTTTT TTCAGAAAAA GTGTATCTCT
AACCTTTCGT AGATCGGTAA AGGTGCAACA TCTTGGGAAG GTTATGCTGG ACGAAAGGAT
GTGCCCGGAA CTACGGAAGA CCATGGTGTT CGGTAAGTGT GATTTAGTTT AATGTGATCT
CTGATTTACA TTTCATGTAG ACTCATCAAA TCCGGTACCG ATTTCCGTGA TCTCACCTCT
GCCAACCTCA CCGTCACTCC TACCCCCTTA GGTTCTATTC GTCGCCAACT CATTACATCT
TTGACAGGAA AGCACCTCTA TGTACTCCCT TCTTCACCTT CATCACCACC GTTTGAAGAG
CTTGTCAAAT CCTTGTTGTC TTCCGTATCT GAAGCCCTCA CCAAACCGGT ATGCTTTACT
CTCACCCCAT TCGATGCCCG ATCAGAAGTA GTCAGAACGC AGGAAAGCGG ACTAGGCAAT
TGGATCGCGG ATGTACTGAT GCATGCGTAC GCCGAGAGCA TTGTACAGGC GAAAGGCAAA
GAAGGAGGTC TCGGAGAAGA GTTTGAAGGA GATGCTGATG CAGCCATTCT CTGTGGGGGT
ACATTGAGAG GCGATTCGCA ATATGGGCCT GGAAAGATCT CTTTGGGTGA TATTTTAGGT
GAGCTTCCTG AAACCATGCC GGCCTGTATG TTGAGGACCT TATTCAGAAA TCCTTCCTTT
CGAGGACCCC GTTGTCTGCA TCGAGGTACG TCAGGTGCTA TGTATGGTCA CCAACTAGCC
GTTAACATTT TCTAGCTTGA CGGCAAAGGC ATATGGAATG CTTTAGAATC TGCTCTTTCG
AAATGGCCGG CTCAGGAAGG CCGTTTTCCC ATTGTCTCTG GTCTCGCCGT CAAATGGGAC
CACACGAGGC CTCCTGGACA GCGAATTTTA TCTATACACC AGATTGCGCA ACCGAAGAAA
GACGACGATG ACTGGGAAGA CCCTGCGGAC ATGGTAGATT TCAAGGAACA AGAAGATGGG
ACAACAGTGG TGGTTAAACA GAAAAAGTTG CAGTTGGGCG AGGAAGTGAA GAATGAGGAA
GGCGGAAGGA TGTACAAGGT CGTGAGTGTA GCTTCTTTTT GATGAGGCCT GAATCATCAC
TGATACATGT CGCAGATCAC TAGAGATTAC ATGGCTCAAG GATATGACGG ATTCGAAGCA
TTGAAGAATA GAAATTTTAT TGTAGACGAT GAGAATGGAC AGATTATGTC CAGTATATTG
AGAAGCTTCC TTCTCGGTAA GTCTAAAGTT ATCAGCCGTG ACCGTAGTCT GACATTACAA
AAAGGATCCT CTTATATCTT CCGTCACAAG CAGCTTGAAG AAGCTGCCCA CTCCCACCTT
TCTCGTCGAA CCTCGCAAGT GCTACTTCGT GCCCGTGCTG AACACACATC TCAGTCTCAA
TCATCTCCTT CGGCATCACT CTCTTCTTCA CCGAAGAGGA ATCTTCTGGT TACTCAATTC
AAATCATCTC AGGACCGCAT GACTTCGCCC AACCACCTCA CATCTGCTTC TTTGCCCTCT
AATGCCTTGT CGCCAAACTC CGAGATATCT GAATCTTCCG CTTGGGGTAG ACTAAGAAGG
CACGTCGTAC AACATGACTG GGGAACAATT AGGAACGCTT TGCATGTTGC AAAGCATGAA
CATATGAGCG GATTAGACGT GCTGGTGAGT AGATTCCTCC CGCATGCGGT AAGGCAGTCG
CCGATACTTC ATAGGCCGGT CAAGCTATGC GTCAAGCCCG AAATCACATG CCTGGAGCTT
GGTCACCAGT TCAAACCCCT CCAACACAGG AAATGCCTGA ATACGATGGC ATCGTACTTC
CTAAAGACGA GGATGGTGAG ACTAATCTAG CGGATCTGGC CATCGTAAGC CCTTTAGTGG
ATGGAAGGAT GAGAGATATT TCGGCCGATA AGCGGTAGCT AGTCAATTTG GACGGAAGGG
AAAGGTAAAT CTATCTGTAC TTTACAAGTT CGAA
 
Protein sequence
MIHQQPVMRT IPLLCFNDVY RVNQKYNPQP GAPEDNSPDR TITVSQFAEL LLSERSKWAD 
RQSEDQDDQE GPSKEGLVLF AGDVFNPSVE SSVTRGSHMV PIMNALKVDY ACVGNHDFDF
GFPHLTKLVE STSFPWLLSN IVDTNTGRQP EPLKRFIVTE RCGVKIGLIG LVEKDWIATI
PSWPHNFRYR SMKDTALELS RELRDPNGEH QVDIIIALTH CRVPNDIRLA IELGAVADKP
GVENEHGIDL IVGGHDHIYY IGKGATSWEG YAGRKDVPGT TEDHGVRLIK SGTDFRDLTS
ANLTVTPTPL GSIRRQLITS LTGKHLYVLP SSPSSPPFEE LVKSLLSSVS EALTKPVCFT
LTPFDARSEV VRTQESGLGN WIADVLMHAY AESIVQAKGK EGGLGEEFEG DADAAILCGG
TLRGDSQYGP GKISLGDILE ILPFEDPVVC IELDGKGIWN ALESALSKWP AQEGRFPIVS
GLAVKWDHTR PPGQRILSIH QIAQPKKDDD DWEDPADMVD FKEQEDGTTV VVKQKKLQLG
EEVKNEEGGR MYKVITRDYM AQGYDGFEAL KNRNFIVDDE NGQIMSSILR SFLLGSSYIF
RHKQLEEAAH SHLSRRTSQV LLRARAEHTS QSQSSPSASL SSSPKRNLLV TQFKSSQDRM
TSPNHLTSAS LPSNALSPNS EISESSAWGR LRRHVVQHDW GTIRNALHVA KHEHMSGLDV
LAGQAMRQAR NHMPGAWSPV QTPPTQEMPE YDGIVLPKDE DGETNLADLA IVSPLVDGRM
RDISADKR
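
Both sequences above are printed in space-separated blocks of 10 residues, 60 per line, so the whitespace has to be removed before measuring length or base composition. A minimal sketch of that cleanup, using hypothetical helper names (clean_sequence, gc_fraction) and only the first line of the gene sequence as sample input; it illustrates one way to do the calculation, not how the database derives its reported values:

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence, copied from the listing above.
first_line = "TCGGTCGGTG CGTGATATAA AGGGATGAGC ATAGAGAACT GCATCCCATT TCTACGTTTT"
seq = clean_sequence(first_line)
print(len(seq))          # six blocks of 10 bases
print(gc_fraction(seq))  # GC fraction of this one line only
```

Applied to the complete gene and protein listings, len(clean_sequence(...)) can be compared against the Gene Length and Protein Length fields, and gc_fraction against the GC content field.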