Gene CNF04880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF04880 
Symbol 
ID3258266 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1420017 
End bp1423302 
Gene Length3286 bp 
Protein Length849 aa 
Translation table 
GC content51% 
IMG OID638257606 
Producthypothetical protein 
Protein accessionXP_571456 
Protein GI58268600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTCCA ATCCTCCCAC CATGCGCCGC GGTTCCATCA TAGGAGGAGG AGGAAATGTT 
GGCGGCAGCG AGGACTGGCG ATCAAACAAA GATTCGCCGA CGTCTCCCTT TCAATCTACC
AAGTCTAATC ATCGACGAAG TCGGTCGCCG TCCCGCCGAG CTCATTCGAT AGGTGAGTAC
CAGCGGCCTC CGCGGTATCT GTACGACTCG CCTGCTCCAC CTCGTGACAA CCCCTTTTCC
TACAACCTAT CTTCATCGAG CCTTAGCCAT AGCAGTAACC CTTCTATCCC TCCTCAAACC
CAAGCCCAAG CCCAGGCCCT TCATCAACTC CCATCCCAGT CCCATCTGCA TTCTCATCCT
CATTCCCAAA CGCAACTTCA TCTCCCCCCC ATCTCCTCAT TCAGCACCTC CACCTCCGCC
GGCACAACCA TCCCTGTCGG ATCTCCTGTG GAACACTCTC ATGGTACCCT CGTCCTCGGC
AAATCCGGTC GTTCCCGGTA CTTGGGTCCC ACAGCCGGCA CTGAATGGCT TAAAAACCAA
GAAGTCGGTG GGATCGGTAT GGAGACACCT TCGAACTCTC GTTCACCTGT GGATACTACC
AATCCTGGGA ATGGAAGTCC TGCAGGAGGA AGAGGGGAAA AGGGGGGAGA GGAAGGTGTT
TATCATGGAA GAGAAAGTGG AGGAGCATAC GAAGACCCAC TATACTCCTT TCCATTTAAC
GAATCTGGAG AAGTATCGAC GGTGGAAGCA TTGTTTGCGC GATTACCCCC AAGAGCAGAT
GCAGAGACGT TGGTAGATTC TTATTACCGT TATTTTGCTT GGAAGTAGGT TGCTCGTCCT
TTTCTGGCCC CGGATGGGGG TCATGTTTCG CGGAAACTGG ACTGATATTT ATTTTTCCCT
CACCTCCGCG CCCACTAGCC ACGACCCCGC TCCCCGCCGG ACCTTCCAAC CAATCTTCGA
CCGCGTCTAC GCCTCTCTGC TCCATCCCCG TCCCGAAAAT AGCGTCCACC TCCAACAACT
TGCTCTCGTG TACATGCTTC TTGCGATGGG TACCGTCCAC AACATAGAAT TACCCCCCCA
CGATGAGAGT GCGGAAGAGT ACTTGACTTT GGCGCAAGCG GCTATGACGA AAGGGAATTT
TATGAACCAT GCGACAATTG CGGGGTTGCA GACTTTGGTG AGCCTTCTCT CTTTCCTCTT
AACCATTTGC ATTGGATAAG ATGAAGATGA CTGAGCGATT CTAATCGTGT TTCGCCCTTC
TTGTAGGTAA CGATGGCTCA CTATTACCTC GAAACGGAGA GCGGACGAAA CGGGGATTCC
GCGTGGCCAT TATGGGGTCT AGCGATGAGT CTTGTCGTCG CTGTAAGTCG TTTGTTTCCT
CAACTCCCAA CCCGATTTTA CTCCTTCACC CGGTCCCTCC CACCCTCATC CATCCCTGCG
CTCCCTTCCT TGCCCTTTCT CCTCCTCACT TCCCCTATCC CAACTCCAAC ACCAAACGTG
TCTCTTTTCG CGGACATTCA GAGTTTAAGG TATAGGCTGA TGGATTCCTT TTGATCGTAG
ATGGGATTAC ATCGAGATGG AGCGAGATGG AATTTGCCAG ACGATGTTGT TCAAGAAAGG
CGGTATGTTT TCCACTTGTC TTTTTCTATT CGCTTCGAAA TGTATTATAC TGAAGCGACT
TCTCCAGCCA AGTATTTTGG GAATGCCACA CCATCGAAGT CTTTCAAGCC AACTGTTTTT
CTCGACCCAA CACCCTCGTC CCGCGCTACA TTGACACCGC CTTCCCTTCC CCCAACTCCG
CCGAAGTCGC CATGGGTGGC AAAGGTTGGC CCACTCTCAA ATTCGAGCTC TGCCAAATTT
CATCTCAGGT TCTTGATGCG GGTATGACCG TTCATTTTCA ATCCTACGAT TCTATCCAGA
AACTTTACGG CCAACTATGT GAATTCGAGT TGAACGTCCC TTACGACCTC CGATGTCGTT
CTGCCCTCTT GGCGTTACCG TCAGTATACC CTGACCCGGA GATGGCAAGG AAAAATAGTC
CAGAGATAAG CCGGCACAAT CTCCATAGGA CATTGCAACA GTTCACGCTC TCGTTGAATA
TATCGGAGAA TATACTGTTC TTACAAAGAC CGTATTTTGT GATGGCGATG CATGATGAGC
CAGCAGATCC GACGAGGTCA GTGTATGGCC ATTCGTATCT TGCTGTCGTA GAAAGGTGCA
ACGTAAATGC TTTTCCTCCT TTTCTGATCT CCGCATTTTG CTAACTTTAC CGATAACAGG
TCATCATCCA AGTCGTCTCC GACTTGTACA AACTCCACCC CACCATCATC TCCCGTCAGT
GGTTCTTCTG GTACCATCTC TTCACCGCCG CCGTTTGCTT GGGCACTCTT ATTCTCAAAA
ATCCCCAATC TGCTCTTGCA ACATTTGCCC TCTCCCAAAT CGAACAAGCG ATCAACGTTT
ATTCGGTACT GATCAAGCAG AATAACTCAC CCTCGATGGT GCAGAATCAT GATTGGTTAC
TGAGGCTTCG ACAAAGGGGT GCCAAGAAGA TTGCGCAGGC AGCTGGAATG GGAGGGACGA
ATCTGCCCCT GGGCGTTGGA CCTGGAGGAG GAGGTGGAGA TGGGGACACA GGAGGCCAAG
AAGAAGAAGA TCGTGAACTC CTAGGCTGGA AAACCCGACT TATCGAACGC GCTGGCTCTG
GCGTCCACAC TGCCGTCAAC ATCTCTTCCT CCAACCCCGC CAGCTCAGTG CCACACCGTA
CACCCTCGCC TGGATCTTTG ACGCAAAATG GCGGCGGTAA TAGCGGGATG ACCCCAGCTA
TGCACTTGCT TCAGCAACAT TTTGTACCGC CTTTCCAAAC TCCGCCTGTT AGTGGAATGT
TGGGTACGAC GGCGCAGACT CTGGGAATGG ATAACTCGAC GGATTTACTG GTGAGTAAGA
GCTAACTGAT TTTGTAGTCA CTCGTCTCTT CTGCGTTCGG AGAACGATAA GCTGACAAAA
TTGCGCGCAA CAGCTACACC AATTTTGGGA TCCAATGATG ATGGCAGATT CCACAAACAT
GACCGTACGT CCTTACTTTT TTTCATTTTT TCCTTTCATC CTTGCTGCTC TCAATATCAA
AGCTAACTGC TCTTTTTTTT CCCTGTTAAT ACAGCAGAAC GCAAATTGGT GGTCGTGGGA
TTTTGGAGGT CTAGCGGAGA ACGGCACTCC CATAGCTGGA GGAGCTGCAG GATCGCAAAC
CCAACCTCAA GCAACCCCCT AACCCTAGAT TTGAGAAAGG ACCTGT
 
Protein sequence
MDSNPPTMRR GSIIGGGGNV GGSEDWRSNK DSPTSPFQST KSNHRRSRSP SRRAHSIGEY 
QRPPRYLYDS PAPPRDNPFS YNLSSSSLSH SSNPSIPPQT QAQAQALHQL PSQSHLHSHP
HSQTQLHLPP ISSFSTSTSA GTTIPVGSPV EHSHGTLVLG KSGRSRYLGP TAGTEWLKNQ
EVGGIGMETP SNSRSPVDTT NPGNGSPAGG RGEKGGEEGV YHGRESGGAY EDPLYSFPFN
ESGEVSTVEA LFARLPPRAD AETHDPAPRR TFQPIFDRVY ASLLHPRPEN SVHLQQLALV
YMLLAMGTVH NIELPPHDES AEEYLTLAQA AMTKGNFMNH ATIAGLQTLV TMAHYYLETE
SGRNGDSAWP LWGLAMSLVV AMGLHRDGAR WNLPDDVVQE RRQVFWECHT IEVFQANCFS
RPNTLVPRYI DTAFPSPNSA EVAMGGKGWP TLKFELCQIS SQVLDAGMTV HFQSYDSIQK
LYGQLCEFEL NVPYDLRCRS ALLALPSVYP DPEMARKNSP EISRHNLHRT LQQFTLSLNI
SENILFLQRP YFVMAMHDEP ADPTRSVYGH SYLAVVERCN VIIQVVSDLY KLHPTIISRQ
WFFWYHLFTA AVCLGTLILK NPQSALATFA LSQIEQAINV YSVLIKQNNS PSMVQNHDWL
LRLRQRGAKK IAQAAGMGGT NLPLGVGPGG GGGDGDTGGQ EEEDRELLGW KTRLIERAGS
GVHTAVNISS SNPASSVPHR TPSPGSLTQN GGGNSGMTPA MHLLQQHFVP PFQTPPVSGM
LGTTAQTLGM DNSTDLLLHQ FWDPMMMADS TNMTQNANWW SWDFGGLAEN GTPIAGGAAG
SQTQPQATP