Gene CNE04870 details


Gene Information

Locus tag: CNE04870
Symbol:
ID: 3257576
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006687
Strand:
Start bp: 1357390
End bp: 1359473
Gene Length: 2084 bp
Protein Length: 603 aa
Translation table:
GC content: 51%
IMG OID: 638257070
Product: proline dehydrogenase, putative
Protein accession: XP_571179
Protein GI: 58268046
COG category:
COG ID:
TIGRFAM ID:
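
The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (the record's own numbers are consistent with this) and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both included."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = inclusive_length(1357390, 1359473)  # values from the record
cds_len = coding_length(603)                   # 603 aa plus an assumed stop codon

print(gene_len, cds_len, gene_len - cds_len)   # 2084 1812 272
```

The 272 bp difference is consistent with "Is gene spliced: Yes": the genomic span is longer than the coding sequence alone. The record does not say how that difference is divided between introns and untranslated sequence, so the sketch does not try to attribute it.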


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCAA TCCGACCGAT CATGAGACCT TTTCGAATGA CAACTCCTTT TCGACAATCT 
TCTAGCTTCC GCCAGCTAAG GATTACCTGG GCTCCTTCCT TTGGCGCTGG AAGCAGCCGT
TGCCTCACTA TCGGCCTTTC TCCTTCCTCT GGTTCTGGGT CCAGCTTTCG CCGCCGACTC
CTTTATCCCC TTGCTATCCT CCCTGTTGGC TTGCTCTTAC TCCCTGTCCT CTCTGCCGAC
TCTGAGCCAG ATGCCATTCC TGTGCCCACT TCTCTCTCCA CTTCTACGAC CTCCGAGCTT
CTCCGGACTT GGTTCATCTA CGCCATCATC TCTATGCCTG GCGTTGTTGA CTACTCTCCT
GCCGTTCTTA ACTTCTTTAT CAACTCTTCT TTACGCGGCC CCACTGAATG GTTCGTCCGA
CACACCTTTT TTGGCCAATT CGTCGCTGGT GAGACCGTAG AGGGATGTAT GCCTACTTTG
AAGGCTTTCA GGGAGAGGAA CGTTGGTGCC ATGTTAAACT ACTCTGCCGA AGTAGACGAG
TCGCAGTTGA CCGAGACTGC TCCTTCCAAG GAGGAAAGGA ACAGGAAGGA GAGAGAAAAG
AAGTTTGAGA CTATCATCAC TGCTTTGGAG GCTGCTGGAG AATATGAAAG AAGCTTGCCC
GTTGACCAGA GAGGTGTTAC TGGTTTCGCT CTAAAGATCG TGCGTTCGTT CCTTGGCAAA
ATCGAAGAAA TATGTACTAA TATAAATCCT CTAGACTGGC CTTATTGACC CCAACATCCT
TGAAAGAGCT TCGTACACCC TTCTCCGATT ACGTCCTCTT GCCAAGTCCA ATTCTCCCAC
AGCCCCCAAC ACTCATCTTT TCGTCCCCTA CCCTGGTACT CCCGAAACCC TGGACCGGCA
AGTCGTTGCC CGCACTCCCG AGCTTAAGCT AGGTGATGGC AAGGAGCTCC TTGCTTTGAA
GGGCAAGTGG GATGACATGG GTGTTTTGGA AAAGGATCCT GGATTGCAAG AGGGTGACCT
TGAGGAGCTT AGACAGTTGT GGTACAAGTT GCAGAAGATT GGTCACAAGG CTAAGGAGAA
CGAGTGAGTT GATTGAACCC AGGCCATCAT ATTTGTAATG CTGACAGATC TCAGCATCAT
TCTCTATGTT GATGCCGAGT ACACTTGGTA CCAGCCAGCT TTGGACGCAT ACACCCTTCT
TCTTTCTCAA GAGTTCAATC GACCTCCCAC TTCCAAAGAG GAGATCTGGA CTGGTCCTCT
GATTTAGTGA GTTCTTCTTT CGAGTGTCTC CCTTTCATCC TTAACTAACC CATTGCAGCG
GTACTTATCA GACCTACCTC TGCCGTCAAC CCACACACCT TATTCACGCC ATACAACACG
CCGAAGTCAA CGGCTACGCC CTCGGTGTCA AGCTCGTCCG TGGTGCCTAC TTTGAGCAAG
AACGCAAGAA GTGGTCCGAC GAGGGCCGTG TCGGTGCCGC TCCCATCTGG CCCAACAAAT
CTGCTACTGA CGTCGCTTAC AATGGCTCTA TCTCCACCAT CATGACCACT CTCGCCTCCC
AACTTAAGTC TCCCCACCCC GAGCTCGCTT TGAGCGTTGC GTTCGGTACC CACAACCCTG
AGTCTTGTGA TCTCGTCTGC GAGAACTTGC TCAGGAACGG CCTTGCCAAG GAAGTAGGGG
AAGCGAAGAT GTTGAGGTTG AGAGAGGACG TGCGGGGTAA GGTTAGGATT GCACAGTTGC
TGGGTATGAA GGACGACCTC ACAGATCGTA TGGCCAGAAA GTTCGTCAAT GATGGCAAGC
CCGTTGCTCT CAAATACATG GCATACGGCA AGCTTTCAGA GGTTATGCCT TACCTTGGTA
GGCGGGCGAT TGAGAACAAG AGTTTGATGA GCGGTGATCA CGGTGCAGCA GCAGAAATGA
GGCGAGTGGC GGCCGAGTTA AAGAGAAGAT TTTTTGGTGG CTCAGTATAA GGCGCTCAAG
TGGAGATGTA AGGTGTAACA GAGTCCCGTC AGTAGTATCC GTCATCTTTT GGGGGTTTCT
TGTGTTTAGA TATAGACTGC CATGTACTGT ACAATGCATA ATTA
 
Protein sequence
MSAIRPIMRP FRMTTPFRQS SSFRQLRITW APSFGAGSSR CLTIGLSPSS GSGSSFRRRL 
LYPLAILPVG LLLLPVLSAD SEPDAIPVPT SLSTSTTSEL LRTWFIYAII SMPGVVDYSP
AVLNFFINSS LRGPTEWFVR HTFFGQFVAG ETVEGCMPTL KAFRERNVGA MLNYSAEVDE
SQLTETAPSK EERNRKEREK KFETIITALE AAGEYERSLP VDQRGVTGFA LKITGLIDPN
ILERASYTLL RLRPLAKSNS PTAPNTHLFV PYPGTPETLD RQVVARTPEL KLGDGKELLA
LKGKWDDMGV LEKDPGLQEG DLEELRQLWY KLQKIGHKAK ENDIILYVDA EYTWYQPALD
AYTLLLSQEF NRPPTSKEEI WTGPLIYGTY QTYLCRQPTH LIHAIQHAEV NGYALGVKLV
RGAYFEQERK KWSDEGRVGA APIWPNKSAT DVAYNGSIST IMTTLASQLK SPHPELALSV
AFGTHNPESC DLVCENLLRN GLAKEVGEAK MLRLREDVRG KVRIAQLLGM KDDLTDRMAR
KFVNDGKPVA LKYMAYGKLS EVMPYLGRRA IENKSLMSGD HGAAAEMRRV AAELKRRFFG
GSV
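
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, together with a GC fraction calculation. It assumes GC content means (G + C) divided by total sequence length; the record does not state how its 51% figure was computed. The function names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and uppercasing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input in the same block layout as the record, not the real gene sequence.
example = clean_sequence("ATGC GGCC\nAT")
print(len(example), gc_fraction(example))  # 10 0.6
```

Applied to the gene sequence block, `len(clean_sequence(...))` should be compared against the listed Gene Length of 2084 bp, and the same cleanup applied to the protein block against the listed Protein Length of 603 aa.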