Gene CNF04870 details

Gene Information

Locus tag: CNF04870
Symbol:
ID: 3258256
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 1416302
End bp: 1419734
Gene Length: 3433 bp
Protein Length: 857 aa
Translation table:
GC content: 51%
IMG OID: 638257605
Product: conserved hypothetical protein
Protein accession: XP_571454
Protein GI: 58268596
COG category:
COG ID:
TIGRFAM ID:
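
The reported gene length is consistent with 1-based, inclusive coordinates: end position minus start position, plus one. A minimal Python sketch of that arithmetic follows. The function names are illustrative and not part of the database, and the count of untranslated bases simply assumes three bases per amino acid.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def untranslated_bases(length_bp: int, protein_aa: int) -> int:
    """Bases in the gene span that do not encode an amino acid
    (introns, the stop codon, and any other untranslated sequence)."""
    return length_bp - 3 * protein_aa


length = gene_length(1416302, 1419734)
print(length)                           # 3433
print(untranslated_bases(length, 857))  # 862
```

The second figure is expected to be non-zero for this record, since the gene is listed as spliced.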


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACTCT CCCTCTGCAC TCAACCCCCA GGTACAACGA TGCCCAGAGG CCCGTCCATC 
CCGCCCTACC TCACCGCAGG AGCATCCTCC AGGGCGCACC ATGCTCTTCT TGCACAGCTT
TCACAGGCTG ACTCCACCCA GGAAGAGGAT CAGATCGTTG CCCACCATCT AACCCAGGCC
AGGGCTGTCT TGCACTCGCG CGATGTGAAC ACTGTGGGTT TCTCTAAACG TGCTGCCGGA
GGGTATAGCG CTGAGGGAAC GGCCAGACCA GGATAGCCGA GAGTCTGATT GTGATACTGC
ACTGCACGAT GTTGAGGCAC AACGTGGATG ATGATTTGGA TTTCGCTCTG ATGCCGGCAC
TGAAGCTTGC CGAAGCTGGC AAGACCATCC AAGAAAGGCG GATAGGTATG TTGCATTGGC
GCCGGTGCTG GTAATTGACT AATGCCCTTG GGGATGTAGG GTACTTGTTC TTGGTAGAAC
GCCTTTCCCC GGATCACGAG CTGCACCTGT TGTTGGTAAA TACCATCCGC AAGGTATGTC
GCGGATTCTG CACACGCAAA ACAGAGGTCA AGTCACTGAA AAGGCCGGAT GCACCCAGGA
CCTATCGTCG AACCAGCCGG CAAATATACT TCTTGCGCTA CACACGATTG TCAAACTACC
CTCACGCGAT CTCGGGCCGG CCGTCACCCC GCTGCTCATC GCAAAACCCC TTCTCCGTCA
CACCTCGTAC GTTTTTTCCT CTTCTCTACA TATATCGTTT TCCGTTCCAA GGCGCTGACT
GAAATCCTTG GCCGTTCTAC CATTCTGTTT CTAGAGCTGC GATACGGCAG AGAACGTACC
AAGCGCTTGT GGCTCTACAC CTCTCCTCGA CCTTTCCCCG TACACCGCAA CCACATTCGC
AATCCCAATC TATCCGCCCT CACTCTCCCT TTCCATTGTC AATGAGCAAA GTCGTGAAAG
CGGTGTGTAA AGAAAAAGAT TCGTCATGTC TTTGTGTGCT TTTCAAATTA CTCGGGCGGC
TCATCCATGT AAGTGGGGTC GTAAATGTAC GTTGTAACTG ACGGGTCGTA CGCGAAAAAA
AAAAAAGACT GGAGCACATG GGATAAAGAA TGAAGAGGAG AGGGGGTACC TAGTACAGCT
CGTGCTCGAC AAGACGGAAG AAATGGGGTG GCTCGAGCAA GGGCAGGTGG AAGGGGGAGG
CGACGGGGGA GAATTGATAC TCGAGTCAGT GAGGTTGTTG GGGACCTTGG TCAGTGCCGA
GATCTTGGAG ATGGATTCGG CGGAACGGGG GGAGGAAGTG CGCGAGCAGA TTGAAGAGTG
GATACGGAGA AAGATGGAGG AGATGCGGAT GGACGGTTCT TGGTCTCGCT CTCGTTCTCA
GTCTCGCTCT CGCTCTCGCT CTCAGTCTCG GTCGCAGTTT CAATCTCAAT CTCAATCTCA
GTCTCAGTCT CAGTCTGGGA CGGCTGTGGG GTTGACGAGG TGGAAAAAAG GTGAGTTTAT
GTGTCATCTT CTCTGGCAAA ATGACCCAAA ATCCTTTCCT CGCTAAACAT GTGTCCTGTA
AGCGTTCCTT CTAGAAGTAT GTGGTATAGC ACCCATGGTA CCGGCAGTGA TCGGGCATTG
TCTGGGGGTT GTGTCCCGTC TTTTGGTACC ATCTTCCTTC CCCTCAACCT CTTCCTCCTC
ATCAACCCCT GCCAACAAGT CTACCCACAC CCACACCCAC ACCCACACCC AGACCAGCGT
TTTACCCCCG CCAAACGAAC ACGTTCTAGC CCTCCGATGT CTTTTACTCC TCCCACCGCA
GACATGGGAC CGAAGTGAGA AGATGGGGGA AGGGGAGATG GGAGTGGTGA TGGAAGGTGT
TGGGAGTGGG GATACGAGTG TTCGGCGATT GGTGCGTCTT TTTTAGGGGG GTTGTGTGGG
TGCGAATTCC GCTGATGTTT TGATGCCTAA AAGACGATAG AGTTACTTTA CGCGCTTTCG
CCAGATTTGG CGAGAACGGT TTTCGAGAAT TATTTGGAAT CGATCGTCAC GTCGACAAAT
CTCTCGCTCC CGTTGGACAT GCAAACACAA ACACAGATAC AGACCCTGGG AATGGAGGAG
AGGGTGAAAG TGAAGATGGG AAGGAGAGAG ACGGCAGGCA GGGCGCTGGA AGTTTGTGAG
GTTGTGGCGC GTGTGCAGGC GCGTGTACAG CCATGGCCAG GGGTACAGGG AAAAGAGGGG
GAAAGTGGAA ATGGGGACCG AAGCGTGGTG AATTGGGGAG ATGTGGTGAG CGTGCTGGTG
ACGCTGGGCG AAAAAGAGAA AGAGAAGGAT GGGGATGGTT GGGAAGAAGG GGTGAAGAGG
GTTATGGATT TGGTTAGGCT CTGTGAGTTC CAGAGTTCTC CTTCCATCAT GTTATCTAGC
GCAGCCCTGC AGAAAAGAAA AGCTGGCTAA CCACTGTTTT TATGCCTGCC ATTTCAGACG
ACCCATCTGG CGAATTGCTA GTCGATGCTA TACACGCGCG TTACGAACGG CGCATTCATG
AAACCGAAAG AGGACGGCAG GGCCATGTGC ATGAGCATGA GCATGAGACT GAGACTCTCT
GGATCCAGAG TCCGACAGTG GTCATGTTGC TTGGAGCGGT GGCGTGTGAG TTTTCCGCTT
TGGGTTTAGG GGATGGAAAG GCAAGGCAAA ACAAGCGAGC GAGTCTGAGA GAGTTGCTTT
CGACTGCCGC TTTGGGCGGT TCTCGTGCTG GCATGATCAG CAGCGTAAAC AGCACAATCG
GGGTGATCCC TACAACATCC ACAGCGAGAG CGACGTCCGT GCAATCGCCC GCATTACAAG
AGATTGTTTT ACTGGCGTTT GTGGCTTTGT TGAGAGACGA AGGAGAGGGA GAGGATATGG
AGAGGATACG AAAGGATGTA GAATCTCTCG CCAAGGGAGC GCAGGGGTAT ATAAAAAGAG
TGAGTTTTTA AACAGGTTCC TACCTCCCAT CGCGATGATA GGTAGTTGCT GATCAATTTG
GTTTTGTGGG TGTCATCAGC GATGTGACGA AGTGATTTAT ACAATTGATA ACGGGCTGGC
TCGAGAATTA GGTGGTGGGG CAAAGTCGAG TTCGGTATGT CATTTTCGTG TCCTTTTTTT
TTCTTCATCT TTTTATGAGG ATATTTGTGA GCTGATTTGC TGTTAATTCC TTGCCTTCTT
ATTTGTTTGT TTTTCTCTCT TCCTAGCTTT CCGATATCCG CACGTCTCTT GTAGATACTG
TGGCCACACA CAAAAGACGG GCATCCTTGA AAAGTGAAAG CGAAACGAGA TCGCCTCTGT
CTTGGTCGAC TTTGACGGCA ACAAAGCAGC TTCGCTATGA GGCGTATGGT TCAAAGTGAA
TTCCACAAGT TGTATATTAT GCTTTCCTAA TGGTATATTT GTATTTAAAT ATTATCTTGT
CATTATATGC ATT
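
The sequence above is displayed in space-separated blocks of ten bases. As a rough sketch of how such a listing could be turned back into a plain string and checked against the reported GC content, the Python helpers below strip the whitespace and compute the G+C fraction. Both function names are hypothetical, and the definition used (G and C counts divided by total length) is an assumption about how the database computes its figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


example = clean_sequence("ATGC GCGC")
print(example)               # ATGCGCGC
print(gc_fraction(example))  # 0.75
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the 51% listed in the Gene Information section.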
 
Protein sequence
MRLSLCTQPP GTTMPRGPSI PPYLTAGASS RAHHALLAQL SQADSTQEED QIVAHHLTQA 
RAVLHSRDVN TTRIAESLIV ILHCTMLRHN VDDDLDFALM PALKLAEAGK TIQERRIGYL
FLVERLSPDH ELHLLLVNTI RKDLSSNQPA NILLALHTIV KLPSRDLGPA VTPLLIAKPL
LRHTSAAIRQ RTYQALVALH LSSTFPRTPQ PHSQSQSIRP HSPFPLSMSK VVKAVCKEKD
SSCLCVLFKL LGRLIHTGAH GIKNEEERGY LVQLVLDKTE EMGWLEQGQV EGGGDGGELI
LESVRLLGTL VSAEILEMDS AERGEEVREQ IEEWIRRKME EMRMDGSWSR SRSQSRSRSR
SQSRSQFQSQ SQSQSQSQSG TAVGLTRWKK VIGHCLGVVS RLLVPSSFPS TSSSSSTPAN
KSTHTHTHTH TQTSVLPPPN EHVLALRCLL LLPPQTWDRS EKMGEGEMGV VMEGVGSGDT
SVRRLTIELL YALSPDLART VFENYLESIV TSTNLSLPLD MQTQTQIQTL GMEERVKVKM
GRRETAGRAL EVCEVVARVQ ARVQPWPGVQ GKEGESGNGD RSVVNWGDVV SVLVTLGEKE
KEKDGDGWEE GVKRVMDLVR LYDPSGELLV DAIHARYERR IHETERGRQG HVHEHEHETE
TLWIQSPTVV MLLGAVACEF SALGLGDGKA RQNKRASLRE LLSTAALGGS RAGMISSVNS
TIGVIPTTST ARATSVQSPA LQEIVLLAFV ALLRDEGEGE DMERIRKDVE SLAKGAQGYI
KRRCDEVIYT IDNGLARELG GGAKSSSLSD IRTSLVDTVA THKRRASLKS ESETRSPLSW
STLTATKQLR YEAYGSK