Gene CNC04900 details


Gene Information

Locus tag: CNC04900
Symbol:
ID: 3256546
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1482768
End bp: 1485759
Gene Length: 2992 bp
Protein Length: 845 aa
Translation table:
GC content: 52%
IMG OID: 638255709
Product: conserved hypothetical protein
Protein accession: XP_570047
Protein GI: 58265782
COG category:
COG ID:
TIGRFAM ID:
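
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the reported gene length runs from the start codon through the stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1482768
END_BP = 1485759
GENE_LENGTH_BP = 2992
PROTEIN_LENGTH_AA = 845


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def coding_length(protein_length: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_length + (1 if include_stop else 0))


genomic = span_length(START_BP, END_BP)
coding = coding_length(PROTEIN_LENGTH_AA)
non_coding = genomic - coding

print(genomic)     # 2992, matches the Gene Length field
print(coding)      # 2538
print(non_coding)  # 454
```

Under these assumptions, about 454 bp of the 2992 bp span is not needed to encode the 845 aa protein, which is consistent with the gene being marked as spliced.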


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.266265
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCGTG AGTATTCTGC TGCAAAGTCA CAGTATGCCT TTCCCTCCCG CTGTTTTGAT 
ACTGGACACG CGATGATAGC TTCTCGCAAG ATCTCCAGCT GACGTCACTT TGGAACATAG
TTATGCTCCG CTACGTTCAC TAGACCTCAA CATGTAGGTC GACACCTGAG ATCGCATACC
GGCGATAGGC CCTACGAATG CAAAGAATGT CCCCTGCGGT TTGCCAGAAG GTGGGTTTAG
CGAAAGGTAA GAGAAAGAGC CCATGCTGAT AGGATGGCGC AGTGACCTTC TTTCGCGCCA
TGTCAATAAA GCACATAGGG CTCCTGATGA AGGGCCCAGC GACAAGAAAC CGACTAAAAA
GGGAAGAAGG AAATCAGTCC CAGCCTCAAG CCACTCAGCG CTCACCAAAA TCAGAATTGA
AGAACAAGAA GAAGAGCAGC GACGACAGTT GCTTCAGCGA CAACAGTTTC AGCGACAGCA
ACAGCAGCAA CAGGAGCAGC AGCAGCAGCT ATTACAGCAG CAAATGCAAC AAGAAGCCGC
GGCAGTAAGG GAACGGTCCA AGAGCGAAAG TCTCAATCAG CAACCTCATC TTCAAGCTCT
CCGAATGTAT CCTCATCACC CGCTTCTGGC TTCTAGACCT GTTCAACCAT CTATTGCCTC
AAATTCCTGG AACACCAATC CAAGCGGCTC TTTCGCCGCA GCGGGCATGA TGGCCATGTC
ACCACCTTTC GACCCTACAA TTCAACGCCT CGGAGGAAAC TTACAACCTT TCCAAGGCCA
ACCTTTTTTG GTTGGGCAAC TTCCCAATGA CCCTCCTTTT ACCGGCCAAC CTATGAGGCT
CTCTGGATCA GATCAAGGCA TGAAGGCTGC AGACGGGCCT TCAGCTTCAG TCTTCTACGA
GTTGGGTATG AAGAAGAGGG CATGTGATCA GTGCAATCAC TCGAAGGTGA GATGCGACTT
TGCCGACCCT TGCCGTAAGT ATGCGCAATC TCATCAACAC AGTGTCCATT GCTGACTGAC
TGCCACATCA GTTCGGTGTA GGCAAAGGAA CCTGTCATGC TCATATCGCA AACCTCCAAA
GCTTTCTACT ACCATGGCAC CGCCTCCAGC CCCTTACAAC CTGCCTTTAA ACACAAGCAG
CACAACAGTG TCTCCAACGT CGCCGTACTC TTCCCCGAAT TCTTCAACTA GTGTCCTGGC
TCCCACCGAG CCTATTTCTT ACCGAAAGCC TTCTGTTGCA TCCCTTCCTC CAAACCTTGG
CAATGTCCCC AATCAGTCAA ATGCCCAGTC CGTGCCATGG GTATCTTATC AAAATACCAT
GGGTACCACC TGGCCATCTC CGCAAGCCCA GACCACATAC TCTATCCCTG ATGGTACGGC
CCCTGGCAGT ATTACTGGTC AGTTTGTTGA GTCTCCTGGG GCTGTACATC CAGCGTTGCC
GCCATATTCC TCACAGTCAA AAGCTCGAGC TGAACACAGC TTGTCCCCGC ATCGGTTGAG
CATGACTCAG ACACCTTCAC TGGTTGACAG TATTGATACT TCTTCAGAGA TGGACATGGA
CGATCCTGCT GAAAGAAGAG GCAGTCATAA CTCGTTTACC AGCTCAGTAC CCAGAACTCA
ATGGGGTATC AAGGGTAAAG AGGTTGAAGG GGTTTTGAGG TTGCTTCCTA ACACCGGTGC
TTTTGAGCAG AACACATCTC CTACCCAGAC TGCGTCTAAC CAGGTGTCGC CCGTCCCCGT
CCGTGCAGAT CAGCTCTCAG ATATAGTTTC CAATCCTCAG TGGCAGTCAC AAGGAACCAT
CAACCCAAAA TGGCAAACAC GGGGCAATTT CTCGTCGGAC GACGAGAGTT CTTCAGTGTT
GTCCTCTTCT GCAAACTCGA CTTTTCTCGA GCAGTCGGTT AACGCCTCTG CCAGTCATGG
AAACATTCAT CATGAACGTC GTAGATCTAG CGAAAGTGCA TGGGACAAGG CCATGGAACA
AATGACAATA CAAGATCCAC AAAAGAATAT GGCCGTAGTT GGTGAGATTG CGGACGGAAG
TGTGCCTACT GTTCCACAAA TTTCTGAGAC CGCATCTGAA GGGACACAAG AGCTGCCTGG
CCTTATCGGA CCGAATGCTG AAGAATCGCC GATGGTGCCT ACCCTTTCCG ATGTGAAAGA
TCTGTGGCGA TTTTTCATGA CTGAGCCGAT GACTGGCTTG ACTCCAGCTG GCGAAAAGCT
TAATGAACTC GATAATCTAC CCGTTGTCAC TCCCAGACCA GGCATGGGCA AGCGCACCTT
TAGCAAGTCA AGCTCGATGC CCGACCTTCA ATCTCCCTTG GTTACTGGAC CCGCGTTCTT
CTCCACATTC CTGAGCGGTA TGACTCCCAA GCCAACAGAA GCCCAGCACT CGTATATGCC
CTCGCATTGG GCGGAGAGAG ATCACTCCAG TGTCACCGAA GGCCTCGACG AGCCGGACAT
GGGGAAGTGG AATAAAGAGA TTGAACAGAG GCAGTCGTCA TTCAGCCTGG GGAAACCGAA
TGCCAAGTTG GGCAAAGGTA AACCAGATAC GTTGTCTGGG ATTAATGATG CCCCTATCCA
ACCCCCTGCG ACGCAACCTA TTCGGCCTCT TCCCAGTGTC GTCCAGCGTT CTTCTGCCCT
TGACCAAACT CTAGCGCCCG AAAGGCTTCC AAGTTTTGGT CTGACACCTG GATTCGAGAT
TCAACAGAAC CCGCTCCTTT CCAAGCTTGG TTTGGCTTCT GCTGATGCCA GGACTGGAAG
CAAGCGAATG GCCAGTTCTA CATTGGTTAA CGACCATAGA AAGGCGACTT TCACCGTATG
GGACGAAGAT GGACCTGCAA CGCAGGGTCA GGGTAATGGG GCAGGTTACG GCGCGGAGAG
CAAGGCGCTT GCAGGTGGGG GGGCTTTAGG AGTAAGTACC GCTCAGGGAA AGGGACTTCA
ACAATCACCA TTCAAGAGCT GGGCCCTCAA TACTAGTGCC GGAGCAAGTT GA
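
The sequence above is printed in blocks of 10 bases, so it has to be joined before it can be measured. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the source database. Only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGTCCCGTG AGTATTCTGC TGCAAAGTCA CAGTATGCCT TTCCCTCCCG CTGTTTTGAT"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this one line only
```

Passing the whole listing to `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` of that string can be compared with the reported GC content. How the database rounds that value is not stated in the record.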
 
Protein sequence
MSRRHLRSHT GDRPYECKEC PLRFARSDLL SRHVNKAHRA PDEGPSDKKP TKKGRRKSVP 
ASSHSALTKI RIEEQEEEQR RQLLQRQQFQ RQQQQQQEQQ QQLLQQQMQQ EAAAVRERSK
SESLNQQPHL QALRMYPHHP LLASRPVQPS IASNSWNTNP SGSFAAAGMM AMSPPFDPTI
QRLGGNLQPF QGQPFLVGQL PNDPPFTGQP MRLSGSDQGM KAADGPSASV FYELGMKKRA
CDQCNHSKVR CDFADPCLRC RQRNLSCSYR KPPKLSTTMA PPPAPYNLPL NTSSTTVSPT
SPYSSPNSST SVLAPTEPIS YRKPSVASLP PNLGNVPNQS NAQSVPWVSY QNTMGTTWPS
PQAQTTYSIP DGTAPGSITE MDMDDPAERR GSHNSFTSSV PRTQWGIKGK EVEGVLRLLP
NTGAFEQNTS PTQTASNQVS PVPVRADQLS DIVSNPQWQS QGTINPKWQT RGNFSSDDES
SSVLSSSANS TFLEQSVNAS ASHGNIHHER RRSSESAWDK AMEQMTIQDP QKNMAVVGEI
ADGSVPTVPQ ISETASEGTQ ELPGLIGPNA EESPMVPTLS DVKDLWRFFM TEPMTGLTPA
GEKLNELDNL PVVTPRPGMG KRTFSKSSSM PDLQSPLVTG PAFFSTFLSG MTPKPTEAQH
SYMPSHWAER DHSSVTEGLD EPDMGKWNKE IEQRQSSFSL GKPNAKLGKG KPDTLSGIND
APIQPPATQP IRPLPSVVQR SSALDQTLAP ERLPSFGLTP GFEIQQNPLL SKLGLASADA
RTGSKRMASS TLVNDHRKAT FTVWDEDGPA TQGQGNGAGY GAESKALAGG GALGSWALNT
SAGAS