Gene CNC06020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC06020 
Symbol 
ID3256433 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1775365 
End bp1778480 
Gene Length3116 bp 
Protein Length793 aa 
Translation table 
GC content55% 
IMG OID638255823 
Producthypothetical protein 
Protein accessionXP_569823 
Protein GI58265334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCGATAAC AGAACATCCA CAAACAATGA CCCAGCAGCA TCTCGCCACC TTCACCTGGG 
GCGCAGGCGC ACAAACAGTG AGTTCTGCCA GCCGCCCTGA CCCGTTCCTG CGCCGTCCCC
ACTCCCAAAT CCCCATTCGC CGGGTAAACA AACCAGGGCG GAAATGTCCA CTTCGCGGAG
CCATCCATCT CTCAAGTCGT GTCAAATGTG GGCTTCATCT GCTGACTGTG TATTCCCGCT
CGTTTCTTCC CGCACTTTGC TTCTGAATGC ATCACTCAAC TTGACTCCGC ACCTGCCGCC
AACTAATAAA TAGGTTTGTG TCGCCGGTAA CTTTAACGAC TGGTCTGCCA CTGCCACTCC
TCTCAAGAAA CAATCAGATG GAAGCTTCCT GGCCGACGTT TCCGTTCCAT GGGGAGAGAA
GCAGGCGTTC AAGTATGTGG TCGATGGAGA GTGGAAGGTT AGAGAAGACG AGGCAAAGGA
ATGGGGTGGG TCTGTTCCTT TCTCTTGGCC CTGAAAGAGC TGACAAGGGA TTGGATTAGA
TGCCGCTGGA AACATGAACA ATGTCTACAC TGCTCCCGAA GGCCCTGATG ACAAAAAACC
TGTGGACAAG TCAACGGCTA CTGGTGCCAC TGGTGCTTCA ACCTCTGCTG GCGCTGGCGC
TGGCGCCGGT GCTGCCGCGG CTGCCTCCGC CCATGCCAAG GACCCCGCCA CGAAGGATGC
TACTTCCAAA CCTCCAACCA CCGCCGTCCC CCCTGCCGAC ACTGCCGACA CTGCCCCTGC
CTCAACTGGT AACAAGAAGA CCGATCCTGA GGCTCTGATT GCTGCTGCTG CCCCTGGTGC
TGCCATTGGT GCTCCTATCT TGGGCAAGCC TGTCGCCGGT GCAGAGACAG CTCCCACCTC
TGGCATCACT ACGACCACCG CTATTGGTTC TGGTACCGCT GCTGCCACCG AGACCAAGAC
CAAGGACAAG GCCCCCGTTG AGAGTGCTAC TCCTTTCGAC AACAAGGCTG CAGCCGACAA
GGCTGGGGTC GCTGAGAAGG GTACTGCCAA GGACACTCTT TCCCCCGGTC CTGTCCCTGT
CGCTATGGTC GCTGAGAAGG GCACTGCCAA GGACACTCTT TCCCCCGGTC CTGTCCCTGT
CGCTATTGCC GTCCCTACAG ACAAGAAGGC TGGTCTCACT GAGAAGGGTA CTGCCAAGGA
CGTGAACGCT CCTGGCCCCA TCCCGGGATC AACCGCCGCT TCTGCTGATA CCGCCGCCAA
GGCTGACGAG CCCATCGACC CTGCTGTAGC TGCCACCACT GACAGTTCAG GTGCTGTCCG
AGTCACCGCT GTCGACGCGA CGCCTCAGCA AATCGAAATG GTCGCTGCTG CCGCTAATGT
TGGTGAAGCT CCAACCGCTA CTGTTGGAGG AGAGCATGGT CTTGCGGAGA AGGCTGCCGA
GTACGGTGCG GCTGCCATGG CCACCTTTGG ATCTGTTGTA GGCGGTGCAG CTGCCGCTGT
GGAGAAGGCG ACTGGTGTTG ACCTTACTCA CTCTAGTCCT GTAAGCTTTT GTCGCTCTAT
TATTTTCTGC GCGAAAGCGC TAATGCGAGA CAGGCTGAAA CTAACGTACG TAACCTCTTT
ACCTCTTCTT TAGCTGTCCG TCGAAGAAGC CCGCGCGCGA GGCATCGATG TCACCAACCT
CGAAAAGGTC GATGCCCCAA CTGACACCAC TTCCCCTCAG GGATCCGCAC CTCCCGCTTC
CGCTGTCGCC GCGCTCGACG AAAAGGTCGC CGAACTCAAG TCTGAAACCA TTGGCGCTAG
TAGCACAACA GGAGTGACAG ACCAGGTGGC TATGCCATTG CCAAATCAGC AGGCTCCGAA
GAGTACGTCG TTGTCGTTGC CGCCCTCGAA GGAGGTTGAT ACTGTGAGCG TGGATAAAGG
GCTGAGTGAT TGGTTGGATT ACACAGCTTC TTACCCTGCC GACGGGTCTT CCACTCTCGG
CCACACCGCT TCTACGACAG ACAACAAGGA GAAAGACATC AAGAATGACA TTCCTGCACA
ACCTGAAACA GCAGACATGA ACCACCGAGC CGTACCTGCC CCTGTATTTA CCACAATCGC
CCCTAAGGAC CCCAAGAAGG ACCGATCTCT AGGGAGCAGT GACCTCAACG ATACTGCCGG
TACGAAGCCC ATCGATGCCG CTCCCGGTGT AGACGCAAAG AAGGCCAAGA GGGAGGAAGA
AGCCAACCCC ACAGGCGCGA CTGGTGAGAA GCCCGAAGTG GCCGAGGCTA AGAAGGCTGC
TGCTCCTGAT ACTCCCGAAG ATAACTTGAA GCTCAAGTCT GAAGACGTGG GTAAAGGTAC
TGGTGCTGGA GTAGGGTCCG CTTCCGATGG CAGAGCGCAA GTCCCTCAAT CAGGTGTCAA
GGCCGAAACT TCTCTTTCCT CCCCTTCGGC TGCAACTGCT ACTACTACCG AGACTTCTCA
ACCTGCTAGC AGCGCGACTG GAACAACCGC CACGCCTTCC AAGACGACAG CGGCGACGAC
TAACGGAACA TCTGTGCCTG CTGCTGCAGC TGGTTACGGT GGAGCTGTCG CCGGTGCCGC
TGGTGCCGTC GAGGCTGGTG GTGCTGGTTC TACTACCACT GGTGCTTCAA ATACAGCGGC
TCCAAGTACC CCTGCCGGGA CTAAGCCTAC TTCTACCTCC ACGCCCACGC CAGGAAAGGA
GTCTAGCAGT GGACCCGGAT CTGTGAAGAA GAAGACTGGT TTCCTTGCCA AGGTCTGTCA
CTTTACCATT TATTCCCGAA CGCAATGATG GAAAGGAAGA ATACTGACGA GGACTTTACC
TGTAGATCAA GCACGCTCTT TCGCCCGGAC ACAAATCCAA GTAAAGAACA TTGCCCATCG
ACTGTATTAC GAGCTCTCCA TCTATCGGCC TTGATAAAGT CTTTTGAACT TTTTCCCTTT
ACTCATTTTT TTCTCCCACT TAATCTTTTT TCTTCTAATT CTTATTTAAT ACATGGTCTA
GCGTCGTAAT AGGGGAACTG TGAAGTGAAA TATACAAAGG TCCTGGATAT ACCATGGGAT
AGTGTTTTGC GTATTATACA AAAAAGTATG TGAAATGAAA AAAAAGTGTT ATTTCG
 
Protein sequence
MTQQHLATFT WGAGAQTVCV AGNFNDWSAT ATPLKKQSDG SFLADVSVPW GEKQAFKYVV 
DGEWKVREDE AKEWDAAGNM NNVYTAPEGP DDKKPVDKST ATGATGASTS AGAGAGAGAA
AAASAHAKDP ATKDATSKPP TTAVPPADTA DTAPASTGNK KTDPEALIAA AAPGAAIGAP
ILGKPVAGAE TAPTSGITTT TAIGSGTAAA TETKTKDKAP VESATPFDNK AAADKAGVAE
KGTAKDTLSP GPVPVAMVAE KGTAKDTLSP GPVPVAIAVP TDKKAGLTEK GTAKDVNAPG
PIPGSTAASA DTAAKADEPI DPAVAATTDS SGAVRVTAVD ATPQQIEMVA AAANVGEAPT
ATVGGEHGLA EKAAEYGAAA MATFGSVVGG AAAAVEKATG VDLTHSSPLS VEEARARGID
VTNLEKVDAP TDTTSPQGSA PPASAVAALD EKVAELKSET IGASSTTGVT DQVAMPLPNQ
QAPKSTSLSL PPSKEVDTVS VDKGLSDWLD YTASYPADGS STLGHTASTT DNKEKDIKND
IPAQPETADM NHRAVPAPVF TTIAPKDPKK DRSLGSSDLN DTAGTKPIDA APGVDAKKAK
REEEANPTGA TGEKPEVAEA KKAAAPDTPE DNLKLKSEDV GKGTGAGVGS ASDGRAQVPQ
SGVKAETSLS SPSAATATTT ETSQPASSAT GTTATPSKTT AATTNGTSVP AAAAGYGGAV
AGAAGAVEAG GAGSTTTGAS NTAAPSTPAG TKPTSTSTPT PGKESSSGPG SVKKKTGFLA
KIKHALSPGH KSK