Gene CNN02230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN02230 
Symbol 
ID3255368 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp690576 
End bp693970 
Gene Length3395 bp 
Protein Length834 aa 
Translation table 
GC content48% 
IMG OID638254633 
Producthypothetical protein 
Protein accessionXP_568709 
Protein GI58262598 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.140535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTCCTGACG TTTTCCTTCT CATCTCCGTT TTGATACTCT TACCTGTAAT CTATAGTTGT 
TTGTCATTTA ATTCATCCTT ACGAAACTCT TGACCGCTGT CCACCTAGAA GGTGACTGTT
TCACACGCCC ATAACCCTGC TCGTTCAGGC AACGACGGAA TTGGTCACGA CTCACGTTCA
ATCCCACGAT TTCAATTGCC TTATACGTTC ACCCCTACCT TTCACAGCTA CTTTACTCAG
ATATACGCCT ACTTTGACGG CTCAAGGAAA CATTTCATAA CTACGCCAAC TTATCACCGC
ATTATATGAC ACCATGACCT CATTTTCCAT ACCACTGCCA CCTCAACACA CCTCATCTGT
TGCGCCACTC TCAGATAGTT CATTTGCACC TCGCACTCCC CCCCGGTTCA CCAGGCATCG
CTCCGATACA ATGCTTAGCA CCTCGTCCTC CACCTCATCA TCCATCTTAC AGACTCCAGA
AACGCCGTCA ATTCTGGAAA CTTTGTCCAC AGGCTTGGGT TTGGGCATTC CCGCTGGGCA
ATCTAACCCA TTAGTGCAGT CACCTAACGA ATTGAGGATA AGAAGAAGAG CGAGCAGTTA
TGAGTTCGGT GCGTTATGGG AAGGGTCACC AGATGACCAT CCAGTGGGAA AGCCTGGTCC
TTCTTCGACG TTTTCAGTCG CAGGTACAGG GTGGAGACTG TGTGACAGTC CTGAGCAGCA
TGGTAGAGAT GATTACTTTC CGCCAGCTGC TAGTATAGCT CCAGCAAACG TGATGACCTA
TCAACCGAGT CCATTACACC AACAAACACA GTCAGAGGCT CCTCGAGTCG CTGACCCTGG
CTTAACAACC ACCAATCCCA CCCGTCCCGT GACGGCAAGA CGAGTGTCTT TAAAACTCAA
AGACGCAATT CGGCTCAAAC TGGGAAGATC CAAATCATCC TTCAAGGTCC TCAGTCGAAA
GACTCAGCGT GAGGGCGGCG AACTTTCTCA TGACCAAGCA TCCCAGTCGC CATCTGTAAC
CAGTAGTGAC CCGTCCACTA GATCTCGCCG ACAACGCTTG AAATCTCTCA TCTCGTTCTC
TCAATCGAGC AACTTTTCTT CACAATCGGT ATGTTCTGAC ACTCCATCCT CTGATTTGGC
ATTTTCTTCA GGCGGCTCGC AAGGAATCGC CTTGAGTGCA CCTTTGAATG CCGATGAACG
ATCGCATTCA TTCATTATAC CGACTACAGT TGATTTCAGT GTAGAAAAGG CGATTGTTGA
AGCATGCAAT GAGGAAATTG GTCAAGTTGA AGAAAAAGTG GATAGGAAAG GTAAAGGCCG
AGCAGTACAT TCGCCACATC CCTCACGAAT CCAATCCCAA ACAATCGATC CTCATAGTAC
GTTTTACGAT CTCTCGACCC CTCTTTTAAC CCAGGAACTG TCTTCTCCTC CCGATATGTG
TCCCACTGAG GAGGCTGAGG AAGCGAAGCA CCTTAGTTTT GAAGAAAGTC TTCCGAAAGA
GCTCAAGCTT CTGGTGATGA AGAAGCTGAT GGAGAGTTTT GCGGATGAAA GCTTTGGTAG
GTTCACGGGT GAGTTGCTGG GAAGGATGGA ATTGATAAAG ATGAGTACTG TAAGTCATAA
ACTTACAATA ACCGGTGCCG AGTTGATCGA GTTAGGTTTC CAAATCATGG GAGGCATTAT
GTTTTGATGG TCAGCTGTGG CCAGCTATCA ATTTAGCTGC TATTGCCCAT CTCCTTCCCA
TCTCCATCCT TCATCGTATA CTTAAGCACT CTGCGTCTTT TATTACCGAC TTTTCTCTCC
GAGGGATGGA CGCAGTCGGC GGCAAAATGC TTATCAAAGC TTTAGTAGGC GAGAATGTTG
ATATCAAGCT GTATGACCAT ATCGATCTGA CTCCACGACT CAATATCCAA CGACTTGATT
TGAGCGGGTG CAAGACTTTG ACCGAAAATG ATTTATGTTG GATCATTTCT TGCTGCCCCA
ACTTGCGCTC TCTGAATCTC AGAGGGTTAT CCGCTGTGGG CCCCAAATCG ATAGGCTTCA
TCTGGAAAAT CGAAACGCTT GAAGAGATCG ATGTGTCATA TTGCAGAGCG CTTGAACTAC
CTTTTCTTTT AAGCTATATC AAACGCATTT CAGAGGTGCA AGCGAAGAAC TTGCGGTCTA
TCAGAGCGGC CGGTCTTTTC TTCCGGAGCA ACATGCTGTT ATTGTCTATC ATCAGACGTT
GTCATAACTT GGAGAGACTT GATCTCCAAG GGTGCCACGG CTTAACCGAC GACATGTTTG
AAAATTTCCA CAATTACTGC ATCGAGGACG ACAAGTGCCT GACAAGTCTC ACCCATCTCA
ATGTCTCAAA CACCCCACTT ACCCCTGCCA TCTTCACTTA TCTCAACGGC CATCTCCCTA
ACCTGACCCA TCTTGAAATG GCCAACCTCT CGGGTGCAGA CAACCCGGAC GACGATGACG
ACGGGTATGA ACTGTCAAAG ATGTTGAAGA GTATGCCCAA GCTACGAAAG GTGGATTTGG
AAGACACCGC CGGTTTGTCA GGGGTGAGTG ATATGGTGCT CGAGGCGCTG ACGCCGATGG
ATGGGGATGT GGGGACGACG GGGTGTGAGC TGGAGGAGTT GAAGATTGGG TATGCGAGAG
TATCATCAGG GGCGATCGTG GACCTTATCA AAGGCTGCAA GAAGCTCAGA GTATTGGAAC
TTGATGTAAG CCAAGATGAT CCTCCCTTTT TTGGTTTTCT TTTTGGTCTT GTCCGACTTT
TGCTGATTCT TTCGTTGATC AGAATACGGA AGCAAACAAT ACAGTCATGC GCGAATTTCT
CCGCCGTTCG CATCCTTGTT CCCGGCTATC CATCATCGAC TGCCATAACG TCACATCGGC
TGCTTACACT GAGATTGCAG CTTCCACCAG GGCTCGTCAG GGGTGGGAAG GATGGCCAGC
TGTGCCGTTT GGTTATGATA AAGATGTGGA GATGGCTGAG AAGGCGGTGT TGAAGACGTT
TTGGGGTTGG AAGAGGGTGG TAGTCCCGAA AGGGTGGGTG GAAATGAGGA ACGAGGCGGA
GACGATGGAG AGCCGAAAGA GAGCGCGACG CGAGCAAGCG CAAGACGCTT GTAGCTCAAC
AGAAGGGGAA AGTTCAGATG GTTCAAAATG GAAGGGGAAA GGGAAAGCCA AGGATGATGT
TGATGGCGAT TCGGGGAGAA CCAGGCCGAG AATGAGGAGT AATGGTTCGA TCAGCCGTGA
ACCTGTGGGG TGCATCATTG CATGATGACT CTTAACGTCG GGTATAGGTA TCATTAGTTT
CAACCGCTTT AAATGGTTGG AGTTTCAAAT GCCTAGTGAT TTTTGTATGC CCATACGGAT
ATGTTTTGTT GTTTACATTT TTGAGTGCAG TTAAC
 
Protein sequence
MTSFSIPLPP QHTSSVAPLS DSSFAPRTPP RFTRHRSDTM LSTSSSTSSS ILQTPETPSI 
LETLSTGLGL GIPAGQSNPL VQSPNELRIR RRASSYEFGA LWEGSPDDHP VGKPGPSSTF
SVAGTGWRLC DSPEQHGRDD YFPPAASIAP ANVMTYQPSP LHQQTQSEAP RVADPGLTTT
NPTRPVTARR VSLKLKDAIR LKLGRSKSSF KVLSRKTQRE GGELSHDQAS QSPSVTSSDP
STRSRRQRLK SLISFSQSSN FSSQSVCSDT PSSDLAFSSG GSQGIALSAP LNADERSHSF
IIPTTVDFSV EKAIVEACNE EIGQVSKSWE ALCFDGQLWP AINLAAIAHL LPISILHRIL
KHSASFITDF SLRGMDAVGG KMLIKALVGE NVDIKLYDHI DLTPRLNIQR LDLSGCKTLT
ENDLCWIISC CPNLRSLNLR GLSAVGPKSI GFIWKIETLE EIDVSYCRAL ELPFLLSYIK
RISEVQAKNL RSIRAAGLFF RSNMLLLSII RRCHNLERLD LQGCHGLTDD MFENFHNYCI
EDDKCLTSLT HLNVSNTPLT PAIFTYLNGH LPNLTHLEMA NLSGADNPDD DDDGYELSKM
LKSMPKLRKV DLEDTAGLSG VSDMVLEALT PMDGDVGTTG CELEELKIGY ARVSSGAIVD
LIKGCKKLRV LELDNTEANN TVMREFLRRS HPCSRLSIID CHNVTSAAYT EIAASTRARQ
GWEGWPAVPF GYDKDVEMAE KAVLKTFWGW KRVVVPKGWV EMRNEAETME SRKRARREQA
QDACSSTEGE SSDGSKWKGK GKAKDDVDGD SGRTRPRMRS NGSISREPVG CIIA