Gene CNI00810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI00810 
Symbol 
ID3259543 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp196833 
End bp200109 
Gene Length3277 bp 
Protein Length988 aa 
Translation table 
GC content48% 
IMG OID638258566 
Productexpressed protein 
Protein accessionXP_572741 
Protein GI58271170 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAACCTTCAT CTGTCATCTT TTCGCCTCCA CGACCTACAA CAAATAATCT TGAGAGCCCA 
ACATCAGTAA CAAATAGCGC AGTCACGAAG GTTGATTAAT ATTGTTGTTG ACTGACAAAA
TGGGACCTCT ACGTCGTCAG CCAGCTATGC TCGACTTGCA CAGCGCCAGT CGCGCTGTTA
GCGTCGAATG GGATGACTGG GACAAGGATC CATGGGGACA GGAAGAGATA AACCATGGAA
GTCTCAAGAG ACAAAGCACT GATGTACTGG GGGAGACTGA GGATCGCAAA AGAATCAAGG
AGGTGAGGCA GGTAAGCAAT ACCCTGTGTT GATTGTTCGC TCCATAAGCT GACATAGACT
GACCAGAGCG ACAAGTATCG GAAGGCATCA TCCATTGCGA AGAGGTCTGA GCCTCTCTTG
AGAAGACCGG TAAGGTCATC AAACTCGCCA GACACCACCA CCATTGTGCT CGAACCACGA
AGTCGGTCGT TTTCATCCGC CCGCTTCGAA AAAACTGTCA CTTATGACCA TGCATCGGCC
ATATCGCCTT GTTCCGAGTA TGGCGAAGGA ATCTCTTTGG TTCTAGGAAT TTCAAAACCT
CCCCAACGAT TCCACGCATC AGAGTATCCC GAGTCAGCCT ACTCCACTGA ACGACCAGAC
CTTGCTGTGT TTGGACAACC CGCCTCTCAA CCAATGGTCA GAGAATATAC TTCTATGGGC
TGGGATGGAG AGAATGTTGA GAACCGCTGT GATACCGGCA AAGTTCTTGA AGGGCTCCCA
CTGATTAATA CCAGTAGATT AGAGCGAACG AGGAAGGGTA ATTCCAAGTC GCCATGGCAT
CAAGTGTCGC GCAAATACTC TCGGGCGTCA CCTTTGTCAA ATTCTACTGG AGACCATGAT
GCTCGAACGC ATACCCTTTT GCATACAATG AAGAGAGGCA AATCCAAGTC CTATCAAGGT
GGCCTTTCTC TCAATTTTCA GAGCGTTTCT CTCTCACCAT CACCTATCGC ACCCACACCC
TCCCCAACAT CCTCATCTCA CTCTAGTCAA TCATTCCTTT CCGCCTCCAT GTCTGATCAT
TCCGTCCGTA TTCTCACAAC GATCACAGCC CAGACGCGAC CACATATGAA TTTCTTGGCC
AATGCATCTG ACGTCTCGAT CCTGGCGCCT TCCACCATCG GAATAGGCTA CTCTTCGTCT
GCTGATACGG GCTCTCCAGG TAGTTTGCCA GAAAGTCTGG AAGCGGAGTG TGTCGATCTG
ACGGAGACTG GTCGTCGGGC CTCGCATGCG AGTAAGAGAG ATAAGCGACA TTCGTGGATG
AGCCACACGA ACACGGTAGA CCCGGAAACT GTCGAGATAA TCACCGAAGA ATTACCATCT
ACCCAGCCTT TGAAGTCCGA TCCAGTATAT CTCAGGCCCT GCTACATTCT ACCTGTTGGC
TCTCAAACCC CTTTACATAC TTGTTCGTCT GCGCAAGGAC CAAAATTGAA AAGAGCACAT
ACCTATGCCA ACTTGAAGGA TCTCTGGAAA CTCGCTTCAT TCCCCGGTGG GCTTATCGAG
GTGCAGAAAG ACGAAGAACC CGTAAAGCAA GAAAACCAAG GTCTATCCTC TCAAATTCTC
GAACCTCCTG CTATCATCCA ACCACCATCG CTTGGTCCGC CAATCGATAT CCTATCCCGT
CCAGTCTCTT CTGCCATATC GCGTGACACC ACCACAACAG CTCGAGGTCC TCCAACTACC
CATTCTTACT CTCCCTTCCG ATCAGTATCC ATGTGCTCTC CGCCGGATAC AGTCGAAGAG
CTCTTTCCTG AATCCCCATT CAACTCCCCC CAACCTGGCA ATATCGGCAA AATCAAACGA
TCCCCTTTTC AAGCAACTCG TAATTCCTTG AAAGGGATGC TAAACAAGTC TTTCAAAGGC
CGATCACTGG CCCATGCGTT TGATAATGCG GAACAGAACA GTCAGCAAAG AGAGTCGTCT
TTGAAATCAA AGATATCTGG GCCTTTGCTG ATCAGAGGTA TGCGAGGTGA GATGGATAGG
AAGAAAAGCG ATAAAGAATG GAGAGAAGAG GTCTTGAAAG ACGTTGTAGG AAGGACGCTG
TCTTCGCAAT TCAAGCTCCT TGAAGAGGTA GCTAAGCAGC AGCGAGAATC TTTACTTGAT
AAAGAGAAAA TTAGCCAGAA GACTCCGTCG TTTGGGATTG GCCTGAAAAA CAGGAGCCCA
ATACCAGAAA CACTTTTGAG CGCCGTTTCG ATATGCGACA AAGAAAATCC CATCACTGCT
GATTCGCACG CAGACAGAAT GAAAATGAAG AGCGGAACAA GACCGAGTAT GGAAAGTAGC
TTGAGCTTGA GGATGGTAAC AGATGAGACA TTCAACAACC CGAAAACCGA GGGTCTGACA
TCGTGAGTCT ACAGCCCACA GATGAAGGGC AATCGTTGAC ATGTCTTAGT GGACATCTTG
CCAATTCCAT TGGCCGTAAA AATCTCACAA ACCAAACCCC TCCGCATCGG CTATCGGGCA
AAGGCCGTCC TACCTCTGCC CCACTTCTCT CTGCATGGTC TCCACCACGT CCCAAATCCG
AAAAGCTGTC TCAAGTGATC AATCCACCAA TTGTCGTCAA TCATGCGACA CCCCGTCCCG
AGGAGTCTCC TAAGACCAAC CCGCATCATC CGCCTTCATC TCCTTCACCG TCGCCAAAAG
CGGGAAAAAG TAAGAGCGGG AGGATCTCTA GAAGGTCAAG TATTCTGAAC TTGCTCAAAT
CCAAAGATTC AAAACACCAG AACGCCTCCC AGAGTCCTCC TAGACCAAAG AAGATGGCAA
GCGGGACGTT CACTCTCACA GGTAGCATTC GCTCCTCTTT CTTGGCTATC ATTAAGAATC
ATATTCAAAC CATGCAGCAA GAAAAAAGAG GTAAATATTC TAGGACACCC TGCATCATCC
CTCCTTCCAC GTTACCCCTT CGTACACCTA TCCATCAACA TACTGAACTT TATCGACCTC
TTTCAAGAGC GAGATCATCG TTCGAGCTGA CTTTGAACCT TGAGCCTGCA AAGCCATTAT
TGGATGAGCT GCTGGCGAGG GATGACGCGC TGGGCTTTTT GGAAGGAAGA AATAAAAGGG
AGCAGAGAGT AGAGGTTGAT ATTGAGAAGG TGCTGGAATG GCGTAAAGAA GTGGAAGAAG
ATATCTAGAG ATTAGCATGT GGTGTGCGAA ACGATGTGGT TTTGGGCTCT GTGTAAAATC
TTTAACTGTT GGGTGGCCTA ATGAGTAATG ACGATTA
 
Protein sequence
MGPLRRQPAM LDLHSASRAV SVEWDDWDKD PWGQEEINHG SLKRQSTDVL GETEDRKRIK 
EVRQSDKYRK ASSIAKRSEP LLRRPVRSSN SPDTTTIVLE PRSRSFSSAR FEKTVTYDHA
SAISPCSEYG EGISLVLGIS KPPQRFHASE YPESAYSTER PDLAVFGQPA SQPMVREYTS
MGWDGENVEN RCDTGKVLEG LPLINTSRLE RTRKGNSKSP WHQVSRKYSR ASPLSNSTGD
HDARTHTLLH TMKRGKSKSY QGGLSLNFQS VSLSPSPIAP TPSPTSSSHS SQSFLSASMS
DHSVRILTTI TAQTRPHMNF LANASDVSIL APSTIGIGYS SSADTGSPGS LPESLEAECV
DLTETGRRAS HASKRDKRHS WMSHTNTVDP ETVEIITEEL PSTQPLKSDP VYLRPCYILP
VGSQTPLHTC SSAQGPKLKR AHTYANLKDL WKLASFPGGL IEVQKDEEPV KQENQGLSSQ
ILEPPAIIQP PSLGPPIDIL SRPVSSAISR DTTTTARGPP TTHSYSPFRS VSMCSPPDTV
EELFPESPFN SPQPGNIGKI KRSPFQATRN SLKGMLNKSF KGRSLAHAFD NAEQNSQQRE
SSLKSKISGP LLIRGMRGEM DRKKSDKEWR EEVLKDVVGR TLSSQFKLLE EVAKQQRESL
LDKEKISQKT PSFGIGLKNR SPIPETLLSA VSICDKENPI TADSHADRMK MKSGTRPSME
SSLSLRMVTD ETFNNPKTEG LTSGHLANSI GRKNLTNQTP PHRLSGKGRP TSAPLLSAWS
PPRPKSEKLS QVINPPIVVN HATPRPEESP KTNPHHPPSS PSPSPKAGKS KSGRISRRSS
ILNLLKSKDS KHQNASQSPP RPKKMASGTF TLTGSIRSSF LAIIKNHIQT MQQEKRGKYS
RTPCIIPPST LPLRTPIHQH TELYRPLSRA RSSFELTLNL EPAKPLLDEL LARDDALGFL
EGRNKREQRV EVDIEKVLEW RKEVEEDI