Gene CNB01350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB01350 
Symbol 
ID3255863 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp403989 
End bp406985 
Gene Length2997 bp 
Protein Length858 aa 
Translation table 
GC content49% 
IMG OID638254785 
Producthypothetical protein 
Protein accessionXP_568807 
Protein GI58262794 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGTTT CTGGTCTTTG GGACGTGAGT CAATTTACTC CATTTTACTC GTTCTTAAGG 
CATTAGGTAC TGATATAAAT TAGCTTCTGA GACCAAGTGC GGCGAGCGTT ACGCTACATA
CGCTCTCCAA AGAAGCCTTT TTGGAAAATA AGAACGGCCT GAGAGCACTC ACAGTTGGTA
TAGATGCTTC GTAAGTGGCT CTTGTGGTAA ATGGAAGTCC TAGCTTATTG TACCTTGTGA
CAGAATTTGG ATCTTTCATG CGGCTGTACC TCAACATGGT GAAAACCCTT TCCTAAGGAC
CATATTTTTC AAGATCACAG CATTACTCCA ACATCCTGTA CTGCCAGTAT TTGTTTTTGG
TTGGTGAATT TGGTCCTGAC AGGATACGAT AACTAATTAC CAATTAGATG GTCCCAACAA
GCCTGCGATG AAAAGAAATC AGAAGGTCGG GGGAAAATTC GGAACCCATG ATTACCGAAG
CAAACAGTTT AAGGCCTTAC TTGACACCTG TGGTCTTGAA TGGTGGAACG TGAGTAACGG
CGATTTGGAA TTAATCGTCC AGCTGAAGGT ATGACAGGCG CCGGGAGAGG CGGAAGCAGA
GTTGGCTGTA ATGAATCGGC AAGGAAAGAT AGATGCCATT TTGTCCGATG ACGGAGATGC
TCTTCTATTT GGAGCGAAAT GTCTCATCAG GAAGTAAGCT TTTGGATGGG TTCTATACCT
TGCATTCGTG CTGACAGCTA GCCAGTTCTT CCCCGACTCT CTCAGGATCG CTGGCTTCTT
CAACGAAGAA TAATCCATCT GCCGGCTCTA AACGTGATTA CGACGTATAT ACACTATCCC
GGATCTGTGG AGAATGGGCA AAAGAGCAGG ACACCGAACT GACATCTGAA GAAAGCTGCA
CAATGGCAAT GGTATGGATT GCCCTTTTAA GTGGCGGAGA CTACACGCCC GAAGGACTCT
ACAGTATCGG TGAGTGTCTC GAACTCAATG CGATAATATG TTTACTGACC TAATTCAAAT
ACAGGACATA AAATATCCTA CGGTCTTGCC AAAGCTGGGC TTTCTGACTA CTTGAAAGAA
TACTGCCGTG ACAAGCAAGC TTTCCTGAAG TCTTTGCCCG GGCTTCACGC TCGTATGGTG
GAAGAACTCC GGACAAATTC TTCTAAACAG CAGGACAAGC GTTACCCTGA TCGCTCTAAC
AAGCTCTCAG CAATGTCCCC TTCGCAGCTG TTTCCAACGT CCACTTTGGA CGCTTATCTC
AGCCCATGCA CTAGCCCTTT GGACGACCCT TCTCAAGGAT GGCCTGGTTT CGGACAGGGA
AGCTGTTCCA TGGCCAGAGG AAAGGCAAGG AGTGAAGGGA GAGGCGATAT GGAGGGCATG
GCAGCAGCAT GTGAGAAATA TTTTGAATGG GGAACCAAGG ATCTTGTGTG CAAGAAATTT
GCAGGAGAGT CTGTGGGTAT CTTTGGAGCA GAAATTATGA ACGCCGCTCG AGAGGCGGTA
CGCGCTAGGG ATAGCTTGGG TCTAGGCGTT GGCATAGGTC CAGAGAAGAC GCCTTCGAGA
ATTACTTCCT TCTTTCAACA GTCTGTCCCA TCTCCTATAT CGTCCAAATC GACCGGAGCT
TCTCAGTTTC CTAACCCGGC AACTCAGCTG CGCGATACCG TTCCACCTCA TATCGTCCAA
ATCCACTCAG AGAGGACAAG TAAAGATGGA ACGGAGAAAG ACTACCGCAT CTCATTTCAC
CAGGATGTGT ATGTTGAACG TTGCCGCAAT GCCATGCTTG GCATACGAGT CGACCCTAGC
GAACTTCCTC AAGAAGAAAA GAACAGACTA GGACTCGCCG ACCATGCTGA TAAAGACAAC
GATGATAAAG TGTCCGCAAC TCAAACGGCC TCCAAGTCTG AAATCAGAGT CTGGCTTCCT
CAATACCTCG TACGAGAGGC ATGGCCGGAA CTAGTCAAAG CTTACGATGA TAAGCTGGCT
GCCAAAACGG CCAGTAAGTT GAAAACGCCT AAAAAGGATG TCCAGCCTTT GAAAACAAAT
GCCGCTGGTA AGGCAAAGAG GGGAAGGGGG AAGAAGGCGC TTGAAGCAGA CGGGGAGGAT
GTAAATGCGT TCACCTCCTT CTTCAGTCAA CGCCCCAAAG AATCAACACT AGATGCTTTT
GAGGAGGAAG AAGAGCAAAT TGAGCCAACG CCACCGCAAA GTAAGGCTAC GCAAGAGGTC
ATCGATCTCA GCCTGTCGCC TTCCCCTTCA CCACCTGCAT CTCCCACGCA AAAGTCCTCA
AATAATGCCG AGAAAAAGAA GCTAGCACGA CCAGCTCTTA ACACATCTAC CTCCAGTACA
CCGTCCGGTG GAGAGAAAGG TGAAGGCTCC AGTCGAACTG CCCGGCGTGC TCGTAAAAAA
ATCCACAAAT CCACTTCACC TTTAAAAACG ATCACCGACC TCTCCAAGTC TTCCCCTTCT
CCTGAGCAGT CCACGGCAAC TCCCACTATC ACCCCACGGT TCTCCAGTCG AACGTTCAGA
AAGACTATAT CCTCGCCATC TGCCTTCCCA CCACGCCGGG TGCAGGCTCA GGAAGTTATC
GATCTCTGCT CATCGAGTGA AGAGGATACA GCTCCAGCGA AGCCCATTCA TCGTAGACAA
GCCGCCAGGT CGCCTAAAAT CTCATCCCCG CCATCTGGTA TTCTTTCTAG TTCCACTAAG
ATCAATACCC AATCATCTCG TTCATCTCGC TCTACCTCTG TCTCATTTCC TGCCTGTCCT
TTGAACAAAC CAAGCTCACG CTCGCCACGG GAAAGTTCGA TACTCAGCCA CCCTCTCCTT
TCTGGGTCTA GCCAATCATC CTCCCTATCG CCTCCCCCGC CTACACCTCC TAATACACAA
AGAGCAACAT CCCCAAAGAA AACAAAAGTA TCATCCCCGG CTAGGCGACG GCCAAAGTAC
AAAATTATCA GCTCGACAAA AGACGGCGAA GTCATTGATT GTACGATGCG ACGATAA
 
Protein sequence
MGVSGLWDLL RPSAASVTLH TLSKEAFLEN KNGLRALTVG IDASIWIFHA AVPQHGENPF 
LRTIFFKITA LLQHPVLPVF VFDGPNKPAM KRNQKVGGKF GTHDYRSKQF KALLDTCGLE
WWNELASSSP TLSGSLASST KNNPSAGSKR DYDVYTLSRI CGEWAKEQDT ELTSEESCTM
AMVWIALLSG GDYTPEGLYS IGHKISYGLA KAGLSDYLKE YCRDKQAFLK SLPGLHARMV
EELRTNSSKQ QDKRYPDRSN KLSAMSPSQL FPTSTLDAYL SPCTSPLDDP SQGWPGFGQG
SCSMARGKAR SEGRGDMEGM AAACEKYFEW GTKDLVCKKF AGESVGIFGA EIMNAAREAV
RARDSLGLGV GIGPEKTPSR ITSFFQQSVP SPISSKSTGA SQFPNPATQL RDTVPPHIVQ
IHSERTSKDG TEKDYRISFH QDVYVERCRN AMLGIRVDPS ELPQEEKNRL GLADHADKDN
DDKVSATQTA SKSEIRVWLP QYLVREAWPE LVKAYDDKLA AKTASKLKTP KKDVQPLKTN
AAGKAKRGRG KKALEADGED VNAFTSFFSQ RPKESTLDAF EEEEEQIEPT PPQSKATQEV
IDLSLSPSPS PPASPTQKSS NNAEKKKLAR PALNTSTSST PSGGEKGEGS SRTARRARKK
IHKSTSPLKT ITDLSKSSPS PEQSTATPTI TPRFSSRTFR KTISSPSAFP PRRVQAQEVI
DLCSSSEEDT APAKPIHRRQ AARSPKISSP PSGILSSSTK INTQSSRSSR STSVSFPACP
LNKPSSRSPR ESSILSHPLL SGSSQSSSLS PPPPTPPNTQ RATSPKKTKV SSPARRRPKY
KIISSTKDGE VIDCTMRR