Gene CNC00420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC00420 
Symbol 
ID3256535 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp121518 
End bp123676 
Gene Length2159 bp 
Protein Length477 aa 
Translation table 
GC content49% 
IMG OID638255261 
Producthypothetical protein 
Protein accessionXP_569332 
Protein GI58264352 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0136325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TACAGTTAAA AGGTTATTAC CAAGTCCCAG AAGGCCATAT ACCTCTGCGT GTATGCCCGC 
AGGCCGTTAA GCATCAGTCT CCACCGACAG CAAAACAACA ACGACATGCC GCCTCCCGGC
GTATTCACCT TTGACCAATC ATCCGACGAG TCGCCAGAAC TTCTTCCTCG CGATGCTCTC
CCAGAGACGG AGCACATCCC AGAAGTTTCT ATGGTGGAGG AGAGTACCGT CACCGGTGTC
ACTTTTACCC CCGAGTGAGT CTACAAACCG TGTCCGAGCT ACCTGACGTC TGCGCACGTG
ACACCTGGTC ACGTTTATTG ATTGTGTGCT GACTGTGTCG CAGACTTTGG ATGCAACGAC
GACAATGGGC ACTACAAACA CTTCGTAAAG AAGGTGTACG ATCTGTACGT GTTCGAGTTT
ATAGCAGCAC TTTTTCAACA GCAGCATTGG ATCTGACCCA ACCGTGCAGG TCCTCGATCT
AGGTTGTGGT CCAGGTGCTC TTCTCGAGAC TCTGGTCATG CCAGCTTCAA CTATTTGCGA
GCCCCCTATC AGAGAGAAGC CGTCTGAAAC CCGTCATGCT GAGGATGAAG AAGAAGATTT
TGATCATGAA GACGAGCTTT TCATTGGCCG CCTTGCAGGT ATCGACGCCA ATCCTGAAGT
TATGAACCCC GCCCTTTCCG TCCTTTCACC TCATTCTGAA ACATCGACAT TTCCCCCTCC
TCGACCCCGA TGGGAGCCTA TCACTACCGA GCTATGGCTC GGCGGACTCG AAAAATATAA
TGCGAGGCTT GAAGGTTACG AGGCGATTAC GGCGTTGGAG GTGATTGAGC ATTTGGATCC
GAATGTGCTC AGTCGGTTTG GCGTGGTTAC TTTGGGTACT TATCGACCGC GAATTATGCT
TATCTCGACT CCTGTGAGTA AGATCTTTGA TCCAGATCTC GCGTCATTGA CAAAAATTTC
TAGAACTTTG ACTTCAATGC CAAGTTCCCC CAAGCCAATG GGGACTGCTT TGCCAAAAAG
GGATTTGTCG ACCCTACTGG AAGAACAGAT CGGGTTTTTA GACACTCTGA CCACAAAATT
GAGATGACTG GTGCCGAATT CAGAAACTGG GCAGAGACCG CAGCTGCAGA TTGGGGGTAA
GTTTCAGAAT TCTCGCCCTA GGCACCATAC TAACTTGAAC AGATACGATG TCGAGGTTTC
TGGCGTTGGC AGCTCTTCTA TTCCGTCATT CTACCCCAGT GATGACATCA CCAAGCCTCC
TCGACCTATC TATGCGTCAC AAACCGCCAT CTTCCGAATT GCAACTGGCA TGCCCCTCCG
TTCACCACGC TCTGTTCGCA CTATGCAGCT CCCATTCACA CCCTCCTCCA AAGAATCCTC
GCATCCACAC AAACTTGCAG GAAGATTTAC CCTTCCAGCC ACCGCTCCTG GTATTGGCCA
AAGGTCATCC CCCGAGGAGG TCAGAGTCAA GGTCCGAGAA TTCTTTACAG GCTCTAGCGT
GAATGAAGTG TCACTCGAAG AACTTTGGGG TGTGCTTGAC ATTGCTGGCG CTTGTGGTGG
AAGTAAGCGA TGGCTAGTTG GAAGCCTTGG AGGGTATGGA GATTGTCCTG CTCTTGATGC
GAATGATGAA GACGAGCTTG AATTTGAGGT TAAGAAGGTG AAAGGGGTGG GATTGTCAGT
CCAGTGGAGA GAATGGACAC CGAGAGCGGA AGAGAAGCCG AGAAGTTGGG GTACGCCTGT
ACAAAACGAA AGCGAGCATA CCTCTCCCGC AGCTGCTCAA GGGTGGTAGC AAAGCTATTA
CGAAAGGAGG CGATAGGGCC ATAAGAGCAA AGAACGAATA TGTGAAAAGC AGGCATGGTG
TTGAAGAACT GTAAGAAAAC ACCGCAAGCG GCAGCAGGGA GAAAAAATTT AACAACATAC
CACCCATCAC TCATCACTCA TTTCCTCAGG AATTATGTCT GTCTTTTTAG TAATATTTTC
TTCTTGTTCG CGTTAGTCTC AATTAGGGTA TCAAAAGGAG TAAATGTTTA TACGTTTGCC
CACTTTTCTG TAAACCTGGA GACGGTGGCC GACAACAAAA AGTTGCTTGT GGGCACTAAA
TACATGTTTC GTTAATTCAG AATTGTAAAC CATCTGGTAT GCATGGTTTA TCTGTCCTG
 
Protein sequence
MPPPGVFTFD QSSDESPELL PRDALPETEH IPEVSMVEES TVTGVTFTPE LWMQRRQWAL 
QTLRKEGVRS VLDLGCGPGA LLETLVMPAS TICEPPIREK PSETRHAEDE EEDFDHEDEL
FIGRLAGIDA NPEVMNPALS VLSPHSETST FPPPRPRWEP ITTELWLGGL EKYNARLEGY
EAITALEVIE HLDPNVLSRF GVVTLGTYRP RIMLISTPNF DFNAKFPQAN GDCFAKKGFV
DPTGRTDRVF RHSDHKIEMT GAEFRNWAET AAADWGYDVE VSGVGSSSIP SFYPSDDITK
PPRPIYASQT AIFRIATGMP LRSPRSVRTM QLPFTPSSKE SSHPHKLAGR FTLPATAPGI
GQRSSPEEVR VKVREFFTGS SVNEVSLEEL WGVLDIAGAC GGSKRWLVGS LGGYGDCPAL
DANDEDELEF EVKKVKGVGL SVQWREWTPR AEEKPRSWGT PVQNESEHTS PAAAQGW