Gene CNI00020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI00020 
Symbol 
ID3259545 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp5006 
End bp6241 
Gene Length1236 bp 
Protein Length327 aa 
Translation table 
GC content52% 
IMG OID638258486 
Productexpressed protein 
Protein accessionXP_572755 
Protein GI58271198 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCTCAC ACAGTCTCCT TGCTTCCCTT CGCGCTCGTT GCGCCGTTTC CTTGCTTGCG 
TCCTCGTCCA CCTCCGTCCG ATCCCTCGCC ACTCAAGCCT CGTCCACCAT CACATCTATC
AGACCCAGCC GTGATGAAAT CGTCAACATG CGCCTCAGCC AAAGATCTAT CCAAGCAGCC
CTCGAAGCTT TACGGGCAGA CGGGATAGTC GTGGTCGAAG ACGTAGTCAA CAAAGAGGCG
ATCGATAAGC TCAATTCCCA CATGGAGAAG GACACACATA CGCTCATGGC ACGAGGAGAG
AAGGGACCGT TCAACTATAA CCTAGGCAAT CTACAACAGA GCCCGCCATA CGATCCGGAT
CTGTTCTCAC CGTCGATATT TGTGAATCCT TTCGGGATAC AAGTGACGAA TGCCTACCTT
GGGGAACGCC CGACCATGTC GTTCATCTCG GCCAACTCGG CCGTGAAGGC AGAGGTGGGA
CAGCCAGTAC ATTCTGATGC TGACTTCAGC CATCCAAGTG TAAGTAGACA GCGTTCATGA
ATCGGCGCCA TGACCAAACC TCGCTGATAT GTATATAGAT TCCATTTGCA GCTGTGGTTA
ATGTCGGCCT CGTCGACATG AACCCCAAAA ACGGCTCAAC CCGTGAGCGT CGAAAAGGCG
ATGGCGGTTA TTCTCTACGC TAACATTCCC TTCAGAGGTC TGGCTCGGAA CACATGATGG
CACGGATCTT TCTTGTCAGG AAGGAGCGCA TGGGGAGAGA GCCTCCGGTC GAATCAAGCA
GGATTTATTG GATGCCCGGA AATCAATTTC TCCTCCATTG CAGCCCACTA TCCCGAAAGG
TGTGTAGAAT GACTTTCCAT ACCTTGGTCC AGGCTAGCTG TTCTCTTGCA CCAAAATGTG
TGGAGATGTT AACCATCAAT TAGGCTCACT GATCATCCGC GATCTTCGAT TGTGGCACGC
CGGTATGCCC AATACAACCG ACGAGATCAG GATCATGTTG GCAATGATCC ACTTTGCGCC
TTGGTACAGA CAACGAATGA CCATGAAGCT GCCCCGATCT CTGCGACCCA CACTAGAAGG
AGTGGACCGG CTGGGCGTGG CGGCGGACTG GCAGGACGCA GAGGTAGATC ATCTGGATGC
GCCGTATGGG AATGCGTTTG ATTTCAGCCA GGATCAGTAG TGTTCTTCCA GCATTGAGTA
GGTTTCTAGT CATGTATGCA AGCTGTTTAT TGTCAC
 
Protein sequence
MLSHSLLASL RARCAVSLLA SSSTSVRSLA TQASSTITSI RPSRDEIVNM RLSQRSIQAA 
LEALRADGIV VVEDVVNKEA IDKLNSHMEK DTHTLMARGE KGPFNYNLGN LQQSPPYDPD
LFSPSIFVNP FGIQVTNAYL GERPTMSFIS ANSAVKAEVG QPVHSDADFS HPSIPFAAVV
NVGLVDMNPK NGSTQVWLGT HDGTDLSCQE GAHGERASGR IKQDLLDARK SISPPLQPTI
PKGSLIIRDL RLWHAGMPNT TDEIRIMLAM IHFAPWYRQR MTMKLPRSLR PTLEGVDRLG
VAADWQDAEV DHLDAPYGNA FDFSQDQ