Gene CNL04140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04140 
Symbol 
ID3254787 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp127562 
End bp129462 
Gene Length1901 bp 
Protein Length518 aa 
Translation table 
GC content48% 
IMG OID638253887 
Productexpressed protein 
Protein accessionXP_567969 
Protein GI58261118 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.215775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCAATCCCT TGATCTGCTT CACCATGTCA GACAACCAGC ATCTGCAAGG CCAATCTAGT 
GTCCCAGAAA CGCCCGCAAA TACGTCAGAA GTTGATACAT CGGGCGCGCC GGCCGCAGCA
AGCAGCGTCG GGCCCGCTTC GAGCGAGGGG ACAGATCAAC AGGACCCTGT CGTGTCGACT
TCTTCTTCAA AAGTACAAGA TATACTACAG CAAAGCATCA ATCGCGACAA CTTTCAGGAA
GAAGTTGGAC AGGTGATGGG CACCATAAAC AGTTGGTGGG GGGGCGTCAA GAAACAAGTA
CGATCCCATC CTACTCAGTA GTGTCTATTG TGCTGATCAA GTCCATGTCT AGTCCGTGTC
TACTTTGGCG ACTTTAAAGG CCGATATAGA TAAAACAGTG ACCCAAGCCC AAGCTGACTT
TGAATACCTC AAGGCAGCTA AAATTGAAGT GGTACGCAAA GACGCCACTT CCGAGCCCAC
TAGAGTGTCA GCCAAGGAGG ATCAAGATAT CAGTGCAGAT ACTTTGAAAG AAGAACCGAT
AAGTGTCCAA AACGATCAGG ACAAGGGAAA AGGGAAAGAA ACGGCACAGT CATCGGCAAC
GAATCAGACG AGCCCCCCTG CTTTCTTTAC GAAACTCGCT TCTTCAACTA GTCAACTTCA
GCAATCACTC CTATCCGCTG TACAGTTTAC ACTCGATGCC ACAACTGCCA ACTCGGCACT
GTCCAATCCA AATGCCTTCC GTCAGCAACT TGTGGATAAT CTACGCCTGG CCTCCGCTCG
AGAGAACTTG CAGCTTTCAG TCAAACAAGC TGAAAAACTG GCTGAAGAAT ATCTGCGCAA
AGGAGACCAG TGGGTCAAAG GTGCTGAGAA GTGGATGGAA GAGGCTGTTA AAGTCGTACC
TCCAGAGGGA GAAGAGACTC ACGTGGTCAA CATCGGCTGG GATGGCGGAG ATTGGTACTC
CTTTTCAACT TCTGATAATA CCCCTCTGCA CATATCAACG ATTGACAATG GTGCCCCTGG
TCCCTCAGCT GCTGGTACCC AGGTCAAGGT TCTGGCCAGC TCTCGTAAGG ATGCACTTTT
GAAGCGCCTT CGAGAAGATA AGCAACTTTT GTTGGTTGAT CCTGAAGGTG AAGGGGAAAC
TGAGAAAAGG AAAGCAGAGT TCCGTGACTG GGTTAAGACA CAATGGGAAG CACAAAAGAC
AAATGGGCGA CTGGAGGATG AGGGTCTTGT GGGTCATATT AGGATGGAGC TTGGTAAGTG
ACTTCCATGT TGGGATCCCT GAGAATATGA TACTTATATA AATGTCTTCT TACCCACTAC
TCTCATCAGT GCCTGAGTAC CTCACAGATG AGCAATTCTG GCAACGTTAT CTATTCCACA
AACATATGAT TGAAGAGGAA GAGCAGAAGA GGAAACTGCT CTTGCAAAGT GAGTGAGCAG
TATTCCTGTC ATGCTCCTGT TGCTAATTTT TTTTTCTTTG GGTAGCTTCT CAACAAGACC
AGTCAGATGA TTTCAACTGG GATGATGAGC CTGAAGAAAC TACCCCCCTG GGTGATGGGC
AGGCATCCCA TGGTGTAGTC ACCCCTAAGG TCAGCCCAGT TGGCAAACTA CCTAGTTCAG
TGTTCAGTCA CTCAAAGGCA AAACTTGCTA CTTTGGACTC AACAAGCCCA CATGACTCGG
AGGAGAGCTA TGACCTAGTC AGTGATCAGG GAGGGAAGAC TGCCAGGGCT GCCCCTCCTG
TGGGAGATGA TGATTCTGAC TGGGAGTGAC AATGATTAGT TCTGCCACTT GTTATTTGTT
GTTGTATATA CTAGCCAAAC CAAGCTCATG TCAAATATTT TACAGATAGG AAAAGACTAT
GATAAAAAAA GTTTCAGATA GAACTGATCC CTTGGATGTT T
 
Protein sequence
MSDNQHLQGQ SSVPETPANT SEVDTSGAPA AASSVGPASS EGTDQQDPVV STSSSKVQDI 
LQQSINRDNF QEEVGQVMGT INSWWGGVKK QSVSTLATLK ADIDKTVTQA QADFEYLKAA
KIEVVRKDAT SEPTRVSAKE DQDISADTLK EEPISVQNDQ DKGKGKETAQ SSATNQTSPP
AFFTKLASST SQLQQSLLSA VQFTLDATTA NSALSNPNAF RQQLVDNLRL ASARENLQLS
VKQAEKLAEE YLRKGDQWVK GAEKWMEEAV KVVPPEGEET HVVNIGWDGG DWYSFSTSDN
TPLHISTIDN GAPGPSAAGT QVKVLASSRK DALLKRLRED KQLLLVDPEG EGETEKRKAE
FRDWVKTQWE AQKTNGRLED EGLVGHIRME LVPEYLTDEQ FWQRYLFHKH MIEEEEQKRK
LLLQTSQQDQ SDDFNWDDEP EETTPLGDGQ ASHGVVTPKV SPVGKLPSSV FSHSKAKLAT
LDSTSPHDSE ESYDLVSDQG GKTARAAPPV GDDDSDWE