Gene CNL04020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04020 
Symbol 
ID3254735 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp98381 
End bp101084 
Gene Length2704 bp 
Protein Length490 aa 
Translation table 
GC content48% 
IMG OID638253874 
Productexpressed protein 
Protein accessionXP_567957 
Protein GI58261094 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCTTATCTT CAGCCTCGGC AAGCCCGATC AACTTCTTAA TCTGGTACTC CAACTTGTTC 
TCCAGCCCTC TCACTTTGTC CATAACTTCT TGGTTCATTA CCAACTCTCC CGCGATCTCA
TTCAACACAT TATCATCTGT GAGTTCCGGT CGTGAACGGG GGTTCGGAAA AGGGATGGAA
AGGGCTGTGG AAGTGGATGG ATCGGGAGCA GCCTCTGTTG AGGATGTCAA CCGTAAAGCA
AGCAGGATAA CGAGTTGGTG AAGGGATGAC AAGAGGAGGT GAGGGCGGAG ATTGAGAAGG
GAAAGACCAT TGGCAAAATC AAGATCCTCC TCTTTGTTTT GGACTCTAAA GAAGCTTATC
AGCTGTCTTG CAGCATAGAC TTAAAAGCGA GACAAGCTCA CTTGTTTAAT AATGGAACAG
AAGAAGAAGT AGTTGCATCA ACAGCTGTCT GAATAGTCCC AAATAATTTC AGAACCTCAT
CCGTAGAAAG AATAACCTCG GCGTCACTGT CCATCGTGCT ATATATTTTC CGTGACGACC
TATTGACAGC ATGTCAGCGT TTGTCCCAAG TTGATCACTG TGATAAAGTG GATGGCAATG
GAGAATCCTC AGTCGCGAGT TTTCTAGGCT CGCTCCACCC TCCTAAGGTG GAATAGGCGT
GTAATCTGTA CACCTCTCTT TTGGAGGTGC TCAGTGCCCA GTGCGGTAGC ATCATCGGAT
CCTCCATATG CTTGTACAGG CGAAAGAAAG ACTTACGTGG CTGATGTGGT TTTCACGGTT
TGTGGATGTG AAGAAGAGAA AAAAGACGAA AGAAAAAGGA AATGGCTGAA AAATTGTTGC
AGGGAATTTG ATGTGAAAAA GTGGTGACAT CAACGTATGT TCTGCCGGCT GCCGAAGAGC
CGCGCATAGC AGCGAGGACA TTCAACTTAT TCGTACATTT CCATTTAGCA TCTACACATC
TACCTCTTCA AGCGAATATG TCGCCATAAT GTCTGTCGCG CATATAGCAG ACAGCAAAGG
CGTCCAGGGA CCCGCAAAAT GGTCGGCTGT CAAGAACGCA GTGCAGTTCT CTGCTGCTGC
CAAGGATAAG AACAGAAGAA ACCAAACAGA CACCTTGCCA GTGTCGACCA CGTCTACCTC
GGTTTCTGCG GCCTTTGCTA GATCTTTGGT GCTTTTTACC GGTAAGTTAA TAGATGAGGT
ATGCTGAAAG TGTCCTGATC CGGATATATA GGCTTTCTAT TTAAACGGCC GTCAAAACTG
TTCAGACCAA ACAGAGGTAC TGCCGTTGCA CTGACGTACT AATTGGCTTT GCTGACTCAA
GGAATTATAG TGGATACCTG GTTGGGTCTA AGGCAATTAG CTCTGTCAAC TGAACAAACT
ATATCCCCAG CTTTCATCAG ATCCCTTCTT AGACAAAAAG CTGGTATAAT TGCGGTGACC
CTGACGATCC TTCCTCCAAT GCTGGTTAAC GCAACGCTAG GCTTCCTGCT ATTCACGTCC
CATTCCCTCT TCACTCTGGG GCTCTCTCGG TTATCATTCT TCCAGAGAAA GATCGAAATG
GAAGATGGGA CGGAAGTCGA CGAGGAAGAA GATATCAATC TGGAAACATT GATCAGGGGT
CCTTCAATCA TTCCAAATCA TCCGACGATT TTGTCTGCGA TTGCTGGGGC GGGCGCAGGA
CTCGTGCAGG GTGCTGCCTT TACGCCAGTA GAGAATGTTG TACGGTAAGC AAACCATGAT
CGTTCCATGT CTGCCATGCT CATTCATTGC AGTTTCCTTC ACCAATCTGC CACATCTTGG
GCTACACTTC TCGCCCGTCT TGTCCATTTG CCCGTACCCG AAGTACCCAA TGCATTCGAA
GGGAAACAAC CGGCAACGCC TATGCAAGCC ATCAAAAACT TGTTTGCAAG TGAAACTTGG
AGGAAGAATA GGAATTGGTG GACTGGATGG CGATGGGTGG TGGCTCGTGA CGCGTGAGAG
GGCCTTTCAA CTCATTTGTA AACGAAGCTG ATAGTTTCGT GGGTAGACTG AGCTATAGCT
GTTTCTTTGC CGCCTTTGAT GTTACGCGCC GCGTGGCTCT ACGAGTAAAA GCCCTGTTTG
GGGGTAATAT TGAGCATGGT TGGGAAAACA TATTTATTAT CGAATTCCCA GACGATCATC
CGCAAAGTGC ATCCCCATCA ATCTCCAACT CTCCTGCCAA ATACCGCCCG GATTCCGATC
AGCCCCAAGC GCCCACTATT GCCCGTGTCG CGCAAGCAAC AACCATCGTC ACTGGGGGCA
TTATTGCGAG CTATCTCGCA CAGATGGCTG GGAGGCCGCT TAGGACTTGT CAAAGGATCA
TGATGCTTGA CGAAAGGGAG AGGATGCGTG CCGAAAAAGC ACAAGGACGA GCTCAAGCTG
GCGGAGCAGG AAGCGTTTCG AGCTCAAATA GCAGCAATAC TTTGAGACGG GGCAAACCTC
ATCCAATACT TGAAGTTCTC CGAACTAAGG GTATCCGCCC CTTCATTCAT TCAGAAGGAC
TGTTGCAGTC AGCACCGATG AAAGAGGCTT TTCAGCAAGA GGGCAGGCTG GTGAGGACGA
TGAAAAGTGT CGGATGGAAG ATGGCCGCCA TGGGTCCGTG GGGTTTTGGA TTCCTAGTGT
GGGCTTGGGT TGGTGGAGAA GTATGAGATC ATCAATTGCT ACAAATCAAC ATTGCAAAAC
GATA
 
Protein sequence
MSVAHIADSK GVQGPAKWSA VKNAVQFSAA AKDKNRRNQT DTLPVSTTST SVSAAFARSL 
VLFTGFLFKR PSKLFRPNRV DTWLGLRQLA LSTEQTISPA FIRSLLRQKA GIIAVTLTIL
PPMLVNATLG FLLFTSHSLF TLGLSRLSFF QRKIEMEDGT EVDEEEDINL ETLIRGPSII
PNHPTILSAI AGAGAGLVQG AAFTPVENVV RFLHQSATSW ATLLARLVHL PVPEVPNAFE
GKQPATPMQA IKNLFASETW RKNRNWWTGW RWVVARDALS YSCFFAAFDV TRRVALRVKA
LFGGNIEHGW ENIFIIEFPD DHPQSASPSI SNSPAKYRPD SDQPQAPTIA RVAQATTIVT
GGIIASYLAQ MAGRPLRTCQ RIMMLDERER MRAEKAQGRA QAGGAGSVSS SNSSNTLRRG
KPHPILEVLR TKGIRPFIHS EGLLQSAPMK EAFQQEGRLV RTMKSVGWKM AAMGPWGFGF
LVWAWVGGEV