Gene CNL03940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL03940 
Symbol 
ID3254900 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp80159 
End bp81665 
Gene Length1507 bp 
Protein Length458 aa 
Translation table 
GC content65% 
IMG OID638253866 
Producthypothetical protein 
Protein accessionXP_567952 
Protein GI58261084 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.414601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTCGTCGGC AAAACACATT TTCTTCGTCG CAAAGCACCA AAAAAGCGCA ACAAACGATG 
GCTGACCTGA CACCCGGCGC ACGCAAACCC GCGCCAGCAG AGCTTACAGG CTTCCGCTCA
GCCCTCGCGC ACACCGGTAT CCCGCACGGC GTGCTTCTGT GGAAACCCCG TCTTCCCTCA
CGCAACTGGC TCGTCTTCTG GTCCGTCTCC CTGTCGCTCT CGTACGCCTA CTACTACGAC
CGCGCGGAAT GCAAGCGGAT CAAGCAAGAG GTGGTGGAGC GTGTGGAGAA GTACGGGCGG
GAGCCGATGC CTGGTGGGAG TCTCGGTGAG CCGAGGCGGG TCGTGGTCTG GGCTGGGAGG
TGGGGAGGGG ACGACGATGC CGACCGAGCT GGGCGGTATT TCCGCAAGTA TGTCAAGGTG
AGTGCTGTGT GGCCGTGCTT GGGAAGGGGA AAAGAAGGCG CTGAACGATC TCTCGCGCAG
CCTTACCTCG TCGCTGCCGG CATAGACTAC ACGCTGCCGT CTGTCCCTCT GCACGGCTCG
ATAACCCGCC AGCTGCACGC CGCGATCCTG CTGCAGCGCC GCCAAGCACT CGGCCTCGCG
CCCACCGCGA CGCCGCTCTC TCTCCCCGGC GTGCTGGACC CGGCAGAGGC GAAGCGGCGG
GAGGTGGAGA GCGGCGTGGT GGTCGTCGGG CGGGCGAGCC TGAAGGAGTA CCTGGAGGGG
CTGCGGAGGG GCTGGGAGTG CGGCGTGGAC GAGTGGGCGT GGGAGACGGA GGTGGAGAAG
ACGTTGGCGG GCGACGGGGT GTTTGAGTCA GTCGAGTCGC CCGTCGAGCC CGCCGTCGAG
ACCGCCGAGA CCGTCGTCGA GCCCACCGCC GACGCCGTCC CCAAATCCAA CTTTGGCTTC
CTCGCCCGCC CCGCGCCCGT CACCCCCGGC GCTCCCGCCA TCCCCGCCCA CCTCCACACC
CCGCCCTCCC CGCTCCCGCC CACTCCCCCC CTCCTCCTCC TCCCGTTCAC GAACCACCTC
GGCTTCCTCC AGCTGCCGTA CATGATCCTC GACTTCTTCA ACGAACGCGC CAAAGTGCGG
CAAGGCGCAC AGTCTGCGCT CGCCCTCATC GAGGGCCCCA CCAGGGACAT GCACAGGGAG
GACGCAGAGC ACTGGGAAGA GAAGAGCGAG AGCTGGTACA ACAAGACGGC CAGGCAGCTG
CCCGAGCGGC TCCAAAAGTC GCGGACAGAG TACTACGAGG CGATCAAGTC CCGTATCGAC
CTGGCGAGGG CGTACGAGAA TGGCGACCGC GAGATGACCG AGGAGGAGAA AAAGGCCAAC
AAGGTGGAGC GGATCCAGGA TATCCAGGCG GAGAGGCTGA AAAAGGAGCT CAGGTGGAAG
GGCAGCGAGG AAGGATGGGA GATTGTCAAG CCAGAGACAC CTGCGACATG GAGAGATCGC
TGGGAGGGCT GGCTCAAGGT GTACCAGGTG CCCGAGGATG CGCAAAAGGG CTTGTAGACG
CTCCGAC
 
Protein sequence
MADLTPGARK PAPAELTGFR SALAHTGIPH GVLLWKPRLP SRNWLVFWSV SLSLSYAYYY 
DRAECKRIKQ EVVERVEKYG REPMPGGSLG EPRRVVVWAG RWGGDDDADR AGRYFRKYVK
PYLVAAGIDY TLPSVPLHGS ITRQLHAAIL LQRRQALGLA PTATPLSLPG VLDPAEAKRR
EVESGVVVVG RASLKEYLEG LRRGWECGVD EWAWETEVEK TLAGDGVFES VESPVEPAVE
TAETVVEPTA DAVPKSNFGF LARPAPVTPG APAIPAHLHT PPSPLPPTPP LLLLPFTNHL
GFLQLPYMIL DFFNERAKVR QGAQSALALI EGPTRDMHRE DAEHWEEKSE SWYNKTARQL
PERLQKSRTE YYEAIKSRID LARAYENGDR EMTEEEKKAN KVERIQDIQA ERLKKELRWK
GSEEGWEIVK PETPATWRDR WEGWLKVYQV PEDAQKGL