Gene CNE00100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE00100 
Symbol 
ID3257947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp16399 
End bp18264 
Gene Length1866 bp 
Protein Length621 aa 
Translation table 
GC content55% 
IMG OID638256592 
Productretrotransposable element slacs 132 kda protein (orf2), putative 
Protein accessionXP_570697 
Protein GI58267082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.21893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCGCA CTTCTCGTCT CATCCCCTTG AAGAAGGACG ATGGCTCTAT CCGACCTATC 
GCTGTTGGTG AACTTATCTA TCGGCTATGT GCGAAAGCTC TCATCATCTC GCATTTCCAA
CCCGACTTCC TCCTCCCGTT CCAGCTCGGG GTCAAGTCAA TCGGTGGTGT AGAGCCGATC
GTGAGGCTGA CAGAGAGAGT CTTGGAGGGT TCTGCCGGCG CTGAGTTCTC CTTTTTAGCC
TCGCTCGATG CTTCTAACGC TTTCAACCGT GTAGATAGGG CCGAGATGGC AGCAGCGGTC
AAGACCCATG CGCCGACGCT TTGGAGGACA TGCAAATGGG CCTATGGCGA CTCGTCCGAC
CTTGTGTGTG GTGACAAAAT CCTTCAATCC TCTCAAGGTG TTCGACAGGG TGACCCCTTT
GGCCCTCTCT TCTTCTCGAT CACCCTCCGA CCAACCTTGA ATGCCCTCAG TCAATCGCTA
GGTCCGTCTA CGCAAGCACT CGCTTACCTC GATGACATCT ACCTCTTCTC AAACGACTCT
CAAGTCCTCA GCAAAACTAC CCAATTCCTC GCCGACAAGC AGCACATCAT CAAGCTCAAT
GAAAAGAAAT GCAAGTTAAT CAGCTTCGAT GAGATCAGGC AGGAGGGCTT CAAGATGCTA
GGGACGATGG TAGGTGGTAA GGAGAAGCGG GCGGAGTTTC TGGAAGGCAG GATTCGGAAG
GAAATGGCAA AGGTGGGCAA GCTCAAGGAT CTTCCACATC AACACGCGCT CCTTCTATTA
CGCTTCTGCA TTCAGCAAAA TCTACGACAC CTGCAGAGAA GCCTACGCTC CGACGACCTT
GTAGATCTAT GGGAAAGACT GGACACGATG CTGTGGGAGG AGGTGAAAAG GATGAGGATG
AGGCAGCGAG AGGATACGGT GGAAGAGGAG GCTCTAGGGA GATCGTTGAC GAAGCTACCA
GCGCGACTGG GCGGACTAGG TCTACTTTCC TTCAAAGATG TAGCCCCCCT TGCTTACCGC
TCGGCAGCCG AGGCCTCCGA CACTCTCCTC GATAACCTAG GTCTCCTTTC TTCGCCAGAG
GAACCTCCAA CTCCGATCCC CCAACGAACT CGATGCGCAG AACTCTGGGA ATCGCAACAG
GAAGCCATCC TACATAATCT CGGCGACACT GAACGCAAGC GACTCACCGA GAATGCCTCC
AGACTCGGCC GAAGTTGGTT ATCAGTTATC CCTTACCTTC AACCCCTGCG CCTTTCCAAT
GTCGAGATTG CCTCCGGTCT CCATGACCGC ACCCTGGTCG GCTCCTCGAT CCCTGTCTGT
CGCTTCTGTG GGTCGGACTC ACCTTTGGGT CACGACGAGC TTTGCCGCGC CCGCAACCCC
TGGACCCAGC GCCGGCACAA TGCCATCAAC CGCGTCATTT ATCAACACCT CAAACAAATT
CAAGGTGCCA CGGTTGAGAT TGAGCCCCAC ACGCTGTCTG GGCAAAGGAG AAACGACCTT
CGGGTCAGAG GTTCCAGCGC TCTGGCCTTC ACTGACTACG ACCTGAAGGT ATACTCCCTC
GGAGACCGAG ACGCGAGAAG CACCGTCACA CCCTGTGCCC CCAACGGCAA GCTGGCCGAC
TTCTGCTTGG ACCGGTGCGT GAACTGGCTC GACAAGGTGG GTCAGGTCGT CTCTAAGAAC
GCTCCGAAGG TCACTGGTGG GGTTTTTAAA CCAATCATCC TTTCCACTGG TGGCTTGATG
AGCAGGAGCA CAGCAGACGA TTGGAAGGAC TGGAGGGAGG CGATGCCGGT GGGGGGGTTC
GAGAAGATGG AGAAGAGAAT TGGTGTCGAG CTAGTAAAGG CAAGGGCGAG GACGCTGGTC
TTGTGA
 
Protein sequence
MLRTSRLIPL KKDDGSIRPI AVGELIYRLC AKALIISHFQ PDFLLPFQLG VKSIGGVEPI 
VRLTERVLEG SAGAEFSFLA SLDASNAFNR VDRAEMAAAV KTHAPTLWRT CKWAYGDSSD
LVCGDKILQS SQGVRQGDPF GPLFFSITLR PTLNALSQSL GPSTQALAYL DDIYLFSNDS
QVLSKTTQFL ADKQHIIKLN EKKCKLISFD EIRQEGFKML GTMVGGKEKR AEFLEGRIRK
EMAKVGKLKD LPHQHALLLL RFCIQQNLRH LQRSLRSDDL VDLWERLDTM LWEEVKRMRM
RQREDTVEEE ALGRSLTKLP ARLGGLGLLS FKDVAPLAYR SAAEASDTLL DNLGLLSSPE
EPPTPIPQRT RCAELWESQQ EAILHNLGDT ERKRLTENAS RLGRSWLSVI PYLQPLRLSN
VEIASGLHDR TLVGSSIPVC RFCGSDSPLG HDELCRARNP WTQRRHNAIN RVIYQHLKQI
QGATVEIEPH TLSGQRRNDL RVRGSSALAF TDYDLKVYSL GDRDARSTVT PCAPNGKLAD
FCLDRCVNWL DKVGQVVSKN APKVTGGVFK PIILSTGGLM SRSTADDWKD WREAMPVGGF
EKMEKRIGVE LVKARARTLV L