Gene CNE01140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE01140 
Symbol 
ID3257788 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp316959 
End bp318558 
Gene Length1600 bp 
Protein Length346 aa 
Translation table 
GC content46% 
IMG OID638256702 
Productconserved hypothetical protein 
Protein accessionXP_570702 
Protein GI58267092 
COG category[R] General function prediction only 
COG ID[COG1355] Predicted dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.599641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGGGT AGGCTGCCAA CTTTTTGAAA TTCTAAATAC AGGGCTAACG ACAATATCCA 
GCGTACGCGA AGCCACCCAC GCAGGCAGCT GGTACACCTC TTCCCGTACG TCTTTCCGCT
CATCGCTCCT TCCGTATCCA GCCATATTAA TCTAATACAT ACCATATGCT AGGCCCTGGG
CTTCATAAGC AACTTAGTCA GAACCTTTCT GCAGTCAAAC CTATCTCTAC ATTAGATTAC
GACCCACCCG TAAGCAATGC TAAAGCGATC ATTGCGCCGC ATGCGGGATA TAGCTATTCG
GGACCTGCCG CAGCGTGGGC TTATGCGGCT GTACCTACAG AGAAAATGTG AGTCACTCAA
ACTGTGAAGC CAACTGTGAT GAGGGTATGG TGATGCTAAT AGGAAGGGAT CATTTGCCGC
TAAAGTAAGA GAGTATTTTT ATTGGGCCCT TCGCATCATG CCTACTTGCC GGGGGTAGCG
CTTTCCAAGT TTGAAGCGTA TGAGACGCCT TTAGGGGATA TTCCTCTTGA TACAGACAGT
GATTACTCTG CAATTTTGAT ATGTAGTATA CAGACCGCTA ATGGACTCGA TAGCTATCAA
TGAACTTCGC GATACAGGGA TATTCTCTGA CATGAAATCC TCTACTGACG AGGACGAACA
TTCGTTGGAG ATGCATTTGC CCTATATTAG GTTGATCTTC CAAGGGTGGG TTGTCTTCAC
GGACTGTCTG AACTATGACC ATCTGATCAA TTTACAGGAG GGATGATCTC AAGCTTGTTC
CGATCCTCGT GGGACACCCC AGTGCTTCGA CCAGTGCAAA GCTCAGTGAA GCTCTTGCCA
AATACTGGCA AGATGGTGAG ACCTTCTTTG TGATCTCCAG CGATTTCTGC CACTGGCAAG
TCATTCATCA TGTCTAATCC ACAGAAAACG ATGGCTGATG ATAAATCAGG GGGAGCAGAT
TCTCATGCAC TCCATACTAT CCAAACATCC CGCCTTTGGC CAATCCCGTG CCTCCCGTCA
AATCGTCCAC TTCAGCCACT CCTGGCACTC TCACCCAGCC TCCTGAGCTT GTCAAAAAGT
TTTCTTCCGC CAGCTCCAAT CCGGATGTGC CTATCTGGAA GTCTATCGAG TACATGGATC
ATGAAGGTAT GGACCTTTTG CGCAAGCCTG GAGAAGATGG TGCTGTTGAG AAGTGGCATG
GGTACTTGGA AAGGACCAAG GTTTGTCTGT CTATCAACAA TTTTGGTCCA TCATCTGAAA
GACATTTTAG AATACAATTT GCGGTCGAAA TCCTATTACT GTACTCCTCA ATCTCGTCCA
GTTTGTGTAC AAAAATCAAC CGGTTAAGCC CGAGTTTGTT TTTGTAAGGT ATGAGCAAAG
CAGTAAATGT GTGAATGGAA AGGACAGTAG CGTCAGTTAT GTTAGCGGGG TCTTAAGGCT
CCCTCAGTGA TTCCTAGTCC ATATTTGGGC GTACGAGTAG TGAGACTTTT TACTGGTGTA
GGTTGGATCT GGTTGTAAAG GAAGGCGACA GACGATAAGA ATAAAGTTGT AGAGCTGAAG
AATTCGAATT CCCGAATGTA AATATACATG TATCCGTCTC
 
Protein sequence
MSGVREATHA GSWYTSSRPG LHKQLSQNLS AVKPISTLDY DPPVSNAKAI IAPHAGYSYS 
GPAAAWAYAA VPTEKIKRVF LLGPSHHAYL PGVALSKFEA YETPLGDIPL DTDTINELRD
TGIFSDMKSS TDEDEHSLEM HLPYIRLIFQ GRDDLKLVPI LVGHPSASTS AKLSEALAKY
WQDGETFFVI SSDFCHWGSR FSCTPYYPNI PPLANPVPPV KSSTSATPGT LTQPPELVKK
FSSASSNPDV PIWKSIEYMD HEGMDLLRKP GEDGAVEKWH GYLERTKNTI CGRNPITVLL
NLVQFVYKNQ PVKPEFVFVR YEQSSKCVNG KDSSVSYVSG VLRLPQ