Gene CNI03700 details


Gene Information

Locus tag: CNI03700
Symbol:
ID: 3259635
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 998312
End bp: 999482
Gene Length: 1171 bp
Protein Length: 299 aa
Translation table:
GC content: 47%
IMG OID: 638258865
Product: conserved hypothetical protein
Protein accession: XP_572597
Protein GI: 58270882
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG5285] Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.183271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCTATT TAACAGAAGA GCAAAAGCAG CGATGGAAAG AGGACGGTTA CCTCGTTTTG 
CCCTCGTTCT TCACTGACGA GGAAACCAAG GATATGCTCA ATGAGGCCAA GAGGCTTTGT
GGCGAGTTTG ATATTGAGGG GCACCCTATG GTATGTGCTG TTGATTCGTA CTTATTGAGC
TTCAGACAGA TTGATTCTAC TATAGACGAC ATTTAAGACA GCGGCGGATG ATGCACATAT
CGGAGATGAG TACTTTTTAA ACTCTGGGGA CAAGGTGAGC GAGCCTTTTG ATACCGTTCT
ATTACTCACC ATAACATGAG GTAGATCCGT TACTTTCTTG AACCAAGTTC CGTTACCCCA
GCTACTGCCA CTACACCTGC CAAACTCCTC GTGCCCCCAG CTCAATCAAT CAACAAGATC
GGCCATGCAC TCGCCGTCCT CAACCCAGTT TTCCGCAAAT ACACACTAGA AACACCAAAG
ATGTCGAACC TAGCAAAAGA ATTGGGAGAA CAAGAGAGTC CGAGGGTGTT GCAGAGTATG
GTTATTTGCA AGCAGCCGAG AATAGGCGGT GTTGGTGAGT TTTTGCGGAA CGGTTTGGTG
AGATATCGTT GCTGACTAGG CAGGCAGTTC CTTGTCATAA TGACTCTACT TTTTTGTACA
CTGATCCTCC TAGCGCTATA GGTGCATGGA TAGCTCTGGA AGAATGTACA CCTCAAAACG
GCTGTCTTGT ACGTCCGATT CATAACGCTA CCTGATGTCT ATAGCTAATG TAGGGTTGGT
CAGTCCTTTT TACCAGGCTC TCACCGATTA TCACGAACTT CAACTCGATT TGTCCGTGCG
CCCAATGGCG GTACGACTTT TGTCGATGTC CCTGGGGTGG AACCAAATAC GGAGAATTGG
GATGAGATGG AAGGCTGGAA AGAAGCGGCT TGTCCTCCTG GGACTTTGGT TTTGATCCAT
GGTGCGTCGT TGAGTCTAGA CATATATCCT GACGCTTCGA GCTAATGTGA TGCGTGCAGG
AAGTGTGATG CACAAGTCTC CTCCTAATCC TTCGGATAAA TCGAGGCTGA TTTATACATT
CCATATGATT GAGGGAGGGA AGGGTGTCAA ATATGATGAG CGAAATTGGT TGCAGCCGAC
TAAGGAAATG CCATTCCCTG CTTTGTTTTA G
 
Protein sequence
MPYLTEEQKQ RWKEDGYLVL PSFFTDEETK DMLNEAKRLC GEFDIEGHPM TTFKTAADDA 
HIGDEYFLNS GDKIRYFLEP SSVTPATATT PAKLLVPPAQ SINKIGHALA VLNPVFRKYT
LETPKMSNLA KELGEQESPR VLQSMVICKQ PRIGGAVPCH NDSTFLYTDP PSAIGAWIAL
EECTPQNGCL SFLPGSHRLS RTSTRFVRAP NGGTTFVDVP GVEPNTENWD EMEGWKEAAC
PPGTLVLIHG SVMHKSPPNP SDKSRLIYTF HMIEGGKGVK YDERNWLQPT KEMPFPALF