Gene CNH00460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00460 
Symbol 
ID3259154 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp1055861 
End bp1056975 
Gene Length1115 bp 
Protein Length281 aa 
Translation table 
GC content52% 
IMG OID638258440 
Productconserved hypothetical protein 
Protein accessionXP_572238 
Protein GI58270164 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTCG AAGGAAAGGG TTCGTATAGC TCGTACCAAT TCCCAATCAA TGCTCATGGT 
CAATTACTCT AGCTTTCGTT GTCACTGGTG GCTGTGGTTC TATCGGTGGC ACTGCCGCTA
AGCATATCAT TGCCAGAGGC GGTATTGCTC TGGTAAGTGG CTTTCATATT TCAATAATAT
TCCAAGCCAA CTTTTTGGCA GATCTTCGAC GTTCTCCCCG AGGAAGCTGG TCAGGCCAAG
GTTAAGGAGT ACCATGCCGA GCGTGCTTTC TACTTCAAGG CCGACATCAC CAACGTCGAG
GTCTTCTCCG CCTGCATTGA TGCCGCCTTG AAGGTTATCC CCAAGGGCTC TCTCTTCGGT
GGTGTCCATT GTGCTGCTAT TGCGCCCGGC CGACCTTGGG ACCAAAAGTT GAAGAATTCT
ATCGCTGTAA GCTTGATTCT TCGACAGTAT GCAGTCTTCG GTTTTACTAA TAATCTCACT
TCCAGCACTT CCAAAAGGTC ATGCATGTCA ACTCTTACGG TACCTTCCTT GTCGACGCCT
GTATTGCCGA CGCTATTAAC TCCCAGTACC CGGACGAGGG ACCTTTCGGC CCCCGAGTTA
AGGAGGAGCG AGGCTGCATC GTGAACATTG CGTCTGTTGT CGCCAAGCCT GTCCCTGCCC
GATGCTTGAC CTACGGTGTC AGCAAGGGTG AGTTCTGATC CTTTATTAGC AATATCAACA
TCAAACTAAT GTGCCATGCA GCTACAGTCT TGGGTATCAG CAGCGGTATT GCCGACTTCC
TCGGCCCCTA CGGTATCCGA GTCAACTCTG TTAGCCCTGC TGTCGTTGCT TCCTCTCTTA
TGGGCCCCGA CCGAATCGTG AGTTTTTCCG TATCGCACGT ATATGTTCGC CATAGACCGA
CATGGTTATA GCCCTACTTC GAGTCTGAGC TCGAGGCCGC CGCCATCTAC CCTCGCCGAC
TCTCCCAGCC CGACGAAGTC GCTCAGGGTA TTGTCTACCT TCTGGAGAAC TCTATGATGA
ACGACTTTGA GCTCAGGGTC GACGGTGGCT GGAGAGGTAG TAGCAACTGG GGCGGCCCCC
ACGACCCCCG ATCCAACGCT CCTTCTCTTG AATAA
 
Protein sequence
MKLEGKAFVV TGGCGSIGGT AAKHIIARGG IALIFDVLPE EAGQAKVKEY HAERAFYFKA 
DITNVEVFSA CIDAALKVIP KGSLFGGVHC AAIAPGRPWD QKLKNSIAHF QKVMHVNSYG
TFLVDACIAD AINSQYPDEG PFGPRVKEER GCIVNIASVV AKPVPARCLT YGVSKATVLG
ISSGIADFLG PYGIRVNSVS PAVVASSLMG PDRIPYFESE LEAAAIYPRR LSQPDEVAQG
IVYLLENSMM NDFELRVDGG WRGSSNWGGP HDPRSNAPSL E