Gene CNH03790 details


Gene Information

Locus tag: CNH03790
Symbol:
ID: 3259231
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 26252
End bp: 27677
Gene Length: 1426 bp
Protein Length: 324 aa
Translation table:
GC content: 51%
IMG OID: 638258105
Product: expressed protein
Protein accession: XP_572579
Protein GI: 58270846
COG category:
COG ID:
TIGRFAM ID:
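
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the coding sequence ends in a single stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 26252
END_BP = 27677
PROTEIN_LENGTH_AA = 324

def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def max_coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (3 per codon)."""
    return 3 * (protein_length_aa + 1)

gene_length = span_length(START_BP, END_BP)
coding = max_coding_bases(PROTEIN_LENGTH_AA)
noncoding = gene_length - coding

print(gene_length)  # 1426
print(coding)       # 975
print(noncoding)    # 451
```

Under these assumptions the span matches the listed gene length of 1426 bp, and 451 bp of it cannot be coding, which is consistent with the gene being flagged as spliced.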


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00355884
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATATAT ATACGGCCGA CACCGGTCTG CGATCCTTCC TGTCTGCACC TCTTATTGCT 
TTCTCACAAC ATCACTCCAA CCAGCCAATA AACACAACAA CTACTCTGAC GCCTGCCATC
ATGACTAGGA CTGAGGTGAG TTTTGGCCGC TCCACCGCCC GAGCTGGTGA CGTCTCCATC
TCAATACCGC AGTGGAGCTG CATGCCAAGT TTGGACCAGG CTAACAACTG TCCTTCCGCT
TCGTCTCCAA TCCTACCAAT CACTGTTTGT CTGCCTGTCC TCTATGCGCT GTGGACCGCT
GCCCTCCTAC GGCCTCACGT AAATGTGCCT TCACCATGGT ATCTTCATCT GGAATGTCTT
TTACCACCGT GCTGCGCCCT TATTGTATGT CCCCGTGACC GTCTCCTGTC TAACCGGACT
ACTCCACTCC GCGCCTATCT TTTCTTCCCA TCTTGCTTTT ACCCTTGTTT CGTCTACGAT
CTATTCCCCA CCATTCCTTC TCGCTCCACC TATCCCATCT TTCCATGTCA CCCCACTACG
CACTGACCCA ACAGCGCAAC CAATACCCCG CTGCCGTCCT CAAGGACCGT CATTCCCGCA
CTGGTCTCGA CAAGACCCAG TGGAACCACA AGAACGGCGA TGGCGCCCAT AACTGGGGAT
CTACCGCCCG TAAAGGCGAT GACGAAGCTT CCGGCCGTCT TGACGGTGAG GCCGAGGCTG
AAGCCGCTCT AGACGAACTC CCGTCTTCCG ACGTTTTTGA CCTCGATGAA GAGATCAATG
ATCCTGTCGG TGCGATGCCT GTCACCGACT CTGGCAATGA CTTTAAGCCG ATGGACCTTG
GCAAAAGAGG GAGTATCCAG GGCCAGAGTA ATATTGCTAC CAGCCCCACC GATAGTATGA
GCAGCCTTGA CTCTGGCGAC AGGCCTGGGA TGGGGAGGAG GATGAGTGCG GTGAGTGATG
AAGAGAGAGA GAAGATGAGG CTCTATAGGG AGGGTGTCCT CCACAAGAAG CAGGGTTGGT
TTTCATATCC ATCTCCGCTT TTACTTATGC TGACTTGTCC TTAATTGCAG GCGTCGACTT
GGCCCACATC GCTAGKTCCT CGCACGGTAT CGCCATKTCG CCTCCCACCA ACAGCTACCT
CGGCCCTKTC AGCCCTTCCA ACMCCAGGTA TGGTTTCAAT TTTGTAAGTG TTCAACTTTA
GCACCTCTTT ACTCTTATTA ATACAATATA GAACAAGTAA AATGTTGTAT CATTATATAT
GTCGAGCCGG TACTCGACCC CACCAATGTG CATTTAGTCT TTTGATGTTG TGTCCCCGCG
CTCAGAAAAT GTTGATTAGC CTTTATATCA GAAACCCTCA CGTTGTAGGA TAGTCAGTCG
CAGCAATGCC GGAGCGTTTG TAGTAAATGT GACCAGGAAA GATTTA
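
The gene sequence above is laid out in space-separated blocks of 10 bases and contains a few IUPAC ambiguity codes (K, M). As a rough sketch of how such a listing could be cleaned and summarised, the helpers below (hypothetical names, not part of the source database) strip the whitespace and compute a GC fraction. Ignoring ambiguity codes in the denominator is an assumption made here; the database may compute its GC content figure differently.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the bases."""
    return "".join(blocked.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C among unambiguous bases (A, C, G, T) only.

    ASSUMPTION: ambiguity codes such as K or M are skipped entirely.
    """
    gc = sum(seq.count(base) for base in "GC")
    acgt = sum(seq.count(base) for base in "ACGT")
    return gc / acgt if acgt else 0.0

# First row of the gene sequence listing, copied from the record.
first_row = clean_sequence(
    "ATGTATATAT ATACGGCCGA CACCGGTCTG CGATCCTTCC TGTCTGCACC TCTTATTGCT"
)
print(len(first_row))                 # 60
print(round(gc_fraction(first_row), 2))
```

Passing the whole listing through `clean_sequence` should give a string whose length equals the Gene Length field (1426 bp), provided no rows were lost in extraction.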
 
Protein sequence
MYIYTADTGL RSFLSAPLIA FSQHHSNQPI NTTTTLTPAI MTRTEVSFGR STARAGDVSI 
SIPQWSCMPS LDQANNCPSA SSPILPITVC LPVLYALWTA ALLRPHVNVP SPWYLHLECL
LPPCCALIRN QYPAAVLKDR HSRTGLDKTQ WNHKNGDGAH NWGSTARKGD DEASGRLDGE
AEAEAALDEL PSSDVFDLDE EINDPVGAMP VTDSGNDFKP MDLGKRGSIQ GQSNIATSPT
DSMSSLDSGD RPGMGRRMSA VSDEEREKMR LYREGVLHKK QGVDLAHIAX SSHGIAXSPP
TNSYLGPXSP SNXRYGFNFV SVQL