Gene CNC03640 details


Gene Information

Locus tag: CNC03640
Symbol:
ID: 3256167
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1152859
End bp: 1154078
Gene Length: 1220 bp
Protein Length: 316 aa
Translation table:
GC content: 49%
IMG OID: 638255585
Product: expressed protein
Protein accession: XP_569589
Protein GI: 58264866
COG category:
COG ID:
TIGRFAM ID:
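
The reported gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive at both ends, which the record does not state explicitly; the helper name span_length is hypothetical and used only for illustration.

```python
# Coordinates copied from the record above.
# Assumption: both ends are inclusive (1-based interval).
start_bp = 1152859
end_bp = 1154078


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate interval."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


gene_length = span_length(start_bp, end_bp)
print(gene_length)  # 1220, matching the "Gene Length" field
```

Under that assumption the result agrees with the 1220 bp listed in the record.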


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTCT CGCCCCCACA AGAATCCCCG AATCTTCCCT CCCTTGCAGC AGCTTTAGCC 
GACACTAGCG GAGCTGCCCA GGGCATCCAA ATTGCCCCAG AACTCATTGG TGACGGTGGT
GCACCTCCTG ACCTTCCTCG GCGCGAAGAG CTTCAAGGTC AACAACCATC TGATGGTGTT
TCACAGGCAT TCGGATCATA CCAAAATTCA TCCACAAGAG ACGATGGATA TCGCGGATGG
TCAAATGGAC CTAATCCTCC TCCAAGGGCG CAGAATCCAT GGGAAGAAGG ATACCGCGAT
CATTCCTCCC ATCCCGATCC TACAATCAAT CAGCAATACT CCTATTCTGA GCAGGCTGGA
CCATCCATCT CCACGGAAAA TATTGAAGAA CCTCAGAAAC CTCCGCGCAA ACGCGCAAGG
CAGTCCAAGC CTCGCGGGCA CGAAAAAAAT GGTGTCAACA GCGATGGTTT GCCGGAGGAG
GGCATACTTG ATTTTGCTCA TCCATCAGGG GACTTCAAAC TTGGTCCAGT ATTCGTACAT
CCGCCTAAAG GAGTTGCTCA AGCGTGTGTT CGATGCCACA AAATCAAGAG AAAGTGTGAC
AATGCGCGAC CAAGGTGTGC AGGATGCAGC AGGGCCGATG TAGCGTGCGT TTTTGAGCTG
AACCCCGCCA CCGCTAGGTA AGTCTGGATC TTGCAGTAGG TTTTGTAACT GTTTAACCTT
TTATGTTCAT TCAGCTATGT CTCGAGCTTG AAGTCAGACA ATGTCACTTT ATCTGCTCAG
ATGGTCTCCG CCGCTGAACG TATCTCTCAA CTTGAAGCTG TACTGGTCAA CACTGGCCAG
GAGATCCCTC CACCTCCACA GACTCTCAAG AACATAGATT TTACCGCCAT TGCCGGGGAC
AAGTTCTCTG CGAAGGATGA TGACGTATCG ACTGAGGATG CAATCAAGAG ACTAGCGGAG
AGTGCTTTGA CCACAAGCCT CCATAAGAGA AGACGGATGT CGTGGTAATG TTTGAATATG
TTTGAATCAG GGAAGTATGC TATCTGTGAG TCTTTTTTTT ATAGAAGGCG CTGCATGCCG
CTGACAAGTG TTGCACTAAA GCACGTCTCG GGTATCCTGG GTTGACATGA TAATTGGGAT
CAAGCTACAT TATTGACCAG CCGTACTGTA CTCCAAGTTT GTAGTCTTAG CCTCGTCAAT
TTATATATCG CCTTTCATCA
 
Protein sequence
MSFSPPQESP NLPSLAAALA DTSGAAQGIQ IAPELIGDGG APPDLPRREE LQGQQPSDGV 
SQAFGSYQNS STRDDGYRGW SNGPNPPPRA QNPWEEGYRD HSSHPDPTIN QQYSYSEQAG
PSISTENIEE PQKPPRKRAR QSKPRGHEKN GVNSDGLPEE GILDFAHPSG DFKLGPVFVH
PPKGVAQACV RCHKIKRKCD NARPRCAGCS RADVACVFEL NPATASYVSS LKSDNVTLSA
QMVSAAERIS QLEAVLVNTG QEIPPPPQTL KNIDFTAIAG DKFSAKDDDV STEDAIKRLA
ESALTTSLHK RRRMSW
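
Both sequences are printed in groups of ten characters, so the spaces and line breaks have to be removed before the text can be used as a sequence. A minimal sketch of that cleanup, plus a simple GC percentage, is shown here; the function names clean_sequence and gc_percent are hypothetical, and the GC calculation is a plain count of G and C bases, which may differ from how the database derives its rounded figure.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, exactly as printed.
first_line = "ATGTCTTTCT CGCCCCCACA AGAATCCCCG AATCTTCCCT CCCTTGCAGC AGCTTTAGCC "
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases on one full line
print(gc_percent(seq))   # GC percentage of this line only
```

Applied to the full blocks, the cleaned gene sequence should come to 1220 characters and the cleaned protein sequence to 316, in line with the lengths listed under Gene Information.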