Gene CNH01200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH01200 
Symbol 
ID3259153 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp836480 
End bp837890 
Gene Length1411 bp 
Protein Length368 aa 
Translation table 
GC content48% 
IMG OID638258363 
Productconserved hypothetical protein 
Protein accessionXP_572310 
Protein GI58270308 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATCAT CAACAGCAGC GACAGTCTTT GGACTCGTTT TGGCGAGCGT GGGAGTTCAA 
GCCTCCACCC CTCTTGTCAG GACGTACCAA GGAGACTCTT TCTTTGACCG ATGGGTGAGC
TGATTCTCCA ACATGGGCTT TACTTGCTGG GCTTGTTGAT TGACATGCTT GGAATCAGAC
TTACTACGGC AACTACGACA ATACCACCAA CGGTGACGCT ATCGTGAGTT TTCAATCTCA
GCTCCCTTGT CATGCCTGTG CGAAGCGGTA CATTTGCTAA GATCAATTGA CCGCAGTTTG
CCAATAAATC CGTAGCCACT TCCACTCCCG AGCTCACATA TGTGACGTCT GATGGCAGCG
CCATCATCCG AGTGGACAAC TCTTCTACCG TTCAGTACAA CTACAAGCGT GATACTGTCA
AGATCACCTC GACTGACAGT TACCCCGTTG GATCTATATG GGTTCTCGAT GCCGTCCACT
TACCATATGG ATGCAGTGTC TGGCCTGCAT TCTGGAGTTA TGGTGCTGGT GCGACATGGC
CTGAGGAAGG TGAAATCGAT GTCATTGAGG GTGTGAACAT GGGTTTCTCT AATCAAATGG
CTTTGCACAC CGAGGACGGG TAAATAATAA TGCGGTTATT TTCTGTTACT ACCTGTGGCT
TACTCTCCAT TGTTAGATGT TCCTTGGGAT CTTCAGGCTC TTCATTCACT GGTATTGTCA
ACGACACTTC ATGCTACTAC GAAGACAACG ACAATTCCGG CTGTGGCGTT ACCGAAACCA
ACAATGCTTC CTATGGAGCC GCCTTTGCTG CCGCCGGAGG TGGTGTCTTC GTCACCGAGC
TAGCCGAGTC TGGGATTTCC ATTTGGTTCT TTAGCCGATC CGATATTCCT GATGCTATAA
GCAATGCCGA CGACGAGATC GACACTAGTA CTTTGGGTAC TCCCAGCGCT TATTGGGGTA
CTGACACCTG TGACATTACC AAATTCTTTG GTGACCAATC TCTTGTCTTC GATATTACTC
TTGTAAGTGT ATATCCAGCC ATCATTTTTT TGGACCAAAA TTAATCCATA TGTTAGTGCG
GTGACTGGGC TGGTCAGTCT AGCATCCTTG CTTCTACAGG ATGCTCTGCT TTGTCTGGTT
CCGACACTTG CTACACTACC TATGTGCTCG ACGCTAGCAA CTACGACACT GCATACGTAA
GTGTTCGCCT TTTCTTTCGT ACATTCTATC CTCTAATATA CATCTTTTGC AGTTTGAGAT
AAACAGCTTG AAGGTCTACT CGAACGGAAG CTCCTCCAAT TCCAGCTCTG ACTCTAGCAG
CAGCTCTGCC CCGTCTACTC ACCGCTTCAG TGCCTTGGGA TGGTTGTTGG CTGGTGTTAT
GGGCGTGTCT GCCTTGGTGG GGATGATGTA G
 
Protein sequence
MISSTAATVF GLVLASVGVQ ASTPLVRTYQ GDSFFDRWTY YGNYDNTTNG DAIFANKSVA 
TSTPELTYVT SDGSAIIRVD NSSTVQYNYK RDTVKITSTD SYPVGSIWVL DAVHLPYGCS
VWPAFWSYGA GATWPEEGEI DVIEGVNMGF SNQMALHTED GCSLGSSGSS FTGIVNDTSC
YYEDNDNSGC GVTETNNASY GAAFAAAGGG VFVTELAESG ISIWFFSRSD IPDAISNADD
EIDTSTLGTP SAYWGTDTCD ITKFFGDQSL VFDITLCGDW AGQSSILAST GCSALSGSDT
CYTTYVLDAS NYDTAYFEIN SLKVYSNGSS SNSSSDSSSS SAPSTHRFSA LGWLLAGVMG
VSALVGMM