Gene CNM00020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM00020 
Symbol 
ID3255289 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp2815 
End bp4183 
Gene Length1369 bp 
Protein Length201 aa 
Translation table 
GC content45% 
IMG OID638254162 
Productexpressed protein 
Protein accessionXP_568392 
Protein GI58261964 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGCTGATGT AGTGCGGCAA CCTGGCTTTC CATTTACTCT CATCTTCGCC AGTTACCACG 
ACCTCTTTTT TCGGATACCT ACATCCGGGA GAAGGAAAGA TGCGAGGAAA ACGTGCGGGA
TTATGAGTGG GACTCGGATG TAAGGACTTT CAACTTGCAT GCTTTTCCAC CATCCGCTGA
CGTTGTACAG CAGGTCGGCT TCTCATCTGT TCCCCTCCGC TCGCCCATGT TGTATATGAT
ATCATGTCGT CCTTCTCATA TTTGCGATCA GGAGGAGTTG AAGTGATGAT GCCGGATCAG
AGGCTGGTGG AGCAGGAACG AAAGAGAAGG GAAGTGCGCT CAGAGGGACC GCCGCTCAAC
ATTCGTGAAC CAATGTCCGA TGAGGAGAAA GCGATGAGTG GTCCAAATTA TCAAGCAATT
GCAGGGCAAT ACATGGGATC TAAGACGTAA GTGCTTTCGA ACTAAACGCT TTTCTCGCAT
CTGCTGATGC TGTATACAAG CCTAATTGCG ATATACTATC TTTGTCTTGC GAACTGCTCC
AGTCGTATAA TGTTTCAACG CCGCTTACAC TTCCTTGAGA TTCTATCCTT ATACCTAGCT
TCAATCTTTC CATCAGGCGA TTAATTCTCA TTGACGATGC CCGGAAATGA GAGTGGCTCT
GAGGATGACT TTCTGAGTAT TATGTTCCGT GAGTTGGGAG CGACCTTCGC TTCCCTGTGT
CTTTTTATTG TGGCTGATGT CTCTCCCAAC AAACTGCGTG TCCTCTTTCA CAATGATTAG
CTTCCAGCTC GTCTCGCAAT GACTTGTTCA ATCATAAACT GTTCAGTTTA TAGACGGCTA
GATATGTTCT CAAAACGTCT CACATCTGCC CACCCCTTTC GACATTATCC CTCACGATTT
CACATCAAAG GTCAGCCACT GTCCACCTTT CTCCTCAGTT ACCTGGATCT TAAACATTCG
CCTATGGTCG TTTCTACCAC TCGTCTAGAT CATCCGGCGA GGATTTATTT GGGTTCTACT
CTACTAAGGA GAAACGATGA ATCAGTTTTG CGAACGACAC AACATGACAA GCGCCATCAT
TAACCATGCA AGCATAAAAA CATTGTGTCT ATTGGGTTCT GCTGATGTAA GAAGGGCCGC
TTTCCGGCTG ATGGATGACC TATTGAAACA CATCATCGAT AATTCCAGTC TGGTTGCCTC
AGTTGGAATG AGTTCAAAGT GGGCAGGATG ATTTTTAAGG AATAGGCTGC CAGATCGCAT
CTGCATGCAT TGGTATTGAA TATACGGCTG GGAATAGGAT TGGGTTGAAA CTGGATAAGT
AATTAGTTAT CTTAAATATT AGTCAATCGA TGAAATCGAT GATGAATTT
 
Protein sequence
MSSFSYLRSG GVEVMMPDQR LVEQERKRRE VRSEGPPLNI REPMSDEEKA MSGPNYQAIA 
GQYMGSKTLI AIYYLCLANC SSRIIRLDMF SKRLTSAHPF RHYPSRFHIK GQPLSTFLLS
YLDLKHSPMV VSTTRETMNQ FCERHNMTSA IINHASIKTL CLLGSADVRR AAFRLMDDLL
KHIIDNSSLV ASVGMSSKWA G