Gene CNH00840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00840 
Symbol 
ID3259274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp949467 
End bp951593 
Gene Length2127 bp 
Protein Length638 aa 
Translation table 
GC content55% 
IMG OID638258398 
Producthypothetical protein 
Protein accessionXP_572277 
Protein GI58270242 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.361474 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACAATCCCC GGTACCCCGC ACAACCACTT CGCCTCAGGA CGTGTCCAAG CACAAACCCA 
TCATTGCTCG GGACCCTCCA AATTGACTCC TTCCAAAGTT CGTTCCATGG GTGACCTCTC
CCCCCCACAG TATGACAAGT CGCTTGGGCT GGATGGTCTG AGAGGGTTTG CCAGACGCTC
AAAGTCTTTT GGGCCGACAC AAAAACGGCT CGGGGAAGGT GGTGAAGGTA GTGAAGGTGA
AGAGAAAGAG GATGACGATG ACGAGGAGAC CGTGATGAGC TTCGTGTTTG GTGATGGAGA
TATATTGATA CCTGCCGAGC CGAGGCCGAG GAAGAAGGTC GTACCCAAAA ACGCACGCGA
TCGAAGAGTG TCTGGTGTGG CCGGTTCTGG TTTATCAAGG CTAGACAATA CCAAGGGAGA
GAGTGACAGC AGACCAGTCT CCAGGATGAG CGGTGTAGCA TTGTCGAGTG CGAGCGGCAA
TGCAAATGTA AAGACAGCGA CATCCGTTAG ACCGCGACCC TCAGCTCTGC GTTCGTCTAC
CCTTCCTCCG GCGACTGACT CTTCCGCATC CACCTCTGAT CCAGCTATCG CCTCAACCAC
CACTACCGCC GCCATTACTC GACGACACCA ACCTCGAGCT GTCAGTCTCG CCACCACCAC
AGCCTCTGCC AACGCCAAGT TCGCCAACCG TCTCCGGTCG TTACCAGCCG ACAGAGATGC
GGAAGGGGCG TACGTGTCAG AAGGGGAGAG TGTCGTGAGA AGAGGAGTAG GGAGGGGAAG
TATGCCGCCT GTGATGGCGA GAGCCGGGTC TGCTGCGTCA AATGTTTCGG CAGCTTCGAG
ATTATCAGGC CCCGCTTCAG TTAAAGGGCC ACCAGTTATG GGGTCGCAGC CGATGAATCG
GGCGATAAGT CATGTGACCA ATCGTGGCAA TGATAATAAG AAGGAAGGGA ATGAAACGAT
TAGAGTCAGG TCGTCTTCTT CGGCTAGGCA GAGACCTTGG AGTACTACCC TTTCATCCGG
TGACGCCATT GCCTCTGGGA CGAGTGCCGC GGTGTCCAAA AGAGCCTCCA CGCCCGCTTC
GGCTGTCAGT ATGACCATTG GCAGACGTTC CATCACTCCT CTCAGTGCCA CATCACCTAC
TGTCACTGCC CACCCTATAA AGACAGCCGC TTCTGTCACT GCTCACGCGA CCAAGACAGG
CGCTCCTGTC ACAGCATCTA CCATTTCCAC CCGTGCAAGC TCAAGAATGA AACCTGGTGT
CGGCAGAGGT AGAATGTCTA CGCCTCCTAG TACCGTCGCT AATGCCAACA CGGATTCTGC
TGCATCCGGT AATACCGGTG CAAGAGCAGG AATCAGTGCG GCAACGGCGG CTAGACGGGC
GAGGGTGTCG AGTTTGGGTC CGAGAACTAG TACTGGGACA TTGACGCCGT CGTTGAAAGC
CAATCCTGGT GCTGCCGCCA AGGCAGCTGT CAAACCGAAC ACAACTGCCT CAGCCGAAAG
TAAACCTGCC TCAGCCGAGA GTAAACCTGC CCCTGCCTCT ACGCTTCGTC TCGCCCCCAA
GCCTACTATC ACTAATGGAC GAGCTACCTC GGCCACTGGC GTCAAACCCA TTATCCGAGC
AAAGGGCACC CCAAGCCCCA GCCCCAGCCC CAGCGTGAGG ACTACTGCGA GTACCTGCAC
GCGGACCAGT ACTCGCACGC CCCAGCGCAA AGGGATCCCC ACTATGTCAA GTATGGATAC
TGCTCTTGTT GAAGTGTGGA AGGAGTATGG GAATGTAGAT ATCGACAAAT TGGGCAGCTC
GACACCGGGG AAAGCGGTAC CCAAGGAGAA GGCGGAAACG CCTGAATCTG TCAAGGCGCC
TGCAGGAGTG AGGAGAGTGC CAGCAGTTGG TGGAGGCGGG AGGGCAAGCG GACTGGACAA
GAGTCCTGTA AAGACGCCAG GATCTGTAAG AAATAAGGCT GCGCCCGTAA GAGGGAGAAT
TGGAGAGATG ACTAAAAGGA GGGATGTTGG GAAAAAGATA TGATGAGCAT TAACGATTTT
TACGACTTGC TACGATGTGA AATATCATTC TTGTATGTAG ACTGTAAGAT AGATTACCTT
TGTCTACGTA TAATGCGCTA TATGCAT
 
Protein sequence
MGDLSPPQYD KSLGLDGLRG FARRSKSFGP TQKRLGEGGE GSEGEEKEDD DDEETVMSFV 
FGDGDILIPA EPRPRKKVVP KNARDRRVSG VAGSGLSRLD NTKGESDSRP VSRMSGVALS
SASGNANVKT ATSVRPRPSA LRSSTLPPAT DSSASTSDPA IASTTTTAAI TRRHQPRAVS
LATTTASANA KFANRLRSLP ADRDAEGAYV SEGESVVRRG VGRGSMPPVM ARAGSAASNV
SAASRLSGPA SVKGPPVMGS QPMNRAISHV TNRGNDNKKE GNETIRVRSS SSARQRPWST
TLSSGDAIAS GTSAAVSKRA STPASAVSMT IGRRSITPLS ATSPTVTAHP IKTAASVTAH
ATKTGAPVTA STISTRASSR MKPGVGRGRM STPPSTVANA NTDSAASGNT GARAGISAAT
AARRARVSSL GPRTSTGTLT PSLKANPGAA AKAAVKPNTT ASAESKPASA ESKPAPASTL
RLAPKPTITN GRATSATGVK PIIRAKGTPS PSPSPSVRTT ASTCTRTSTR TPQRKGIPTM
SSMDTALVEV WKEYGNVDID KLGSSTPGKA VPKEKAETPE SVKAPAGVRR VPAVGGGGRA
SGLDKSPVKT PGSVRNKAAP VRGRIGEMTK RRDVGKKI