Gene CNI00790 details


Gene Information

Locus tag: CNI00790
Symbol:
ID: 3259633
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 189106
End bp: 192568
Gene Length: 3463 bp
Protein Length: 998 aa
Translation table:
GC content: 55%
IMG OID: 638258564
Product: hypothetical protein
Protein accession: XP_572745
Protein GI: 58271178
COG category:
COG ID:
TIGRFAM ID:
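
The Start bp, End bp, and Gene Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which matches the listed length; the function name gene_length is illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
length = gene_length(189106, 192568)
print(length)  # 3463, matching the Gene Length field
```

Under a 0-based, half-open convention the same coordinates would give 3462 bp, so agreement with the Gene Length field is a quick way to confirm which convention a record uses.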


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GAAACATATA AGGAGTCTAA TCAGTACAGG CTACATAAAA CGGCTAACAC AGTAACATCT 
AGGTATGTGC AATGTCTGCA ATGCCATTTT CGCCAACTCC TTGGTAATGA CAATCTGGAC
TGACATTATG GTCGCAGCTC GACCGACAAG GGTACAGACT ATCCATAAAC GCCCACGGAC
ATATAGAGTC GCAGCCAAAA ATATCGGTTA CACCATTATG TACGCGTACC CATTATTGTA
CCATATGTAC CAGTTCTGAT TCCCTGCAGG TCCGACTCCA CAACTCCACC GGACAGCATT
TCTCGCTCCT CTTCTCCAGG TCAGGCCGGT CCCCAACGCC GAGTCGTATC GGGATCTCTG
CGTGACCGTA TAGCCAAATT CAATAATCCT TCAGCTCCAC CCCCCGTTCC CAAGCAACAT
GTCCCCAGCG CACCTGTGTC GCGCGGCATG GTCGGAAACC GTATCCCCAG CTTGGACCGC
AAGTCTGCTG GGATTTTGGG GGTGACTCCT GATAAAAGGG TACCCGAGAG TAAGGGAATG
ATTGGTAACA GGATTCCTAG TGTGACTGGC GGCGGGTACT CCCCTGTGCC CTACCAAAAC
ACTGGGTCCA GTAGTGGCGG TATCTCGACA AATCCCGCGC CTGCGGTGGT TGCTCGCGAT
GCCAGCCCTG CCGGTTCTGC TTCTGGCTCC GTAGACTCTA GTGCTGCCTC CGCTACCGTC
GATAGCTCGG CCTCACCCAT CACTTCACGC TCATCTACAC CTCCCAGTTC GCCTGGGACT
GCAGGTACTG CTGCCACAGG CGACGTCCCC CCTAGCCTCG TTGCTGCCAC TCTTCCCTCT
TTGAATGCCA ATCTTGGTAC CTCGACACCG TCATCCACTC GCGCTGAGGC CGGAGATGCA
GTTTCCGAGT TTTCGTTGTC GGTACCATCT ACTCCCATGG GTAACAGCAC TCCCCCTCTT
CCTGCACCCG AGTACGACCT CGTCGCACCT AATCTCAAGC TCGCAACCGG GTCTATCCCC
CACCCAATGG CGCCTGGTAT CACTTCCCAA TCGGCCAAGT TTGCTCCTAG TGTATCTTCC
TCGCTTGCTA CTCAATCTGT GGGCAGCGAG GAGGAAATTC AGATGATGGC CGATGTTAGT
GGGGTCAGTA CTCCTATGGG TACACCTCGC GCCGAGAGGC GGGGGTTGGG AGAAGGGAGC
GTAGCCGGAG AAGGAAGTGT AGGCGGAGAC GGAAGCGTTG CTGGGGATGA GGAAGGCTCA
GTAAACGATG TGAGCGGCAA GATCGACAAG CTTGATCTTG GCGAAAACAC TCCGCCTTCC
GATGAGACCC TCTCTACCCC TGTCAAGGAC ATCAAGGCCC CCACAGTCCA CGAGCCACTC
GAATCTGCTG CCACCGACAA AGATCTCGAA GGTCCTCCTA CTCCCAAGTC TGCAGCTTCA
AGCTCCGACC CCAGCAAGGC GCAAGACCTT GTTGCTCTCA AGCAGGGCCA GCTACATTCC
TCGCCCAGCA ATGTGCCCTC CAGGGACATT GATGACATGG CTGCGAGCGT TATGGCCCGA
AACTTCGGCG AGTACACTAT TGCCCCCGAG GAGAAGAAGG TACAGGGCGG AGAAGCTGGC
CAATTTATCG AGGATGTGGA GATCCCTGAT GAGATTCTCG TGCCTCCCGC CCAGACTAAG
CAAGAGTACG GTGGGCAGGA AGAGTTGCCG ATTCAAGAAG TCGGCAAGGA AGGTGACAAG
GACGTTATCA AGACTGACGA GGAAAAGGCG GCTGAGCAAC CTGATGTCGT TGTGGCTGGT
AAAGACGTCC CTGCCACCCA ATCCGCCGAA TCTGAAGAGC CAGGGCCCAT CAAGACAGAT
TCGGAGAAGG CCGCCGAGGA GCCCGATGTG GTCACTGTAG GTGAAGACGT ACCGCCCACT
CAGTCTGCAG AGGGATCCGA ACCTATCAAA ACGGACTCGG AGAGGGCGGC AGAGGCGCCA
GATGTGGTCC TCGCCGGCGA CGATGTACCT GCTACCTCAT CTGGTGAAAC TGCCAATAAG
GGGGTCATCA AGACAGATGC GGAGAAGGCC GCTGAAGCTC CAGATGTCGT CACTGTCGGC
AAAGATGTAC CCGCAACACA GTCGGCTTCC CAGGGTGACG AAGAGAAAGT AGTGAAGACG
GATGAAGAAA AAGCAGCAGA GGCGCCTGAT GTAGTTGTGG TCGGAAAGGA TGTGCCTGAA
GTCCAGTCTG CGGAGCAGGA CGGAAGTACG ATCAAGATTG ATGAGGAAAA GGCTGCTGCT
GCGCCCGATG TCATCACGGT CGGCGAGGAT GTGCCTGTTG TACAATATGG TGATGCAGAA
GACGGTGCGG GCCTCATCAA GACCGATGAA GAGAAGGCTG CTGAGAAGCC GGATGTTGTT
GTCGTCGGCC AGGACGTTCC TGCTACTCAG TCTGCGGAGG ATAACACCCC TATCACCGAA
ACCAAGGTAG ATCAACCGCA AGTCGCTGCT GTAGGATCTG AAATCCTGCC GGGAGACGTC
ATCTCCTCTG ACAGGGACAA GGACGTCAAG CAATTTGAGG TTGAGCCTGC CCCTACCGCA
CCCTCAATCC AACCCAGCGC CCCAACATTT CTTGATGTCC CCTCACCTGT CGAAATTGTA
GATGAAGAAA AGACCCCTAC TGGCGAAGAA GCTGCACCCG ACTTTCCCAC GCCTCCTGCA
GCCGATCCTG ACGTTGTAGA TCCCATCTCT GAAACCACCT CTTCGACTGA ACTTGAATCT
CCTCAGCAGA GCGCCGTCGC TGCCTTCAAC GAACCCTCTG ATGCACCCGA AACGCCCATC
GATAAATCAA TGCTGAAGTA CTTCCCCGAA GTCCCTGATG AGGAGAAGCC GAGGGTAGAA
GTCCACGTTT CAAGCCCCGC TGTGACGCCT GCCAAGTCGA AGAAAGAGTC TATCGGGAAC
CCCAAGAGCC CTACAGGGTT GCCTGCTTCG AGCAGTGAAA GGTCTTTAGG GGGGGAAAGC
GACGTTGCTG ACCTGCAAGG ACAAAGCAAA TCCATCAGCC ATCCGACTTT AGATAATATC
ACTCCTAGCA AGTCATCTCT TTATGCAGTA GAAGGAGATT CTTCCGATCA ACTTGACACC
ATGACTCCCG AGGCTGCCAA ACGGCTCTCA AAGTCAAACT CAATGAGGAA GAGCCCTAAG
AGCCCGCTGT TGGATGACGA GGATCCGGGA GATTTCGAGC CTGGAGAAGG ATGGGCGGTT
GTTACTAAGT GAGTCTTTCT CTCCTGTGTC TTTATCGAAT CCCAAACTGA CAAGCCAAGT
AGGGGAAGGG ATGCCTGAAT CGAGGCGACT CTGATGAAGG GTGTGATTTT GACGATTATC
TACATGTGGA TTACGGGAAG CGTCTATGGA TGACGGTCAT GGCCTGGCGT CCGGGGGATC
TGATGACTTA CGTCGAATTT ATGAGGGGTA GAAAGCAATT GTA
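
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be stripped before the sequence can be compared with the Gene Length and GC content fields. A minimal sketch of that step follows; the helper names clean_sequence and gc_fraction are hypothetical, and the short example string is only the first 40 bases of the record, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 40 bases of the gene sequence, copied in the page's block layout.
example = """GAAACATATA AGGAGTCTAA TCAGTACAGG
CTACATAAAA"""

seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Running the same two functions over the complete sequence block should give a length equal to the Gene Length field and a GC fraction close to the rounded GC content value.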
 
Protein sequence
MSDSTTPPDS ISRSSSPGQA GPQRRVVSGS LRDRIAKFNN PSAPPPVPKQ HVPSAPVSRG 
MVGNRIPSLD RKSAGILGVT PDKRVPESKG MIGNRIPSVT GGGYSPVPYQ NTGSSSGGIS
TNPAPAVVAR DASPAGSASG SVDSSAASAT VDSSASPITS RSSTPPSSPG TAGTAATGDV
PPSLVAATLP SLNANLGTST PSSTRAEAGD AVSEFSLSVP STPMGNSTPP LPAPEYDLVA
PNLKLATGSI PHPMAPGITS QSAKFAPSVS SSLATQSVGS EEEIQMMADV SGVSTPMGTP
RAERRGLGEG SVAGEGSVGG DGSVAGDEEG SVNDVSGKID KLDLGENTPP SDETLSTPVK
DIKAPTVHEP LESAATDKDL EGPPTPKSAA SSSDPSKAQD LVALKQGQLH SSPSNVPSRD
IDDMAASVMA RNFGEYTIAP EEKKVQGGEA GQFIEDVEIP DEILVPPAQT KQEYGGQEEL
PIQEVGKEGD KDVIKTDEEK AAEQPDVVVA GKDVPATQSA ESEEPGPIKT DSEKAAEEPD
VVTVGEDVPP TQSAEGSEPI KTDSERAAEA PDVVLAGDDV PATSSGETAN KGVIKTDAEK
AAEAPDVVTV GKDVPATQSA SQGDEEKVVK TDEEKAAEAP DVVVVGKDVP EVQSAEQDGS
TIKIDEEKAA AAPDVITVGE DVPVVQYGDA EDGAGLIKTD EEKAAEKPDV VVVGQDVPAT
QSAEDNTPIT ETKVDQPQVA AVGSEILPGD VISSDRDKDV KQFEVEPAPT APSIQPSAPT
FLDVPSPVEI VDEEKTPTGE EAAPDFPTPP AADPDVVDPI SETTSSTELE SPQQSAVAAF
NEPSDAPETP IDKSMLKYFP EVPDEEKPRV EVHVSSPAVT PAKSKKESIG NPKSPTGLPA
SSSERSLGGE SDVADLQGQS KSISHPTLDN ITPSKSSLYA VEGDSSDQLD TMTPEAAKRL
SKSNSMRKSP KSPLLDDEDP GDFEPGEGWA VVTKGRDA