Gene CNC02140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC02140 
Symbol 
ID3256606 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp603695 
End bp607033 
Gene Length3339 bp 
Protein Length1065 aa 
Translation table 
GC content54% 
IMG OID638255435 
Producthypothetical protein 
Protein accessionXP_569483 
Protein GI58264654 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.271772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCCAGCAGCG TGGACTAGGA TGGCCTCGGT AAAGGCTCTG CTGCGGACAC CCGCCTACAA 
CCCCCTCCCG CTTCCGACCC TCCCGTCCCC CGAGCCGCAC GGCCGAGACG CCGAGCAGCT
CCCACCGCTC GCAGAAAGGT ACGCGCTCGT CGCCCACGCT GACAACCGTG GCTTACCTTC
TCTTGCCAGC CTTGTTCTCC CATACTCCTT GATACAGACA CGCCACCACC ACCTCACATC
CCTCCTCCCC CTCGCATTCT CCCACCCGCC CATACCGTCT CATAATAAGC CTAGGGCAAA
GCCCTTGCCA AAAGGCGTCT TTGTGTTCCC GACCCCGCCG CCTCCCGTTC ATCCCCTCCC
AACTAGAGAA TATCACGGGC CCAGAGATCT GCTCAATGAG GCCTCTCCTT CTGCAACGCC
TCCTGCCAAC CCAGCATCGA CACCTGTCCC ACCGCAACCC AAGGGGCGCA AGCGTTCAGG
TGAACCTGAA CCTCCAGCCC ATGAGATCGA GTGCATCGCG CGTGTGACAT TGGCTCTGGG
CCCAATGTCT TTCCACGGCA CAGAGCTATG GGTCGGGCGC TTTGTGGAGC CCAGAGCTAA
TAAGCCCAAG AAGGAAAGAG CCAAACCAGG CGAGCGAAGG GAGAGGGAAA AAAAGAGGCG
AGAAGGGATA GAAAAGGAAC GACAGAGAAC ACATTCAGGT CCGAGTGCTA CACCTAAACC
CGCCGCTTCG GTGGCAAGGC CAACAGTCGC CACTGCTGGG CCGTCCGCTC CTCGCATGCG
GCCCCCAGCA CCGGGACCGG CCGTGACAAA CCGCACAGCG GCTTCGCCCC AACTCATCCA
ACTCGTTAAT CAGGCTGCCT CTCGTCACCC ATGGCTTTCA TCTCTTATCT ACAAAGCGGC
GGGTAGCACA GCCAACCAAG ACGAGTTGGA AAGATTAGGG AGAGCAGTGG CAAGGCTCAG
CAAAGGAGAA GCAATCGACG ATCTAGCACC GCAGCGTGTG AACATTGCTT CTAGTGAGGT
TTTGAAAGGG AAAGGAAAGG AGACTGCTGC AAGTACTTCT CACGCTTCAA AGACGTCTGT
TCCCGCAGTT CCAGGGCCAT CAAGTCTATC AGAAAAGCCC ACATCTGCTC CTGCTCCCTT
GCTTTCTCAA AATACTTCCT CGTCAACTCT TACACCCGTC GTACCGGCTG AAAAAGACAA
GGAAGACAAG AAGGATGATG CAGAATCTGA TTGGGACAGT GAAGTTGAGA TGAAAGGTCC
TAAACAGGTC GGAGGAGGCC CCATCGGCCC TTCCACTCTC GATTCTGCGG CCGCAGCATC
CACCATACCG TTGACATCTA CCGCTACTCA ACCTTCCTAT GCACCCAGCT CCGTGCCCGC
TTCCCAGGCG GCCAATCCTT CAGTCCCTTC TCCTGTCCCT GGGATTGTCC CCTCTCCTAT
CCCTGTGTCA GCATCAACAT CTTCTGCAGT GCCATTGTCC CCGCATGCGC CGCCTCCTAA
ACCTAACCTC CCTAATCCAC CGCCATTCTT GTTGATTGCA TTCAAAGAGC ATCCCACAGA
CAAATTCTTA ATCCCCTTGG GATCAAGGAG TTTTGTCAGC CGTGTTGGCG GGGATTGGGT
TACTAGCAAA CCTCCCCATC TTGCTCCCGA CACTACGCTG CCTTTGGGCC AATCCCTGGA
AACCAGTGGC AATACAAATC CAACAGTGGC TCCACAGCCA GCAGCCCAGG CGGCCATCTC
CAGAGAACTG CAAGCTTTAC AGTCTTCTGC AAAATCGTCT TCTTTTGAGC TAGAGGCTCA
GTCACACTCA AAACGCAGAG GACGTACCCA TGTTCGCCCA ACCAATGCCA ATTCCCCCTC
TCCGGCACCA CCCAAAGTCA AGACCAAGCC CATCGAAGAA CCCAGCTTGA CAACAACAAC
AACAACACCA GAGTTTCCGC CTCTTCCTCA ACTTCCCGGT CAAAATCCTC CCCCTGGAAC
AGTCCTCATC TCTACTCTTG TGCCGGCTAA TAAATGGAAC AAGGTTGACT GGGCGTCATT
GGGCAAAAAA GTACCTTGGT CTGAGGACTG GAACAGTAAA GTCAAAGCTG GGACGAATGA
AAATGTCAGA GAGGAGGAGC CTTTGCCATT ACCTTTGTCA GATGCCTCAT CCCAGAACCA
CGTCCAGTTA CTTAATCTTG CTGCTGAAGA CTTCCTCCCT GAAAATGGAC CTCTGAAAGC
AATCACGATC AGGTTGGGCC AAGTGGACGA TCAGATCTGG GGGAGAATGA AAGACGTGAT
GACCTTGGTC GACCGGGCGG AGATCATGGC CTTGTCGGCG ATGGGCGTCT TGCCGCCTGC
TCCAGACTCT ATAACCCACC CTGCCGACGA CCCAGAGATC AGAGAAGCAT ACCTCTTGCA
CAAAACGTCC CTCTTTTCAT CTCTCATGGG CCGCACTCGA CAACCCCGTC GTTTCTTACA
CACCCGTCCG TCTTCTCCGC CTGCTGCTCT TGTAGATGCA ACAGTAGATA AAATGGCTCC
GCGTCCCTAT CCCATATCTA CCAAACCGCT GTATCATGTC GAAGAGGGTG ACAACGAAAT
GGAGGGGCGA CGGGATAGCG TGAGGCAATG GTCGCCAGAT GTAGAATTCG ATGACGGTCT
CGGGAGAAGG AAGAAGAAGA AGACAGCTCA AGAAACGGTA GGATTCGAAA TGCCCGTCTC
GTTGGAAGCG CTTGATGAGC GGGTGGAGGC CAGTGCTCAA AAAGCACTTT TGGGAAAGCG
CGGAAGAGCC GGTGGAGGAG AAGGAGCAAG GAAAGAAAAG CAAAGGCGGG GAATTGAAAA
AGGAATATGC GAAGGCTGCG CAAGAGAGGG AATTAAGATT TGGAGAAGAG GACCGAGTGG
AAAAGGAACA TGTACGTCTG GTGTCATTGG AGTTCATGTT TGTTAAAAAA TGGATGAGCC
TTACTGACGT TTACGGTTCC AGTGTGTAAT TCATGTGGCG ATCTTTTCAC TGAGGGGAAA
CTGCAATACA GTGACTTGAA GGCCCCTGGA GCAATGAAAA CTCTGTTGGC TGCCAACCAG
GACGTCAGCG GAGCGGACGC CATGCATAAT CAGGTGGAAG AAAAGAATGA CAGTGTGCCT
GTCAAGGCCG AGCAAGGTCG TACGACCGAA GAGGCTCTCG GGGACGGCAC AGCCATCAAT
TCGGAAGAAC AGAAAGAAAC ATCTACCCAG GTCGTGCAAG CTGAGAGACC ACCAGAACAT
TCGCCACAGC ATCATGCTGT CGAGAGTCAA CCTGCAACGC AAAGCTTGCC AGAAACGATA
GAACCAGGGG TGGACATGCA GAATCAAAAG AGTTTGTAA
 
Protein sequence
MASVKALLRT PAYNPLPLPT LPSPEPHGRD AEQLPPLAES LVLPYSLIQT RHHHLTSLLP 
LAFSHPPIPS HNKPRAKPLP KGVFVFPTPP PPVHPLPTRE YHGPRDLLNE ASPSATPPAN
PASTPVPPQP KGRKRSGEPE PPAHEIECIA RVTLALGPMS FHGTELWVGR FVEPRANKPK
KERAKPGERR EREKKRREGI EKERQRTHSG PSATPKPAAS VARPTVATAG PSAPRMRPPA
PGPAVTNRTA ASPQLIQLVN QAASRHPWLS SLIYKAAGST ANQDELERLG RAVARLSKGE
AIDDLAPQRV NIASSEVLKG KGKETAASTS HASKTSVPAV PGPSSLSEKP TSAPAPLLSQ
NTSSSTLTPV VPAEKDKEDK KDDAESDWDS EVEMKGPKQV GGGPIGPSTL DSAAAASTIP
LTSTATQPSY APSSVPASQA ANPSVPSPVP GIVPSPIPVS ASTSSAVPLS PHAPPPKPNL
PNPPPFLLIA FKEHPTDKFL IPLGSRSFVS RVGGDWVTSK PPHLAPDTTL PLGQSLETSG
NTNPTVAPQP AAQAAISREL QALQSSAKSS SFELEAQSHS KRRGRTHVRP TNANSPSPAP
PKVKTKPIEE PSLTTTTTTP EFPPLPQLPG QNPPPGTVLI STLVPANKWN KVDWASLGKK
VPWSEDWNSK VKAGTNENVR EEEPLPLPLS DASSQNHVQL LNLAAEDFLP ENGPLKAITI
RLGQVDDQIW GRMKDVMTLV DRAEIMALSA MGVLPPAPDS ITHPADDPEI REAYLLHKTS
LFSSLMGRTR QPRRFLHTRP SSPPAALVDA TVDKMAPRPY PISTKPLYHV EEGDNEMEGR
RDSVRQWSPD VEFDDGLGRR KKKKTAQETV GFEMPVSLEA LDERVEASAQ KALLGKRGRA
GGGEGARKEK QRRGIEKGIC EGCAREGIKI WRRGPSGKGT LCNSCGDLFT EGKLQYSDLK
APGAMKTLLA ANQDVSGADA MHNQVEEKND SVPVKAEQGR TTEEALGDGT AINSEEQKET
STQVVQAERP PEHSPQHHAV ESQPATQSLP ETIEPGVDMQ NQKSL