Gene CNE04080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE04080 
Symbol 
ID3257735 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp1148387 
End bp1151969 
Gene Length3583 bp 
Protein Length908 aa 
Translation table 
GC content50% 
IMG OID638256990 
Productconserved hypothetical protein 
Protein accessionXP_571062 
Protein GI58267812 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCCAGGCGG TGGCAGGCCC CAAGAGGTAA ACAAGAAGTT ATTACTTTAT TAGGCCAAGC 
AGCGGACTAG GCCACTGCGA AACTTTATCA CTTGCCCCCC CGTATCATGC CATCACCATC
GCAGTGCCTT TTTGAGGACC AGGAATATGA CGCTCGTCAT AATCCAATTA TTGAAATGAT
GTCATGAAGC CAACTTCCGT TCCACTATGG TAGCTTTCTT CTTTCCTACT TAAGAACAGT
GCCACCTTCC CATCCTTCCC CCTTTTTCCT CACGATTCTC AGGTTTTTGT TTACACATAA
CAACTTGTCT CTTTACCCAT CAATACCACT ACTGTAGCTG TGAGTTTTCT CTTGGTCGGA
CCTCCTTGCA AGTGAATTCC TTCTCACATT TATCCTTCCC AGATTACTTT CCCACAATGA
GTGTAGCAGC CACCAGATCC GAAATTTCCC ATAACCCCGG CAAGATGAGT GTTACTAGTG
CCGTCTCCGG GGAGAAGGTT GAGGGCGATG TTCAGTCTAG AATGAAGTTG TTCGGTGCGA
TTCAGGCCTT TAGGGATGGG TGAGTTTAAA TATCTCGATA CCATTGATTT TGCAAAATAC
TTAAGATGTC TGTGTTCCGC AGTCGTATGC CTGACAATGA ACAAATCGGC TCTGTTCTCG
AATATGCCAT CGGCCACTCC CCTGTCGACC TCCAAAAGCT CTCACCCGAA GGACGAGTCC
TCATCGATGA CTTTAGGGAC ACTCTCGAGA CTCTCCGCAT GATCGTTCAC GAGAAGAACA
CTGATGAGCT TTTCCAAAAT GCTGTCTGGT CATCATATCA TAGTGATGTT TCAAAAGCTA
AGCAAGATGG TGTCATTCCC GTCAGCAGTG AACAGGCCAA GCAAGATGGC AAAACTGGTG
AGTGAACTGG GTTTGCCCAT TTGTCTAGGA ACTCGACTAA CATTTATTTT GCAGCCGCTT
CCCACATTCG TGTCCTCATC ACTCTCTTCC TTACCAACTC TGAAGCCCGA AAACTTCTGA
AAGACTTTGG CATCGTTGGC CGAGATATCT TCGCTACCGC CGCGACTAAG GCTGCCGACA
AGTCTCGACC CTCTCAAGAA AAGCTCGACT CCGTTGACCA GGAAGCTCCT TCTCATGAGT
GGATCGGTGC CGATGGTAAG AGGCTTGGTC CCAACGAGAC CCCCGACATC CAGCTCAAGG
GTCCCAAAGG TACCCAGGCT AGATACCACC CCAGAGACGA CCCTCGAGAT GCCCAGTGAG
TTATTATTTT CATACATTCT TCGCCTTCCT ATTTGCTGAT TTGTTTTCTA GGCTCATTGA
CGACAAGGGC AACTCTCGAT CTGCTGGCGA GGCCTATAAC CAGGCTCAAG AAGCCAAGGC
TGATGCCCAG TCCAAGGCTC AGGACCTCAA GTCTTCCGCT AGAGACTACA AGGAGACTGG
CAAGCAGCAG GCTCGTTCCC ACGCTCAAGA TGTGGCCGGT AACCGTGACC CCAATGCTTC
ATTGTCTGAG CAGAAGGAGC AGGTTAAGGG TGCTGCCTAC GACAAGAAGG ATGCAGCTAG
TGCCCAGGCC GGCCAGAACC TTCCCGATCC CAATGACGAG GGTAACCAAC AGAAGGCCAG
GGGCAAGGTT GCCGAGTTGA AGGACCGAAT CCCTGATGAA CACAGGCAAA AGGCGGCCGA
CTACATTCAA AAGTCCAAGA ACTTCGTCAA CGATGAGTTG CCCGAGGAGA GGAGAGATCA
GTTCATCTAC AGACTTAAGA AGGTAAATAT CTTGTCTCTT TTGGCACCAA CAGGACAGCA
TGTTTACCCA ATATTTTTAG GTTGTCGTCG AATGCCAGGG TCACAAGGAC TACCAGGAGG
CCATGACTTG GCTCCTTGAC ACCCTCGAGA ACTACCGAGG TCACGCTAAG CACGTTACCA
ACAAGGGCAC CGAGTCTGCC CAAACCGTTT CCAACGACCC TGCCGTGGGC GACTCCACTA
TCCAGTTCCG AACCCTTCTC GAACGATTTG CCAACGGCAA GTCTCTCGAC AACGTCTTTT
CTGCTCTCGA CCAGATCTAC ACCGACGTTC AGAACGACTC TGAGCTCCGC GAATGGTTCA
CCACTTTCAA CGACTACATG CACCGAGTGC TTCTTGAGCC CGGTTACATC CTCGACGAGG
ATTCTGACCG TGAGGCCAAG CAGCTTCGAG AGTCTGGCAG AAGATTCTTC CAGGAGAAGT
ACAAGGCCCA CCAGGAACTT CTTTTCGACG AGCTCCAAGT CTGGCTCACA GCCTTTGGCG
AAGACCCGCT CAACGTCCGA CTTGGTGACG ACATCAAGCG ATTCTTCAAA GACTTGCTCT
TCAACCACGA AGGTAACCTT ACTTTCAAGC CCAAGCTTTG GAACGATGTT CGACAGGTCT
TGATCCCTAT GCTTCTCAAG CAAGTCAGCT ACGTCCCCAT TCCCCGCGCT GAGTACTCCG
ACAACAGCAT CGACTTGGTT ATTGAGGACT TGATCCTTTC TGGTCCTAAC CTTTTCCCCA
ACATTGTCCA CATCGAGTCT TTCAACTCCT TCTCTTTCAG CCCTTACCCC AAGCTGAACA
AGACGATGGA CAACCAGCAC CACAAGTTCA GGTTGAGTCT CAGTCAGATC CAGGCCGACA
TCCGAGATGT TGCCTTTGCC TTTAGGCGTA AGAGCGGATG GCCCAAGCTT TCTGACCACG
GTCTTGCCGA TGTTATTCTT GCTGGAAAGG GTATCTCCGT TGACGTCGAG CTTGAGTCTA
TCGAGAACCG TCGGGACACC GTCTTCAAGA CCAACTTCAT TCACGTCAAC ATTGACACTC
TCAAGTTCGC CATCAGGAAC TCCAAGCACG ACTTACTCTA CAAGGTGAGC ACTAGTCAAT
AATAACAATG GGGTATACTG ACTTGATTAC ATTACAGTTT ATCAAATCTA CAGCCACTGG
TCTTATTAAG AGGGCTATTA CTGCTGCTGT GCAGAATGCC ATGCACACCG CTCTTGGTCA
CCTTGATGAG CAACTCGTTG AGATCCGAAA CAGGGTCAGT CTATCACACT ACATAAGATT
TTGAAGTATT GCTGACACAA TTTTTTTTTC AGGTCGATGA GGCCAAACAG TCTGATGAGA
CAACCCGAAC TGAAGCTCTC AAAGACCTCT ACTCCCGCAA GAAGGAGAGT GCCCAGGAGA
AGAAGGCTGC CGCCGACGAG AAGACTGGTA CCTTCAAGAT TGTAACCGAC CGTGACAGTC
AGCTTAACCC CGAGCTCACC CATGACGGTG GCAAGTCTTG GGCCAAACGA GCGTTTAAGG
TTGAGGACGC TGCTCGTACG GGTAAGGAAT GGAGGTCACC GGCCTTCAAC CTTATTGACC
CTGCCCACCC TGCGGTGACT GGCCAGCATC ACCCTGCTGT CCAAAATGCG GATGTCGATG
CGGAAAGGTT GAAGCAGAAG GCTGAGGCTG CGGCTCCTGG TGTGGCTGCT GGAACTAAGA
GGCTGTAAGA TTTACCGGGT TTAAGTAATG TAGAATATGA TCAGAAGTTT GTTTGTATAT
TACCCGTATA AAATAATTTA ATCGATGAAT ACGCTTGAAT CAA
 
Protein sequence
MSVAATRSEI SHNPGKMSVT SAVSGEKVEG DVQSRMKLFG AIQAFRDGRM PDNEQIGSVL 
EYAIGHSPVD LQKLSPEGRV LIDDFRDTLE TLRMIVHEKN TDELFQNAVW SSYHSDVSKA
KQDGVIPVSS EQAKQDGKTA ASHIRVLITL FLTNSEARKL LKDFGIVGRD IFATAATKAA
DKSRPSQEKL DSVDQEAPSH EWIGADGKRL GPNETPDIQL KGPKGTQARY HPRDDPRDAQ
LIDDKGNSRS AGEAYNQAQE AKADAQSKAQ DLKSSARDYK ETGKQQARSH AQDVAGNRDP
NASLSEQKEQ VKGAAYDKKD AASAQAGQNL PDPNDEGNQQ KARGKVAELK DRIPDEHRQK
AADYIQKSKN FVNDELPEER RDQFIYRLKK VVVECQGHKD YQEAMTWLLD TLENYRGHAK
HVTNKGTESA QTVSNDPAVG DSTIQFRTLL ERFANGKSLD NVFSALDQIY TDVQNDSELR
EWFTTFNDYM HRVLLEPGYI LDEDSDREAK QLRESGRRFF QEKYKAHQEL LFDELQVWLT
AFGEDPLNVR LGDDIKRFFK DLLFNHEGNL TFKPKLWNDV RQVLIPMLLK QVSYVPIPRA
EYSDNSIDLV IEDLILSGPN LFPNIVHIES FNSFSFSPYP KLNKTMDNQH HKFRLSLSQI
QADIRDVAFA FRRKSGWPKL SDHGLADVIL AGKGISVDVE LESIENRRDT VFKTNFIHVN
IDTLKFAIRN SKHDLLYKFI KSTATGLIKR AITAAVQNAM HTALGHLDEQ LVEIRNRVDE
AKQSDETTRT EALKDLYSRK KESAQEKKAA ADEKTGTFKI VTDRDSQLNP ELTHDGGKSW
AKRAFKVEDA ARTGKEWRSP AFNLIDPAHP AVTGQHHPAV QNADVDAERL KQKAEAAAPG
VAAGTKRL