Gene CNL06730 details


Gene Information

Locus tag: CNL06730
Symbol:
ID: 3254939
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 869685
End bp: 873075
Gene Length: 3391 bp
Protein Length: 967 aa
Translation table:
GC content: 49%
IMG OID: 638254150
Product: conserved hypothetical protein
Protein accession: XP_568192
Protein GI: 58261564
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
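
The Start bp, End bp and Gene Length fields above are consistent with 1-based, inclusive coordinates (873075 - 869685 + 1 = 3391). The following is a minimal sketch of that check; the function name `gene_length` is hypothetical and the inclusive-coordinate convention is an assumption inferred from the numbers in this record, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields of this record.
print(gene_length(869685, 873075))  # 3391
```

Because the gene is spliced, this genomic span is longer than the coding length implied by the 967 aa protein.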


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTGCTTCTC TCGACTTTTA TCTTAGCATC TTCATTTCAA CTGCTTATAG GAACACCTTA 
AATGGCCGAA GATAATCATA ACGAACCTCC GCCGCCCAAT ATACCGACGC CTACTTTCCC
ACAATCAAAA ACAATAAACA CAGCTTTCGC ACGCTCACCC ACTACCGATA GTGAGATCGA
GCCTGCTGAC GAGGAAGAAG GAGATCATAG ACCCTTCAAC CCACGTACTT CCACTACTGA
CGAGCATCAA ACGCTCTTCT ATCAAGACGC AAGTCACTTT AATGTTGAGC ATCATGTCGG
TCGAGTTTGC CGCAGTTTTG CGCCACTCGC ACTCCTCGCC ATCATCGCGC TTGCATTCAT
TCCTGTTGCA TCTGCTACCG ACCTAGATCC CTACACTTCA TGTCCAGCAC TTCCTCAGAA
GAATACTCCT ATTCTCTCTA CCACCGGTTA CGTTTTCAAC CATGTTGTAA GGTTTATCGG
CCTTGACGAC AGTTTCGTCG GCCAGCACCT CGCAAATGTC AAGAGGAATC TCGACAACAC
GAGTATAATT GAAGCTTGTA TGGTACCTGT GCTCGTATTG TTGAGTGGCA TGTTTGCGGG
TTTGACTCTT GGGTGAGTTA AAAGGCTTCC TTTGCACAAA GGAGCATTGA TTGATCTCGC
GTAGTTACTT TTCTGTGGAT CAGACTCAGC TGCAAGTGCT TGCTATTTCA GGAACACCTA
AGCACCAAGA GTATGCTAGG CTTATCATGC CAGTCCGGTA CGTCAGCTTC TAATTGTATT
TTGGACGTTA TTGACGTATA AAAAAATGCA GAAAAAATTC ACACTTGCTT CTTACCACTC
TAATTCTCGG TAACATGATT GTTAACGAGG CACTGCCTGT TGTCATGGAC GGTCTTTTGA
GCGGTGTTGT ATCCGTCGTC GTCAGTACCG CAATGGTTGT CATGTAAGTC TCATTCATAT
CATGCCCTCT TTTCAACTGA CATCAGTGTA GTTTTGCCGA GATCATCCCT CAATCAATTT
GTTCCCGGTA TGGTCTCCTC ATTGGTGCTC GTATGGCCTG GCCAGTCCGT ATCATGATTT
GGATCGCTTA TCCCATTGCA TGGCCAATCG CCAAACTTCT GGAATGGGTT CTTGGTGCGC
ATCACGGAAT CATCTACCGC CGAGGCGAAC TTCGCGAACT CATCAAGATG CATGCCGCGG
GCGGTGAAGG TGGTGGCGAC TTGGATTTTG ATACTGTACA GATCACTCAA GGCGCTTTGG
ACCTTGCTCG AAAGACGGTT AAGGATTCCA TGACAGCCAT TGAGCAAGTG TTTATGCTTC
CTATCGAGGC AAAGCTCGAC TATGAGACAT TGGGGCATGT TGTGAGGTCT GGCCACTCGC
GTATTCCGGT CTACCAGATG GTCGAAGTTC CTGATATTGA TCTTTCGGCT CCTACTCTTG
GTCCCACTAA GACAAAGATG GTTAAGAAAG TTTTGGGCAG TTTGCTTGTC AAGAGCTGTG
TTCTGCTGGA TCCTGAAGGT GATTCTAATC AAGCTTGCAG CTAAACCTAC AACGAATGCT
ACTGACTGAG TCGCAGACGC TACTCCTCTT GCTTCTATCC CCATCAACGC CATCCCTTCT
ATCCCATTCG ATGAGCCCTT GACCAACATG CTCAACGTCT TCCAAGAAGG TCGCTCACAT
ATGGCCATTG TCTCTCGTCG TGTACGCCGT GTGGGGCCTG TCGATCCCGA AGATGCCCAG
TCTGCCATGA CCGCTGCTGC CGGTGGCCTT CGTCAACGAT TCTTCCGAAG GGTTGCTGGA
ATCTCTGGTA ATCGATCATT CTCTGATGGC GACTCTTCTG CTTCTGGAGA AGAAGACCTC
GAAAAGGGCG AAGGAGGGAA GAAAAAGAAG CGAAGGCGCA AGAACGCTGA GAAACATGAT
CGGGCTGGTA GTTGTAGTTC TGACAATCAC ACTATCGTTA GACCCATTTC TAGTGACGAT
GACGGGTTTA CGGAAGCCGA TGAAATTAAG CGTATGGCAA AGGAACAGCA AGAAGCAGAA
ATGAAGCAGC AGCGCGGGAA AGACAGCAAA AAGGGATCCC TCGTTCAGGC CGCGAAGTTG
ACTCAGCTCG AGCAAAGTGT CCCTGCCGAT GCTCAATTGT CTAACGACGC TGTGGAAAAC
TTCTTCGATG GTTTGGAAGG CGCACCATTG GGTATCATCA CGCTTGAGGA TGTCCTGGAA
GAGCTTATTG GAGAAGAAAT TTATGACGAG TAAGTCATCT TTGTAATGTA TAGAGAATTC
TTGACTGATA AGCGCGATAG GTACGACGAA CATGGAGTGC CTCGTTCAGC AGCATCCGCT
TTCGTCCCTC TTGAAGCTAT GCTTGCTGCT CGAAAAGCTG CTCTCGCCCG TCAAGAACTC
GCTATTGCCC AATCTACCCC AATTCCTCCA GTCAGCGATG CCGATGTTGA GCAAGTTGTT
GCCGCACCTA CTCCTGGTGC CGGTCGACGG GTCAAGGCGA AGATTCAGAT CCCCAAGTTT
TCGTTGAAGA AACCCGTTTC TCAACCCGGC AGGCCTAGAA CTGAGCGTTT GGCTGCTAAT
GAAACGCCTG CCGCTACTCC CCCAGCTGAT CCACCTTGTT ATACCAATAA ACCCGACGAG
AAGTCCACAT ACAGCGCTCA AATTATCACA ATCCCTGGTG AATTGGATGA ACCGCCTTTA
GCTTGTGCAC AGTCAGATAG CAGGCTTGTT AACCGCCGTC TCTCTACTCC ACTTGATTCT
GGTGCTACCG CCACTCCTTC ACCTCAGCCC CAGTATAGCT TATCAGTCCC AGGCCCTGCA
TCTGCTGTAC CCACTCGTCT CCTTCCTGCT GGAGGGACTG TTTCAACCCC AGCTGTCATA
CCTACATCAC ATCAACCGAG CTTGTTGAAC GAAGCAATCT TCATTGGGCG TGAGCGAAAG
CGTATGGCCG CTTCCCACCC TGGTCAGGTA CGAAGCCAAA GTTCTGGTCC TACCGTATCT
TCTAAGCAAC CTACCCCTCC GATTTCAGTT TATGATTTTG GTATTCAAGC GGCGGGAGTG
AATGGAGAGA GAGTTGAAGG ACAAGAGATT GTATCTCCCC GGCCGATAGC AGCGCAACAG
AAGAAGATCC CCAAGTTCAA GAGTGTACCC ACGCCTGTTG CTACTCCCTC TTTTGGCTTC
GATCCATCAG CCGAAAGCAA GAGTAGCAGT CGGGAGGAAA AGGAGTAAGA GTAGTGGTAA
GAGTTTGATG TAACGAGGTA ATGTGCGTTA TGTCTGAATG AACGAAGGCA GCGAAAGGCG
AAGGGGGAGT GAGTGTAAAA TGCAGGATGG GGATCTTGGA TTAGATATGT ATATGGGATA
TTTTGTCTAT AGTTATAACT CAAAACGATG C
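
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence can be measured or compared with the GC content field. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the sequence above, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the letters."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
example = "CTTGCTTCTC TCGACTTTTA"
print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 40.0
```

Applying `gc_percent` to the full sequence block gives a value that can be compared with the rounded GC content field listed under Gene Information.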
 
Protein sequence
MAEDNHNEPP PPNIPTPTFP QSKTINTAFA RSPTTDSEIE PADEEEGDHR PFNPRTSTTD 
EHQTLFYQDA SHFNVEHHVG RVCRSFAPLA LLAIIALAFI PVASATDLDP YTSCPALPQK
NTPILSTTGY VFNHVVRFIG LDDSFVGQHL ANVKRNLDNT SIIEACMVPV LVLLSGMFAG
LTLGYFSVDQ TQLQVLAISG TPKHQEYARL IMPVRKNSHL LLTTLILGNM IVNEALPVVM
DGLLSGVVSV VVSTAMVVIF AEIIPQSICS RYGLLIGARM AWPVRIMIWI AYPIAWPIAK
LLEWVLGAHH GIIYRRGELR ELIKMHAAGG EGGGDLDFDT VQITQGALDL ARKTVKDSMT
AIEQVFMLPI EAKLDYETLG HVVRSGHSRI PVYQMVEVPD IDLSAPTLGP TKTKMVKKVL
GSLLVKSCVL LDPEDATPLA SIPINAIPSI PFDEPLTNML NVFQEGRSHM AIVSRRVRRV
GPVDPEDAQS AMTAAAGGLR QRFFRRVAGI SGNRSFSDGD SSASGEEDLE KGEGGKKKKR
RRKNAEKHDR AGSCSSDNHT IVRPISSDDD GFTEADEIKR MAKEQQEAEM KQQRGKDSKK
GSLVQAAKLT QLEQSVPADA QLSNDAVENF FDGLEGAPLG IITLEDVLEE LIGEEIYDEY
DEHGVPRSAA SAFVPLEAML AARKAALARQ ELAIAQSTPI PPVSDADVEQ VVAAPTPGAG
RRVKAKIQIP KFSLKKPVSQ PGRPRTERLA ANETPAATPP ADPPCYTNKP DEKSTYSAQI
ITIPGELDEP PLACAQSDSR LVNRRLSTPL DSGATATPSP QPQYSLSVPG PASAVPTRLL
PAGGTVSTPA VIPTSHQPSL LNEAIFIGRE RKRMAASHPG QVRSQSSGPT VSSKQPTPPI
SVYDFGIQAA GVNGERVEGQ EIVSPRPIAA QQKKIPKFKS VPTPVATPSF GFDPSAESKS
SSREEKE