Gene CNC04670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC04670 
Symbol 
ID3256188 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1411245 
End bp1414467 
Gene Length3223 bp 
Protein Length881 aa 
Translation table 
GC content50% 
IMG OID638255686 
Productconserved hypothetical protein 
Protein accessionXP_569724 
Protein GI58265136 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCTCAATCCT CCATCCATCC TTTCTAGAAG CCCTAATCCC GACCTCCTGG CTCCGCTTCT 
CTTCTTCTTC ATTCTCCGTA CAACTGGAAC CTGGTGTTCT GTACAGCCTT GACGGCATCT
ACTCTCGTTT AATCTGTTTG CTGGTCAAAC TAATCTGGAG TTTCAAACCA GTAACCGATC
ATTCCCTAGT ACCTCACTCT TTACTCATTG ACACCCATAA TGGCCCCCAT CATTGAAGCT
GCCGTGGTTC TCCTAGCTCT TACGGGCAAC ATTGTGGAGG CTAAGCCCCT GCGCACTCCC
GGTCGACATG CGCCCCCAGG TGTGGCTAAT GTCCTCGCTT CGTCCAAGAG GTCTCTTCAT
AGTCTTCTTG CCAGGTACTA CGGAACTGCT CACGGGCTGG TGGGTCTTTG TCTACTCTTA
TATACATCGT TTCTGATCAC GTCCCTCCGT AGAGTAAACC GCCTTCTCTT CCTACCAAGC
GTGACACGTC CTTACCGAAC GGGTGGTCAA CCTTTGGTTG CGTCGCCGAG TCTTATGACG
AGCGTTTACT GCAAGGATTT GCATTTTCAT CTTCCAGCCT TACTCCTTTT CTTTGCGTCA
CCGAATGTAC AAAATTAGGG TATACTATGG CAGGCACCGA GTACGGCGAT GAGGTGAGTC
GTGAAAGTTT GTTCGCAACT TGTCGCTAAC TGTCCGTAGT GTTACTGTGG CGATAATTTC
GTCGGTAATG GCGGTGGCAT GGCTCCTTCT TCCTTCTGCA GTATGCCATG CGAGGGCGAC
ACAAGTGAGA TGTGCGGCAA CAGTTGGTAT CTTAATCTCT ACACGTACAA CTCTAGCGCT
CTTCCTCTTT GTAGCGGGCC TACCAGCACG GTTTCTGCTC CTGTGGAAGA AACTAGTAGT
CTTTTGTCTA CGTATTACTC GACCTTTGAA TCCTCTGTAA CCGCCACCTC ATCTTTGGCG
TCTGCTTCTT CAACAGATGC CACCATTAGT GCCAATTCAT CGAGCTCCGT GATGTCTGCT
TCCGCGACTG CGACGGTATC CTCAAGCGCT ATAGAGACAG CCGGGTCTGT GACCAATAGC
GTCTCTAGTG ATACCGCTCC TGCAGCGACA TCCATTTCGA CTTGTCCCAT ACATGAGGAT
TCCGACGACT CTTCCGAGTG GTATGCGCTT GGCTGTGGCT TGGACTCCGA GGATCGGATT
CTATCGTCAT ACTTTATAAG CCTTGACAAT ATGACTGTCG ACTCTTGTCT CACAATCTGC
GAAACCCGTG GATACGTGTA TGCGGGGTAA GCATGCTCGT TAAATGGTGT TCGATTAATT
AGCTGACATT GTGGATCAGC CTGCAATACT CCGATGAGTG TTACTGTGGA AACTCATTAT
CCTCGTCTAC AAGTTACGAC AGCACTCGGT GTGATATGGA CTGCGCTGGG GACTCTGAGG
ATACTTGTGG TGGTACTTGG GCTATTGAAC TCTTCAGTCT CATCTCGTCA TCTTCAAGTT
CCTGTACCGA TAGTCTGTCC ACCGAGAGTG CAACCACAAC TCTTGTTACC TCAACTACTA
GCGGGTTCAA CACTGCAAAT ACCACCGCGA TTGCGAGTAG CACAGACTCT GCTTCTTCAG
TCATTGTTAC CTCCTCAGAA CGCGCTACAG AAGCTACTTC TGTTACCGAG TCTACGGCGG
GATCTGAGAC AGTCAGTGTC CCCGTCACGT CGGCATCCGT CATTTCCCCA ACTAGTACGA
CTTCTACGGA GTCCACCGCT TCTGCAGCCT CAACTTCTGT CCCATCTTCT TCCAGCACTC
ATCAAGTCTG GGCTCACTAT ATGGTCGGTA ATACCTACCC TTATACTGTT TCAAATTGGG
CTAGCGACAT TTCTGCCGCT TTAGCGGCTG GTATTGATGG GTTCGCACTC AACATGGGTT
CCGACGACTG GCAACCCGCT CGTGTAGCAG ATGCGTACTC TGCCGCCGCT TCTACAGGCT
TTAAGTTGTT CCTGTCTCTT GACATGACCG TTCTCAGCTG TTCGTCATCT TCGGATGCCG
CAAAGCTCGT CTCTATCGTT GAAGGATACG CAACTGCGAC CGCTCAAGCC ACCTACGAGG
GCAAGGTACT CGTCTCCACC TTTGCTGGTT CGGATTGTGC TTTCAGCTGG CAGACAGACT
TTGTAGACGT TCTCTCGTCT GCTGGAATCA ACATCTTCTT TGTACCTAGT ATCTTCTCCG
ACGTCAGCAC GTTCTCTTCC AACACTTGGA TGGACGGTGA GCTCAATTGG AATTCTGGGT
GGCCGATGGG AGCCGAGGAC ATCACTACTA CGTCAGATGA CGCGTACATG GCCGCCCTTG
GCAGCAAAGA ATACATGCCT GCTGTGTCTC CGTTCTTCTA CACTCACTTC GGTCCCAATT
CCTGGAATAA GAACTGGCTT TACCGTTCCG ATGATTGGCT CTACTGCACT CGATGGGAAC
AGATTATAGC CATGCGTGAC AGTGTGCAGA TGACGGAAAT TCTTACTTGG AACGATTTTG
GAGAATCCTC GTACATTGGT CCTATTGAAG GTGCTCTTCC TGCAGGCTCT GAGGCCTTTG
TTGATGGTTT CACACACACT GGGCTCTACT CCCTCACCTA CTATTACGCA ACTGCATTTA
AGACCGGTGC CTACCCGACT ATCACAGAAG ACGAAATCAT CGTATGGGCC CGCCCTCACC
CGCACGATGC AACTGCCTCG TCCGACTCCA TTGGCAGACC TACAGGCTGG TCCTACACAG
AGGATTACCT GTACGCAGTA GCCTTGACGA CAGACGCTGC TACCGTGACT CTTACATCAG
GTTCCACCAC TGAAACGTTC ACTGTCTCAG CCGGTCTCAC TAAGCTCAGG GTCTCCTTGT
CCGAGGGTTC CATCTCCGGT TCAATCTCTC GTTCGGGCAG CACAGTGGCT TCTTATGATG
CAGGCTCCGC CTTCACGTAC ACTACCTCAC CCACCACTTA TAACTTCAAT TACTTTGTTG
GATCTAGCTC TTCATAGTAT TTTTTGTGGT TGACTGGGTT GTGGTATATC AGGTGTTAAT
AGTTGTTCTT TTTTCTTCTT GATCTACAGA ATTTTTTCTT ATCTATATTG ACCCTTTTTA
ATCTTCATAA TGTCTCGCTG CCTGTAATGC CGGCATCCTG ATCATATTAT TACTTATTTA
GGACACATTT ATCTAGCAGA TGATGTACTT AATTTCAGCT GTG
 
Protein sequence
MAPIIEAAVV LLALTGNIVE AKPLRTPGRH APPGVANVLA SSKRSLHSLL ARYYGTAHGL 
SKPPSLPTKR DTSLPNGWST FGCVAESYDE RLLQGFAFSS SSLTPFLCVT ECTKLGYTMA
GTEYGDECYC GDNFVGNGGG MAPSSFCSMP CEGDTSEMCG NSWYLNLYTY NSSALPLCSG
PTSTVSAPVE ETSSLLSTYY STFESSVTAT SSLASASSTD ATISANSSSS VMSASATATV
SSSAIETAGS VTNSVSSDTA PAATSISTCP IHEDSDDSSE WYALGCGLDS EDRILSSYFI
SLDNMTVDSC LTICETRGYV YAGLQYSDEC YCGNSLSSST SYDSTRCDMD CAGDSEDTCG
GTWAIELFSL ISSSSSSCTD SLSTESATTT LVTSTTSGFN TANTTAIASS TDSASSVIVT
SSERATEATS VTESTAGSET VSVPVTSASV ISPTSTTSTE STASAASTSV PSSSSTHQVW
AHYMVGNTYP YTVSNWASDI SAALAAGIDG FALNMGSDDW QPARVADAYS AAASTGFKLF
LSLDMTVLSC SSSSDAAKLV SIVEGYATAT AQATYEGKVL VSTFAGSDCA FSWQTDFVDV
LSSAGINIFF VPSIFSDVST FSSNTWMDGE LNWNSGWPMG AEDITTTSDD AYMAALGSKE
YMPAVSPFFY THFGPNSWNK NWLYRSDDWL YCTRWEQIIA MRDSVQMTEI LTWNDFGESS
YIGPIEGALP AGSEAFVDGF THTGLYSLTY YYATAFKTGA YPTITEDEII VWARPHPHDA
TASSDSIGRP TGWSYTEDYL YAVALTTDAA TVTLTSGSTT ETFTVSAGLT KLRVSLSEGS
ISGSISRSGS TVASYDAGSA FTYTTSPTTY NFNYFVGSSS S