Gene CNH01750 details

Gene Information

Locus tag: CNH01750
Symbol:
ID: 3259249
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 635043
End bp: 638295
Gene Length: 3253 bp
Protein Length: 912 aa
Translation table:
GC content: 49%
IMG OID: 638258313
Product: expressed protein
Protein accession: XP_572349
Protein GI: 58270386
COG category:
COG ID:
TIGRFAM ID:
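
The gene length in this record follows from the start and end coordinates, which are 1-based and inclusive. The sketch below reproduces that arithmetic; the function name `feature_length` is hypothetical and used only for illustration. It also estimates how many bases of the gene span are not protein-coding, assuming the coding sequence is 912 codons plus a single stop codon (an assumption, not something the record states).

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = feature_length(635043, 638295)
print(gene_length)  # 3253

# Assumption: the coding sequence is 912 codons plus one stop codon.
coding_nt = (912 + 1) * 3
print(coding_nt)  # 2739

# Remaining bases (introns and any untranslated sequence) under that assumption.
print(gene_length - coding_nt)  # 514
```

A non-zero remainder is consistent with the record marking this gene as spliced.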


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGTAGACCGG CTACCACTAA TCCTGATGTT CATGTCTCGT CTCCTACCCG TCCGTCTTTC 
ATGTCTCACA GGCGCGGTCT CCAATCTTCG GATTCAGCCC GTGATAAAGA TCCCATGGAG
AATCTTGAGG GTGTCCATGT CCGTGGTCAT CATCAGAAGA GGGTGAGTAT TTCGTCCCTT
GCGGACCCCG TTGTCAATCA TTCCAGGAGG GTGAGCTTCT CCAGTGAGGC TATCGGTGAT
GGCCAAAAGT CACCAATCTA TCAGATTCCC CAGCGTTTGG GCCGTGGCTC CCTTTCCATC
TCGCCCGAAA CTGAGGTGTC AGACAAGAAT TCCATTGCCA AACCCAGGGC TAAACCCCGG
CCCTTGTCTT TCCAGGGCTC TCCTGCCGCT AGCCTCTTGA GCCCCTCCTC ACAGTCCAAG
TTTTCCATTA CAGCTGCAGG TAAGGGTTTG ACAATTTCCC CTTCTCTTGC CCGAAATCCC
TCCTCTGGTT TTAAGTCCAT TTGGTCTGCG AGTGCCACAT TGCCAGATAA CACATATTCT
GGTTGGAACT CTCCAGTTAA CAGTCCGAAA ACTGTTGGTT CTGACTCGGG CACAATTGGT
AGTGCGGCGC TCGTCGCAAA GAAAAGGGGC TTGTCCATCG CAATTGTTAA GGATAATGGT
GGAGTTATAC CAGTCACTCC CAGCTTGGGA AGTGCTGGGT TCAAATCACA AGGGACTGCA
TTGGAGGGTG CGGAAATGAA GATGATCTGT GCCTCGAAGA ATGATAGGGA AATTGTTCAT
GCTACGGCAG CTTCTGGACT TATCACCGGC AAGCTGGATA GTGCAAAGAG TTCAGGCTCA
AAGAAGGAGA GTAAGTGGCT TTAGTCTTAT CATCAAAAAT GTTAAGTAAG AGCATGAACA
CTAACAGATG ATGTAGTCAT ACTCTGCAAG TTTTACCACA CCACAGGTCT TACGTGTACC
TCTCGCCCTT GCCGCTTTGT TCACGCTCTG TCATCAGCCA AGTCCCCATT GCCCACTGAT
GACATCTCTC AATATGCTAT GCTTTCCCCC ACTCGCCCCA TTGATCCGAC TAGTGGTACT
TTTGCTTTTG CCCAGCAGCA GATCAATCAG GTACCCAAAC AGCTGAAAGT GGATAGCGGT
GGAATTGATC AGGACGACGT TGAGATGGGC GAAAGAGTGG TATTAAAGGA TCAAAACGGT
CAGGAGGTGA CTGGTCAGGT GTTCTTGATG TCTGGAGGAG GCAAGGGGGC TGGCGGTAAA
GGCAGGAACA AGTACAAGAG TGAGTAATAC TGCTGAACAT TAGCAATATA CCTGACGTCG
CCGGAAAGCG GTTCCTTGTA AGGACTTTGC CGAAGGCCAT TGCCCATATG GCGATTATTG
CTCTTTCCTT CAGTGAGTGT TTCATCCGTG CATGTACTGT ATTTTATCTT ACGGGGGGCT
AGCGACGAAA AGACGCGGAA CGTACTGCCT ACCGACGATA AATCCTCCCG TACTAAGGAT
CATAAGCCCA GCAAGGCTGG TCACAAAGAT TGTCTTGTCC CTTCTATCAG AGTTGGCCCT
GATGTCATTC ACAAATCTCG ATCGAGTGGA CTTTCAGCTT TTGGAACCCG TTTTTCCCAG
ATCGCAGTAC AGGTTGACCG CTCCAGGGAG ACCGTTGCGG CTCCCACCCC AAGGCGCGCA
GTGTTTCCTG CTAGTTCCGC CACCCTCAAA GCGCATACGC CTCCTGTGCC TACTAGTGAG
CCAGCGTCTG TTCCCATGGC TGTCGTTACT TCAGCGCCCC CCAAGTCAAA CGCTTGGTCT
AAAGGTCCTC CTGCTCCCAT CAAGAAGATT AGAAGCCTCA AGAAAGTCAC CATCGCTGAT
ATGGGCCATA AGCGTGATCT CAATGTCAGT TTAGATGCGC CTCTCAAAAC CCCTACCTCT
GCTGGGCCTC TTGCCGTACC CCTGTCAGCC ATGTCTCTTT GGACTGAATC GGATCCTGCT
ACGCCTTTCG ATCCCATGAC ACACAGAAAG AGGATGTTGG AGATTGAAGA GCAGGCCAAG
AAGAACGGGC ATAGTAAGGT CATGCCCAAG TCAAGCCTTG TAAACCTTGC GTTTGACTCT
TCCCCTGCTC CGAGTCAAGT CTTTACCCCT TCCGAGTCAT TCCTCCCCTT CATGGCTCCC
ACTTACCCTT GGGGGATGCC CATGTCTCCC GTTTCAGTCG GAGCTTACCA GGATCCTTCG
ATTCCTGACA TTGAAGGAGG TTTGGGCGTT ATTTGGACAC CGACCGGCTG GGCTGTTCAG
GACGTTGCTA TGAAGCACGC ATTGAGGAGT ACCGAGATGA AGCAGAAGTT TGAAAATGCG
AAGGGGAGAA AACCTATGAG TTATTACAGA AGTATGTCGC ATACCCCATA TCTTTCGGCC
TTACTGACCG ATGTATTGCA TAGCTCGTCC TTGCAAGTTC TTTGCTGAAG GACACTGTCC
TCACGGTGAG GAATGCACCT TTCTTCATAT TATTCCAGCT TCGTCTCCCG AGCCGCTTTC
GTCATCCGAC AGCGATTCAG CCGACTACAA GCCCAAAGGG CAAGGCAACA GGCGGCTCAA
GACCTTGCCA TGCAAATTCT TCAACTCGGC GGCAGGGTGC ATCAATGGCG ATGACTGTGC
ATTCCTCCAC ACTCGAATTG TCCCAGAGTC GGCTCCACTT GTAGCGAGAC CTAGACCTTG
GAGAACGAAA CCTTGCAGGC ATTATCAGCT TGGAAGGTGT ATGCTGGGTG ATGCCTGCCA
CTTCGCCCAT GTGGACGATC CTACTTGGAT AGCGTCGGGC CGGAAAACTG GAATTATGAC
ACCTGTCAAG GTTGAAAATG CACTGGAACA ATTAACAGCT GAGAAAGTTG AGATGACTAT
CAAGCGGATT AGGGAAATGA GTAAAGAAAG GAAGGAGGAT GACGAAGAGG ATGATGAGGA
AGACGATATA CAAATTGTGA GTTAATCTAC TTGATTCTTG TTTAAGATTG TGATGTTGAA
AGCGCAACAG GTAACATACA GCACATTGAG CCCCTCGACC TCCAGTTATG GTTCCTCGTA
CGCTGTGTGA ATGACAGGAG CTCCGTATCC TTGCTCTGAA TATAGTTTAA ATGAGTACAC
TATCCTTCAA TTACAAAGAT TACAAGTTTG CAGTGGCTTT TTTTTCTCTT TTTTTTTCAG
AACTGAACAG AGCGATAAGG TGACGGAAAT TATGATTTGT AAAAGTACAC TAGATGATGT
GAAATGGACG TCT
 
Protein sequence
MSHRRGLQSS DSARDKDPME NLEGVHVRGH HQKRVSISSL ADPVVNHSRR VSFSSEAIGD 
GQKSPIYQIP QRLGRGSLSI SPETEVSDKN SIAKPRAKPR PLSFQGSPAA SLLSPSSQSK
FSITAAGKGL TISPSLARNP SSGFKSIWSA SATLPDNTYS GWNSPVNSPK TVGSDSGTIG
SAALVAKKRG LSIAIVKDNG GVIPVTPSLG SAGFKSQGTA LEGAEMKMIC ASKNDREIVH
ATAASGLITG KLDSAKSSGS KKEIILCKFY HTTGLTCTSR PCRFVHALSS AKSPLPTDDI
SQYAMLSPTR PIDPTSGTFA FAQQQINQVP KQLKVDSGGI DQDDVEMGER VVLKDQNGQE
VTGQVFLMSG GGKGAGGKGR NKYKTVPCKD FAEGHCPYGD YCSFLHDEKT RNVLPTDDKS
SRTKDHKPSK AGHKDCLVPS IRVGPDVIHK SRSSGLSAFG TRFSQIAVQV DRSRETVAAP
TPRRAVFPAS SATLKAHTPP VPTSEPASVP MAVVTSAPPK SNAWSKGPPA PIKKIRSLKK
VTIADMGHKR DLNVSLDAPL KTPTSAGPLA VPLSAMSLWT ESDPATPFDP MTHRKRMLEI
EEQAKKNGHS KVMPKSSLVN LAFDSSPAPS QVFTPSESFL PFMAPTYPWG MPMSPVSVGA
YQDPSIPDIE GGLGVIWTPT GWAVQDVAMK HALRSTEMKQ KFENAKGRKP MSYYRTRPCK
FFAEGHCPHG EECTFLHIIP ASSPEPLSSS DSDSADYKPK GQGNRRLKTL PCKFFNSAAG
CINGDDCAFL HTRIVPESAP LVARPRPWRT KPCRHYQLGR CMLGDACHFA HVDDPTWIAS
GRKTGIMTPV KVENALEQLT AEKVEMTIKR IREMSKERKE DDEEDDEEDD IQIVTYSTLS
PSTSSYGSSY AV
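
The sequences above are printed in space-separated blocks of ten residues, so the whitespace has to be stripped before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-content calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, as an illustration only.
sample = "AGTAGACCGG CTACCACTAA"
seq = clean_sequence(sample)
print(len(seq))         # 20
print(gc_percent(seq))  # 50.0
```

Applied to the full gene sequence, the same two steps give the values to compare against the Gene Length and GC content fields of the record.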