Gene CNI04020 details


Gene Information

Locus tag: CNI04020
Symbol:
ID: 3259414
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 1072024
End bp: 1074508
Gene Length: 2485 bp
Protein Length: 778 aa
Translation table:
GC content: 51%
IMG OID: 638258897
Product: hypothetical protein
Protein accession: XP_572614
Protein GI: 58270916
COG category:
COG ID:
TIGRFAM ID:
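
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the coding length assumes three nucleotides per residue plus one stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 1072024
end_bp = 1074508
protein_length_aa = 778

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: 3 nucleotides per amino acid, plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that the coding sequence does not account for
# (introns and/or untranslated flanking sequence).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2485
print(coding_bp)       # 2337
print(non_coding_bp)   # 148
```

Under these assumptions the computed span matches the listed Gene Length of 2485 bp, and the 148 bp left over is consistent with the record marking the gene as spliced.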


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.189745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTCCGCCTA CAGAAACCAC AATGGACCAC CTAGTCCTCG ACCCCGTCAT AGATCCAGCA 
CTCGCAGACA ACTCACCGCA GCATACAGAC GACCACCACC AGCCTCTGCA GAACTCAGCA
TCCACCTCAC CAGTTGCCCA GCCCTCCCAA CGCACAACAA AAACAGGTCG TCCATCCAAG
GCCCGCGGCA CTGTTCCCGG ATCCGCTGCC GACTTTAGAC GACGGGAAGC GAACCGTCTA
GCAGCAGACA GGTCGAGAAG TAGACAGGCT GAAAAGGGTG CGGCGCTTGA GAATGCTTCG
CGTATATTGT CAGAGGAGAA TGCGAGACTG AAGGAGCAGA TCGCTGCGTT AGAAGGTGGA
ATCGAAGGAA GCGATTTCCA ACATGGAGTT CAGGATGCTG AACATCAGCC TACTCCTGGC
CCCTCAAGAA CGGAAGAACC ACATGACGAT GATCAGCAGG TCTCACAAGA AAGTGATACT
CAGGCTCAAG AGCAAGAAGC GCACTCTCAC ACTATTCTGG CCGCGCTCAC GGACATTACT
GGTGTCGACT TTTCAGAAGG CAACGAAGCA AACTGGATGC AAGGGATGGA GACCTTCCTG
AAAGATTCTG AAAGTGGAAG ATTAGGGGAG CTCGCGGCTG TCGCCATGGG ACATGATGAC
CCGCAGTTGC AACATCAGGA AAAGGCTGAC GCGCCCCATA CCTCTCCCTT CAAAATCGGC
AACATGTCTA TCCCATTTCC ACTTCCGGGG CAGATGTCGA TTCCTGGAGC CAGTGCAGCC
ATTGGCCTCG CTGCAGCTCT CAATACGGAG ATGGAGAGGA TTATCATGGA AGACCTTGCT
CTGACCAAAG CGGCCATCGT CAGAGTCGAG AATCAGATAT CCCATTTGCG AGCCCATCCC
GATGCCGACA TTGATGCAGC TAGCAACGAC TCGTCCATTG CGCCTCTTTT GCCCAAGGAC
ATCTTCTCCG AAGATTTGGA AGCGTTGCAA GTGGTTCAGA CAGATATACA AAACTCTATC
TCTTACCTTG AAAATACGTT GCCTCCCGTC AGGGATGAAT TTGTGAGGAC GCGCGATGAC
AAGACAGCGG AGGAAAAGAG GATCATAGAC CTTGTGAAGG AAGTTAAAGA ACTAGAGGTA
AGAGATGAGG ATCAAAAAGA AAAAGTCTTG TCGAACCTGA GATCTGTTGG CACGTTTGTG
GAGAATTTGC TGACAGAGGA GCATGTAAGT GAATCGGCCG TTTTGCCATC AAAGCTGACT
TTTTTGTAGC CCGATAATCA ATACCTGACC GGTGCATTCT CTTCTCCCGC GCTTGCTCGT
CGACGTAGAG GTAGACCGCC AAAGGGCGAA GTCTCTCGTA CATTCTACCA ATCTTACCTC
ATTGGACCCC TTTCTCAAGA AGTCGACCCC AAGGGCAAAG GTAAAGCTCC AACAGCCAAG
AATCCTCGTA GAAGCAGATT GGGAGAATCA CCTCTCACCG GGCACGCTTC TTCCCATGAA
CAAGATGACC ATCATGTTGC TGAATCTTCT CAATCCGCAC AAGCACACCT TCCCGACTCT
CACGAACCGG GGACCCTACA GGCAGCTCAA CATGACCAAG ATCATCCCGA TGAGCACGAT
ACGACCGAAG CTGTTAACCG TGCCGAGGCG TATATTCTCT CTCATCTCAA TGCATCCAAC
GCTACGGATG GCCAAGACGC AGATCCTCAT CCAGAACGCG AAGATGCTAT CGGACTCGAA
TCTACTTCCT TTGCTGATTT CTTGCCCGCC CAGGAAGCAT TAGAACGACA GGTCGAGGGA
CCCGCAAGTC AACATGGCAA TGCTGATGCT TTCGGTCAGT CTGCTTCCTC AAACGGCCAA
AATCAGGAGT TAGTCCATGT TAGTCAGGGT GACCACGATG TCCCTCTGTC TGTTCTTTCG
CGACTCAAGC AGGGTCCTCC TGGGAGTTGC GATATCTGCA TGAGAACGGA AACGACTGTG
TGGAGGAAAT TAGTGCTGGG AGGTATCGAC CACAAAGTGT GCAACGGTTA GTAGAATATG
CTTCTAAACT GGTCACCAAA GCTGATTATG GTATAAGCCT GTGGCCTGTA CCACTCGAAA
TTCGGTGTCA TCCGTCCCCC GGAACTCTGG GGAGATGGTA AGTCCCTGAA AAAGCGTCGT
TCCACGAGAC CCGCAACGGA TGAAGAGGAT GCAGATGCAC ACGCGAAAAA GAAGGTAAAA
AAGAGCATTG GAGAGGGTTC GGAGCAACTT GAGTCGCTTG ATGACGAGCA CCAAGAGCTT
GGGAATATGG TAGAAAACCA CTTGTCTGCT CAGCAAAGAG AGGAAGAAGA GGCGGCGACA
GTAACGGCAG GTGAAAATGT AGAAGGGACG ATAGTTGATA GGAGTGTAAT CCAAGGGGCT
GTTCAACCTG GGATAGGTAT GGATGAAGCA GAGGGAAATG TGTTTGAGGT GTGATGATTA
AGAAGTGACT TTGCGTTGTA CTTCT
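
The Gene Length and GC content fields can be recomputed from a sequence printed in this layout. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the input is assumed to be plain A/C/G/T text in space-separated blocks. Only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = (
    "CGTCCGCCTA CAGAAACCAC AATGGACCAC CTAGTCCTCG ACCCCGTCAT AGATCCAGCA"
)

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence instead of `first_line` gives a length and GC fraction that can be compared against the Gene Length and GC content fields of the record.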
 
Protein sequence
MDHLVLDPVI DPALADNSPQ HTDDHHQPLQ NSASTSPVAQ PSQRTTKTGR PSKARGTVPG 
SAADFRRREA NRLAADRSRS RQAEKGAALE NASRILSEEN ARLKEQIAAL EGGIEGSDFQ
HGVQDAEHQP TPGPSRTEEP HDDDQQVSQE SDTQAQEQEA HSHTILAALT DITGVDFSEG
NEANWMQGME TFLKDSESGR LGELAAVAMG HDDPQLQHQE KADAPHTSPF KIGNMSIPFP
LPGQMSIPGA SAAIGLAAAL NTEMERIIME DLALTKAAIV RVENQISHLR AHPDADIDAA
SNDSSIAPLL PKDIFSEDLE ALQVVQTDIQ NSISYLENTL PPVRDEFVRT RDDKTAEEKR
IIDLVKEVKE LEVRDEDQKE KVLSNLRSVG TFVENLLTEE HPDNQYLTGA FSSPALARRR
RGRPPKGEVS RTFYQSYLIG PLSQEVDPKG KGKAPTAKNP RRSRLGESPL TGHASSHEQD
DHHVAESSQS AQAHLPDSHE PGTLQAAQHD QDHPDEHDTT EAVNRAEAYI LSHLNASNAT
DGQDADPHPE REDAIGLEST SFADFLPAQE ALERQVEGPA SQHGNADAFG QSASSNGQNQ
ELVHVSQGDH DVPLSVLSRL KQGPPGSCDI CMRTETTVWR KLVLGGIDHK VCNACGLYHS
KFGVIRPPEL WGDGKSLKKR RSTRPATDEE DADAHAKKKV KKSIGEGSEQ LESLDDEHQE
LGNMVENHLS AQQREEEEAA TVTAGENVEG TIVDRSVIQG AVQPGIGMDE AEGNVFEV