Gene CNG04120 details


Gene Information

Locus tag: CNG04120
Symbol:
ID: 3258970
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 1159382
End bp: 1162765
Gene Length: 3384 bp
Protein Length: 885 aa
Translation table:
GC content: 50%
IMG OID: 638258035
Product: hypothetical protein
Protein accession: XP_572163
Protein GI: 58270014
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCGGTCTGC AACGTCTCAT CCTCATCTCC GTCTTCCTCA CAAACCTCCA CGGCTGCTCG 
GCGACCATGA CAGCCCAATC CTTCGACAAG ATCGTAAAGC TTGCGACAAA GCCGAAGAAC
GCCCCTCCTA AAGCAAAGTA CATTGATTCA CTAATCGCTG CTACCTATGC GGACGACAGC
TCGATCAACG AGATTGCGAT CGTACTCGCA CAAAGGTTGA GAGATACCAA CGGTGTGGTG
AGCATTCAAT GGTGCTCTAG AGCAAAGAGT CACCGAGTCC TGACGGCATA CTAGGTCGTG
TTCAAGGGCC TCCTTACGTT GCACCAGATG ATACGCACTG GACAGACTGA AGCTCTGCTC
GATGTTCTCG CCAGAAACGA TGTCTTGAGA CTCCGGAATA TCTATAGCCA GCGGTTCCAA
GGTACGCTTT ATCATCAGGT CTTCCCTATC GGGGCGATAT TGCCAGATGC TGATTGATTT
CACTACAGGA TACGTCCCCC CTGCTAGCAT GGGTGCTTAT GCCGACTACC TCGACAACAG
AATTAGGGTG TATAGGGATT TAAAGCGAGA CCTTATAAGA GTGCAGACAG AGTCTAACAG
GAGGAGTGAT GGGCTGGGTG CTGCTTGTAA GTCGATTTAG GTCGATCTGG ATCTTGTCCA
TATGGAGGTA GCCGGGGCGT CAATTGTCGT TTAGCTAACG GGCGTATTAG CAAAAGCAAG
AAGATTAAGA CATCTTCCTG TAGAGAAGGG TTTGCTGAGA GAAGTCAAGG TGGTGCAAAG
GTTATTGGAC AGTCTCATCA AGTGCAAGGT ACGCTCCATG AATTGTAATG AGATTGGGCT
GATGGAAATC CAGTTTTATG ACGATGATTT GAGAGACGAA AACACAGTTT TGGCATTGAG
ACTTCTTGTA AAAGACTTGC TTGTCTTGTT CCAGGCCGGT AATGAAGGTG TCTGCAACAT
CCTAGGTGTG TTTTGTACTC ACACGCACGT CCCTGGCTGA CATACCGTTG CCCATCTAAC
AGAGCACTAT TTCGAAATGT CCAAGGTCGA CGCTACCGAC TCGTTTGAAA TCTATAAATC
TTTCATCAAG CAAACCGACA AAGTCGTTGA CTACCTCTCC TTTGCCCGAA AACTTCACCA
TGTCTTAAAT GTCCCTGTGC CGAATCTCAA GCATGCCCCA ACAGGGCTCG TCAAGGCGTT
GGAAGAGTAC CTCAACGATC CCAACTTTGA GCAAAATAGA ATGGATTACA AGAGGAGTCT
GGGAGTCGTC GAGGGAGGTA GTAGGCGTCC GAGTGATACA GAGCCCACAA GGAAAGCTTC
GCCCGATAAG AGCACATCGA CATCGACCAA GGCTGCATCT CCTGCGCCGG AAGTTAAGCC
TCAAGCTCCC GCGGGAGCTT CGAAAAAGAT CCAAGATTTC TTCGAATCTA TTCAAGCGGA
CCAGCAACCT ACCATGTTTG GCGGTGCTCC TCAACAGTAT GTTTCCGAGC CCGTGTAATC
ATTTTGATTT TAGACTAAGC ATTATATATA GGATCAACTA CGCTCAGATG ACTGTCAACC
AGCATCAGCA GTTCAATCCC TTCCGCCAAT CTATGATGAT GCCCCAGCAA ACTGGATTCA
TGCAGCCTCA GATGACCGGC TTTTCTCATC CACAACAACA AGGTTTCCTC CAGCCTCAAC
AAACAGGTGC CATGGCGTTT GGAAGACAGT CCATGATGCC TATGTCTACA GGACAACCAG
GTGCAGGAGG AGAATTTGGT TTCATTCAGC CACCCCATGC GCAAGCTCAA CAGCCGCAAA
TGCAGATGCA GATGCAGCCT CAGCAAACTG GATTCCTACA GCCTCAGGCT ACTGGATTTA
ACCCTTTCAG GCAGAGTATG ATGCTTACTG GCAATGGTAT GGGTATGGGC GGTTTGAGCG
GGCCCATGTC CCAACCTTCT TCTCCTTCGC CTTTTGCCCA ACCATCTCAT CAGACACAAG
GACAAGGCCA AATTCAACGT CCAGGGTCGA CGCCTGCATT CTCTACTCCT CCTTCCAACG
GCACAGCTGC CAGTTCCAGC TCCGAGGCTA AACCTTTGAC GGCCCAGACA ACAGGCTCGA
AGAACCCCTT CGCCCCTGCA GGCGGCGCTG TTCCTCCTGT ACCTACTCTT CGATCCCAGC
ATCAGCCGCC GCAGAAGAAG CCGACAATGA ATGAGATGAT GATGGGTCTT CATACCGGTA
ATAGTGACGG AGCATGGGGT CAGCCTCAAG CGCAGCAACA GCAGACGCAA CCAGCGGATC
AAGCGGGCCA GCAAGGGAGT GCTCAGGGTA CAGGGATGTC GAGCATTGCG AGCGAGTTTG
CATCAAACAA GAACCAGACG AATGGTTCTG CCAATGCGAA CACTGGCGGT GGTGGGACGG
ACTTCTTGTC TCAATTCGGA TCTTTGTCTG TTAATCGTCC TGGTGCTTCC TCTCCATCTA
CACAAACAGC TTCCTCTTCG AACCCGCTGT CGTTCTTGTC TACAAACCCT ACAGGTAGTA
CCAGCGCGAC TTCAGGGCTC ACTTCACAGA CCACAGGCGC AAACACCAAC AGCAGTGCGA
ATGGTTTCAT TCAACCCCAA CCTACCGGGT ATGGCGGCTC TAACATTAAG CCATTCAAAC
CTTCGAGCAG CTTTGGTAAT CAGTTGATGG AGAACTTACC ACCCCTACCT GAATCTGGTG
CTGGATCAAA CCCCGGTTCT GCTGTAGCCT CACCGAGTGG CGCTCATGGA GTTGGAGCCG
TTCAGCCGCA GAGTACAGGT TTCCCCGGCT TGGGATCCTT GTCATTCCAA AACACTGGTA
ACCCTGCGGG ATCTTCAGCG GGAACTGGCA GCGGTCTGGT GCCCCAAATG ACAGGAGCTC
CCAACCCGTT TAGGCAATCT ACCATGCTTG GAGGATCGTC ATCTAATGCT GGAAGGTTGA
ACCCGCAGAT GACAGGGATG GGCGCCTTTA GCGGGTTATC GGCGTTTGGT GGACAAAACC
ATGGACAGGG ACAAGGAATA TTTGGACAAC AGCAGCAGCA ACAGCAGCAG CCGTTCCAAC
AGCAAGCCCA GCAGGGATCG TTGATCTGAT CTGATGTGAT GTCTGGATAA CTGGCGAAAG
CGTCAGACGT CAATGAGAAT GGTGTGGGAG GGAGACGTTT GTAAAAGCTT GGTCTCTTCA
AAAGTGTTGA AATATGATAC ATAGCATTGA GCGGTTTTTG TTTTGTAAGG GATCTTTTTT
CTATATACCC CCAATTTTAC CCCTTAGTAT TTCGTCGTCC GCGATACATC AGATCTACTG
TACGCCATAC GATTCTCGGT AATAGGTTCA TAGGGACTCG ATTTCAAAGG CTTTCCAGTT
CTAATTTGAT TAGAATATGA TATG
 
Protein sequence
MTAQSFDKIV KLATKPKNAP PKAKYIDSLI AATYADDSSI NEIAIVLAQR LRDTNGVVVF 
KGLLTLHQMI RTGQTEALLD VLARNDVLRL RNIYSQRFQG YVPPASMGAY ADYLDNRIRV
YRDLKRDLIR VQTESNRRSD GLGAASKARR LRHLPVEKGL LREVKVVQRL LDSLIKCKFY
DDDLRDENTV LALRLLVKDL LVLFQAGNEG VCNILEHYFE MSKVDATDSF EIYKSFIKQT
DKVVDYLSFA RKLHHVLNVP VPNLKHAPTG LVKALEEYLN DPNFEQNRMD YKRSLGVVEG
GSRRPSDTEP TRKASPDKST STSTKAASPA PEVKPQAPAG ASKKIQDFFE SIQADQQPTM
FGGAPQQINY AQMTVNQHQQ FNPFRQSMMM PQQTGFMQPQ MTGFSHPQQQ GFLQPQQTGA
MAFGRQSMMP MSTGQPGAGG EFGFIQPPHA QAQQPQMQMQ MQPQQTGFLQ PQATGFNPFR
QSMMLTGNGM GMGGLSGPMS QPSSPSPFAQ PSHQTQGQGQ IQRPGSTPAF STPPSNGTAA
SSSSEAKPLT AQTTGSKNPF APAGGAVPPV PTLRSQHQPP QKKPTMNEMM MGLHTGNSDG
AWGQPQAQQQ QTQPADQAGQ QGSAQGTGMS SIASEFASNK NQTNGSANAN TGGGGTDFLS
QFGSLSVNRP GASSPSTQTA SSSNPLSFLS TNPTGSTSAT SGLTSQTTGA NTNSSANGFI
QPQPTGYGGS NIKPFKPSSS FGNQLMENLP PLPESGAGSN PGSAVASPSG AHGVGAVQPQ
STGFPGLGSL SFQNTGNPAG SSAGTGSGLV PQMTGAPNPF RQSTMLGGSS SNAGRLNPQM
TGMGAFSGLS AFGGQNHGQG QGIFGQQQQQ QQQPFQQQAQ QGSLI
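
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, assuming the Start bp and End bp coordinates are 1-based and inclusive (which is consistent with 1162765 - 1159382 + 1 = 3384 bp). The helper names join_blocks, span_length and gc_fraction are hypothetical and not part of any database tool.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the spaces between 10-character blocks."""
    return "".join("".join(line.split()) for line in lines)


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last gene-sequence lines of this record, used for illustration only.
sample_lines = [
    "CTCGGTCTGC AACGTCTCAT CCTCATCTCC GTCTTCCTCA CAAACCTCCA CGGCTGCTCG ",
    "CTAATTTGAT TAGAATATGA TATG",
]
sample = join_blocks(sample_lines)

print(len(sample))                      # number of bases in the two sample lines
print(span_length(1159382, 1162765))    # length implied by the listed coordinates
```

Passing all gene-sequence lines to join_blocks and comparing len() of the result with span_length(1159382, 1162765) gives a quick check that the record was copied completely. Calling gc_fraction on the joined sequence can be compared with the GC content field in the same way.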