Gene CNL03970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL03970 
Symbol 
ID3254818 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp87464 
End bp91176 
Gene Length3713 bp 
Protein Length1013 aa 
Translation table 
GC content48% 
IMG OID638253869 
Productnucleus protein, putative 
Protein accessionXP_567953 
Protein GI58261086 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.730253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCAGGGATC GTAGTATGGC TATGACAGCC CTACATCCCA CAGTTTCATC TTCAGATATA 
CACCTCGACT CCCCACACAG CTATACGCAC TCCGGATCCA CCCTCCTGCC CGACCCATCC
GCTTCCCCAT CCCCTAGTAC CTCTCACGCT CCCCTACCCG GCGCATCAGC AGGGGACATG
CAAGCAGAAG CACCTGAATT GGTAGAAGAG GTTCCAGAAG GAGAATCGGA AGGAATGGGA
GACCAAATCA TTCTTGATTC ACAGCCAGAG GTTGACGTTG AAGACCTTAC GGGGGAGATG
GAATACGAGA ACTTGAGGTA TCGGACGGAG AGAGAGCGGA AACGCGTTAA GGTATGTATT
CTTCTGTCTC TACTTACTCG TATAACAACG CTGGCCATTC TGTGTCTTGC ATAATAACTG
ATGTTCGTGG CGTTCCAATT CAGGTGTACG AGCTTCGTGA CGAGTCATGG TTCGACCGAG
GGACAGGCAT TTGTCGTGGG ATGATCAATG CGGATGGCCA TGCAGTCATT TTAGTGGAGG
CTGAATCACC TCAAGTTCAG GAGAATGAAG ATGAGCCTGG AGGATTCTTG ACCAAAGATA
TCTTGTTGAA TTCCAATGTT GAAAGGGATG ATATCTACGG CAAACAACAA GGTTTGTTTC
CTGCAAGTAC GCGTTCCGAG TCTGACAATC TATCTGGTAG ATACTCTGAT AGTTTGGACG
GATCCAGAAT CGAAACTTGA TATTGCTCTT TCCTTTCAGG ACGCCGATGG ATGCGAGGAT
ACTTGGCAAT TTATATGTGA AGTTCAAAAG CACCTCATCA GCGTCGGTAC GTTTTTTTCA
CTGAGGAAAT ATGTTACCGT TGCTAATGCT TATCTTTTGG CGGTTTGAAA GAGGATGAAA
CGCAGGTGCC ATCATCTTCA TCGCCCATTG GTGGTTCTCC AATGATGGTT GCCAACGCGC
ACATGGTCAA TGCCGAGCAC AAGCTACCGT GGCAGCCTCC TACGTTGGCA AACATTAGGT
AAATCTCGTT TCTGACAGGG AAAATAAAGT TGTATTAACG GATGTACAGG GAACAAGAGT
TTTGCATAAG AGCACAGGCC AAGTCGGCAA TGGGTCGGGA GAGAGCAATG GAGCATATCC
TGAACGAGGC AAGTTGGATG TAATCCGCAG CAAGAATGCT CTGACGTCTT GTTGTAGGAT
TATATCAAGC AATTGATCAA TGTTCTCGAG CAAGCGGAAG ACTTGGAAAG CTTGGATGAT
TTGCACGCTT TATGTTCCTT GATGCAGACG ATCTGTGAGT GCCGTCGTTC TCCTGTCAAG
GCAAAGCTGA CGTGAATCAG TATTGTTTAA CGACAATGGT ATATTCGAAT ATATTCTTCA
GGATGATGTA TTCCTCGGTG TTATTGGAAT GCTTGAATGT GAGTTCTAAT AGCATAATTA
TGACTAACGC GGCAGACGAT CCCGAATTCC CCGAACTCAA AGCCACCTAT CGTCAATATT
TTCAAGAAAA CGCTCGTTTC CGAGAAGTCG TTCCCATCCC CGACCCTATT ATCTGCAACA
AGGTCCATCA GACATACCGC CTCTTATTTC TCAAAGACGT CGTGCTTGCC CGTGTACTTG
ATGATTCAGC ATTCAATATT CTTAATGGTT TCATCTTTTT CAACCAAGTG GATATCATCA
ACTACATCCA ACAGAGCGAT GGCTTCCTCA CGCAATTATT CGAAGCCTTC CGTGATCCCT
TACCTCCCCC TCCTCCTAAA GATACACCTC CAGAACCGCT CGATGATAAG AAACGTGACA
CAGTCATGTT CCTCCATCAA CTTGTCATGA TGGGCAAGTC AATTCAACTT CCTCCACGTT
TACAACTCTA TCGGACCCTG GTCGACCGCG GACTTTTACG CGTTATCGAA TGGTCTTTCC
GCCGTCCTGA AGCAAAGATT TTACATGCGG GTGCGGAGAT GTTGACCCTT GTGGTGGAGC
ATGATGCGTC GTCGGTGAGA AGTTATGTTT TCAAGGAGCA GGAGCAGAAG GAGCGGACGT
TGGTGAAGGA GATTATTGAG TTATTGCACA AAACGACGAA TGCGGGGTTA ATGGGACAGA
TGGCGGATAC GCTGAAGACG ATGTTGGAGG TTCCCCCGGA TAATGAGGTT TGTCGATCAT
CTTTCCGTAA GGTTGATTAG GAAATGGCAT TGACACAGCT ATAGTCATTC ATGGCGAAGA
AGGAAGGGCC TCTGGCGGAA CAGTTCATGA CTCATTTCTA TGAGACTTGG GCTACGTATC
TCTTCAAACC GTTATTGGAT ATCCCAGATT ACAAAACTGA ACAGCCCACG AGTATATCAC
TGTTATTTGG AAATCAAAGT CTCAGCTGAC GGGACGTATA GCAAAATTAA CTAGAGAGTA
TACTTCGCTT CTTCAAAATC TCGTTGAACT CTTATCGTAC TGTCTTCTGA ATCACCCTCA
TAAAGGCTCG TACTTTATAT TGTCAAATCC GATATCGAAA AAGGTCGTCG CATTATTGTA
CATTCGGGAT AAGCCTTTGA GACATGGTTC GTTTTTCTTC TTCTGAAGTG TTCAGAGTGG
GCATTGACTT GTTATTGAAA ATTTAGCCGC TCTGCGTTTC CTCAAGGCTT GTCTGAGAAC
GCCCAATCAC TTTATTCATC GACATTTTGT CAAGAACGAT CTGTTAGGAC CTCTTTTGAT
GTTACTGGAG GAGGAGAGTT TAAGGGATAA TATGATGAGT TCTGCCTGTA TGGAAGTCGT
CGAGCAGATC CGCAAGGTAA ATAAATCGTC CGTTGTATAG CCCTTGTCAA GCTTATACGT
GATACAGGAC AATTTGAAGA CTGTCATCAA CTATCTCTTT GAAAACTATA CACCCCGTCT
CGAAGCACTT TCGCGTCGGC CGCTCATGCG GGGTATCATG ATGGGAATTC GATCACGCTG
GGAGATGAAC AACGAGCCTA CACCAAGCAT GCCCCTCGTC GCTGCTACAT CAGCTACATC
GATCGGCGGA GAAGACGGCT GGGTGAACGA GGAGAAGAAG GAGGATGATT ACTTTAACGG
ATCGGACGAT GAAACGGATA GGACGGTGGT CGACGATACG GAGAATGTGA TAGGGGAAGA
AGAAGGAGAG GAAGGAGCCG TGCCTGCAAA GAGGAAGAGG CTGCAAAGTG GCGGTGGACC
GAAAAAACGG GCCCAGCGAA CGGGCAGTGC GCTCGGATTG GATTATGACG ATAATTCAGA
CCCAGAATCA CCAGTCTCCA CACCACAACA TACCGAACAC TCTTCTTCTT CCACCCCTGT
GTTAACCACC ACGACATCTC TCCTTGAACG AACAGTCTCT AGAGCGCAGG CAGTCGCCGA
AAAGGAGAAG AACACATCAG AGCTGGAAGA AGATCTGGGA GACGTGCAGG CTAAAATGCG
GGAAAAGCGA CGTCGGGAGG AAGAGGAAGA AGAGGAAGGC GGGTTTGCGG GGTTGTTGGT
TGGTGCCAAG CCGCAGCCGG TAGCGACTGT CGCCGCAAGT GCAGCGAGTG GAGGCGGGGC
GATAGGAGCA GGAAAGGGCG AAGAGGGAAA GGACGATGAG CGAGATACGG CTATGCAGAG
TGAGGTGTCT ACGACGGGGG AAGGAAAGAA GGGGTTGAAG GATATGGGCA AAAAGATACG
GCTGAATTTT GGGCTGGGTA AGAAGTTTAG CAAGTAGGTA ATTGTGCATG TAA
 
Protein sequence
MAMTALHPTV SSSDIHLDSP HSYTHSGSTL LPDPSASPSP STSHAPLPGA SAGDMQAEAP 
ELVEEVPEGE SEGMGDQIIL DSQPEVDVED LTGEMEYENL RYRTERERKR VKVYELRDES
WFDRGTGICR GMINADGHAV ILVEAESPQV QENEDEPGGF LTKDILLNSN VERDDIYGKQ
QDTLIVWTDP ESKLDIALSF QDADGCEDTW QFICEVQKHL ISVEDETQVP SSSSPIGGSP
MMVANAHMVN AEHKLPWQPP TLANIREQEF CIRAQAKSAM GRERAMEHIL NEATEDLESL
DDLHALCSLM QTILLFNDNG IFEYILQDDV FLGVIGMLEY DPEFPELKAT YRQYFQENAR
FREVVPIPDP IICNKVHQTY RLLFLKDVVL ARVLDDSAFN ILNGFIFFNQ VDIINYIQQS
DGFLTQLFEA FRDPLPPPPP KDTPPEPLDD KKRDTVMFLH QLVMMGKSIQ LPPRLQLYRT
LVDRGLLRVI EWSFRRPEAK ILHAGAEMLT LVVEHDASSV RSYVFKEQEQ KERTLVKEII
ELLHKTTNAG LMGQMADTLK TMLEVPPDNE SFMAKKEGPL AEQFMTHFYE TWATYLFKPL
LDIPDYKTEQ PTTKLTREYT SLLQNLVELL SYCLLNHPHK GSYFILSNPI SKKVVALLYI
RDKPLRHAAL RFLKACLRTP NHFIHRHFVK NDLLGPLLML LEEESLRDNM MSSACMEVVE
QIRKDNLKTV INYLFENYTP RLEALSRRPL MRGIMMGIRS RWEMNNEPTP SMPLVAATSA
TSIGGEDGWV NEEKKEDDYF NGSDDETDRT VVDDTENVIG EEEGEEGAVP AKRKRLQSGG
GPKKRAQRTG SALGLDYDDN SDPESPVSTP QHTEHSSSST PVLTTTTSLL ERTVSRAQAV
AEKEKNTSEL EEDLGDVQAK MREKRRREEE EEEEGGFAGL LVGAKPQPVA TVAASAASGG
GAIGAGKGEE GKDDERDTAM QSEVSTTGEG KKGLKDMGKK IRLNFGLGKK FSK