Gene CNL04160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04160 
Symbol 
ID3254809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp137655 
End bp141899 
Gene Length4245 bp 
Protein Length1414 aa 
Translation table 
GC content56% 
IMG OID638253889 
Productretrotransposon nucleocapsid protein, putative 
Protein accessionXP_567971 
Protein GI58261122 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0468988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGACG TCATCGATGA CGCCATGCTT GAGGCACGTT TCCTTCAGGT GCCGACACGC 
CCCCGCCACC CACCAACCGT CGCCAACGTC ATCGCTCCCG TCCCCGCGCC TCCACTCATT
GCCAAACTCG CCCCCGCCCC CTCCACCACC ACGAAGGGCC ACACTACGCA ACAGCTCGAC
TGGCTCGACC CCGCCAAACG GCTCCCCCTC GGTGACGCCG GCCGATCCGC CCGCGCCTAT
CTTCAAAGCA TCAACGCGTG TTTCTCATGC TGCGTTGTCG GCCATCACCG CCTTATTTGC
CCAACCCGCC CCCCCTCCAC CCCGCCCAAC GCCTCCGCGT CCGTGCCGGT CGCTAACCTC
GTCTCCCTTG CCGACGACGA TGAGTCCGAC CACCACGGCG TTTTCGCTGT CGACCCTGTC
ACAGACACTC TAGTACAGGA TGCCTCGTCT GCGCTCGCTG GGTCAGTACC TCTCATCATG
GTCAATTGCC GTTTCAAGGC TGACGGAGAC ACTGTCCCAG CACTCGTTGA TTGCGGCGCT
GGCATCAACG TCATTGACCG GGCGTACGCA GAGCAACAGG GATGGCAAGG ACGGCCGATT
ATACCGGTGG GGACCAAAAT GGCAGACAAT CGGGCGGGTC CAGTCGTAGA CCAGGAGTAT
GTAGTGGATG TAATCATTGG TGACACTACC TACAACGCTA CCCCATTCTA CGCCATGGCC
CTTGGTCCAC GATACCGCCT TATCCTCGGT TTACAGTTCT GTCGTCAACA CCGCCTATTT
GATGGGGCGG AGCGTTTAAA TCACCTCCTC AATGCAGGGG GGTCATCCTA TACATCGCTT
GTGCAACTAC AACTCAACTC CATCACACCA GTCGAATCCC CGACCGTAAG CACTGAACGC
CACTCCCACT CTGACGCCAT CCTCCGTGAA TTTGCCGACA TCCTTCCAGC CAATATCTCT
GACGTCTCCC ACTACCCGCC CATTTGTTCG TCCACCTCCC AAGTCCGCCA CCGAATAAAC
ATTCTTCCTG ATGCGATGCC TGTCGCTCGA GCTGGATTTC GAGTACCGTT AGCGTGGCGC
GACACCCTTC GACAAGAAAT CGAGAAACAC TGTACCGCAG GCCGCCTCCG TCCATCCAGT
TCCCCTTGGG CCGCCCCTGC TTTCCTCATT AAGAAAGAAA ATGGCAAATT CCGGTTCCTC
TGCGATTTTC GCGGCCTCAA TAGTGTCACG GTTAAAGATC GCACCCCGGT TCCCAACATT
GACGACATTC TCCAACGCGC CGCCCGTGGC AAGGTTTTCG CCAAACTCGA CCTTACCGAT
GCATTTTTTC AGACGCTCAT GCACGAGCCC GATATCGAGA AAACGGCAAT TAGCACTCCC
TGGGGTTTAT ACGAATGGGT TGTGATGCCG CAAGGCGCGT GCAACTGGCC GGCAACACAA
CAACGCCGCC TCAACGAGGC TTTACGTAAC CTCATCAGTG TTTGTTGTGA AGCTTATGTC
GATGATATCA TCATTTGGGG TGCGACCGAC TCTGACTTAG CGAAAAATAT TCGCGCGGTT
CTCACGGCTT TACGTAACAG CGGGTTTGTT TGCTCGCCTA GCAAGTCGAA ATTTTTCGTC
GACTCAGTCT CCTTCCTGGG CCACGTAATC TCCCCCAATC ACATTGGGCC AGATCCGAAG
AAAGTCGAAG CACTACGCGC ATGGCCATCT CCTGGTTGTG TGAAAGACCT CCGATCCTTT
CTTGGCCTTC TCCAATATTT ACGCAAATTC ATCCCACACA TCGCCACCAA GACGTCCGTT
CTCACGGCTC TTCTCCCTCC GAACAAGACA GCAGAGAAAG CGTATGAATC CCGTAAACGT
CAACTGGCTA AGGGCCTCCC AGCTGAGCGA TTAGAATCAC TGAGTTGGGT ATGGAAGTGG
ACAACGTCGG CGCAAGACGC GTTTGAGGCG CTGAAGGAAA TGGTGGCACG TATCACAGGT
CTGTCCCCCC TTTCCCATGA AGCTATCCTC GCAGGTCAAA CCAATCTCTA CCTTTTCACC
GACGCAAGCA ACACCGGCCT CGGCGCCTGG TTGGGCACGG GTCTATCCCC CGACAACGCT
CAACCTATCG CCTACGATTC CCGCTCTCTC ACCGCCGCCG AACGAAATTA TCCGGTACAC
GAAAAAGAGT TATGCGCCAT CATCCACGCC CTCAAAGAGT GGCGGCCTCT ACTTCTCGGC
GTCCCGGTGC ACGTCATGAC GGACCATGCG ACTCTCAAGT GGTTCTTTCA ACAACCAAAT
CTGTCCGAAC GTCAGAAGCG ATGGCTACTA GTACTCGCCG ATTACGACCT CCAGATTTCC
CATATTCCAG GGGCCACTAA TGTCATCGCC GACGCTTTCT CCTGGCTCCG AAACTCCGAT
GCCCACGTCA ACGCCCTCAC CATGATGGTT CTCTCACCAA ACACAGCTTT CCTGGATGCA
GTGGCTGAAG GGTATGGGCA GGACCCAGTA ATGAGCATTT GGAGGGAAGT AGACCGCTGC
CATCCGGGTG TCCGCACAGC CGAAGTCAAC GGAGCACGGG GGGTCAGGAC GGTGCTGACA
TACGAGGACC GGCTCTGCAT TCCCGACGTA CTCACCTTGC GAGAACAGTG CCTACAGGAA
TGTCACGATG CGATGGGCCA TTTCGGGGTG GAGAAAACAC TTGAACTATT GCGTTGTAAG
TACTTCTGGG ATGGTATGGC TAGTGACGTA AAGGACTTTG TCAGCACTTG CCCAGCCTGT
CAGACATCCA AAGCTACCAC CACTAAGCCT CCCGGACTAC TACACTCATT ACCAGTTCCT
CCCGCCAAAT TCTCCGACAT AGGCATAGAT TTCGTGGGGC CACTACCGCA ATCACACAGC
TTCGACTATC TCATCGTCAT TACCGATCGC CTCACCGGCT GGGTCGCTCT CATACCAACA
GTCATGACGC TCACGTCCTC CGCTTTTGCT CAACTCTACT ACGACCACTG GGTTTCTAAA
TATGGGGTAC CACAATCCAT CGTCTCAGAC CGCGATAAGT TATTCACTGC TGCGTCATGG
CGTCGGTTGA ATTCACTCCT GGGCACTAAG CTAAAGATGT CCACAGCATA CCACCCCCAG
ACCGATGGTA TATCAGAACG ATCAAACAAG ACAGTCATCC AGATCCTGCG AACCTGGACT
GACGACCAAG GCCGAAATTG GGCAGCCAAC CTACAGCGGG TCGCCTTCGC AATGAACAAC
ACCATCCGAC GCTCAACCCA CCACACCCCC GCCGAGCTCG TTTTCGGGAA ACGCCTGTCA
CTCACTCCGC CGCTTCTCCC CTCAACATCA GCTACGGACC AGTCCCTCGC CCAACCTACA
GCCTCCGAAT GGGATCTCGC TGCCCAACGC ATGGCCCTCG AAGAGGGCAT CGCTCGTGAC
GAACTGCTTC TCGCTAAGCA TCGGCAAAGT GTTCAAGCCA ACAAGCATCG TCGGCCGGAC
CCGGTCTACC GCCCGGGAGA CAAAGTCTAC TTGAACACAG CTGAGTTCCG TCACGAATAT
AAGACAGCCA CTAACCGTTC TGCGAAGTTC ATGCCCCGTT GGGAAGGCCC CTTCACCATC
CTCAAGGCCT TTCCCGAGCA ATCACTCTAT GAATTGGATG TTCCCGTCAC CTCAACACAG
TCGACGCCTC GCCGCCATGT TTCGCGCCTC AAGCCATACC GGGAGTCCGA ACAGTATCAC
CAGCACGCGG TTCCTCGCCT ACTTGACCAC CCGGCTGTTT CCTCGCCACG CATCCTCCAA
ATCCTTGAAG ACCGCACCCT CACCCCCAAG GGAAATCATC CAAAGGTCTA TTGTAAGAGG
CTAGCCTCCG ACCCAGTGAC CCACGGTCAC CACTTTCCCC AGCTTCTAAT TTCGAGCCAC
GTTTCAATTT GCTTCCGCTT CTTCCTCCAA GTTGCGCGCG TCAAATTGCA TCCGCGCTCC
CTTTCCCAGG TCACGTGCTT TATTTTTCCC CTGTTTCTCC GAGCCGCGTC TCAATTTACT
CCCGCTCCTT TCCTCCAAGT CGCGCGCGTC AAATTGCATC CGCACTCTCC CTCCCAGGTC
ACGTGCATCA TTTTACTTCC GCTTTTTCTC TTCTTGCTTT CTCTTCCATT TCTCGTCCTA
AGTCCGAGTT TCTTGTCCGA TTTTTCATGC CACAAGCCCC GGTGTTTACC GGGCTTTGGT
CGTGACTGGT TTCGAGAGCG AAGGCAAAGT GCATGGAAAG GTTGA
 
Protein sequence
MMDVIDDAML EARFLQVPTR PRHPPTVANV IAPVPAPPLI AKLAPAPSTT TKGHTTQQLD 
WLDPAKRLPL GDAGRSARAY LQSINACFSC CVVGHHRLIC PTRPPSTPPN ASASVPVANL
VSLADDDESD HHGVFAVDPV TDTLVQDASS ALAGSVPLIM VNCRFKADGD TVPALVDCGA
GINVIDRAYA EQQGWQGRPI IPVGTKMADN RAGPVVDQEY VVDVIIGDTT YNATPFYAMA
LGPRYRLILG LQFCRQHRLF DGAERLNHLL NAGGSSYTSL VQLQLNSITP VESPTVSTER
HSHSDAILRE FADILPANIS DVSHYPPICS STSQVRHRIN ILPDAMPVAR AGFRVPLAWR
DTLRQEIEKH CTAGRLRPSS SPWAAPAFLI KKENGKFRFL CDFRGLNSVT VKDRTPVPNI
DDILQRAARG KVFAKLDLTD AFFQTLMHEP DIEKTAISTP WGLYEWVVMP QGACNWPATQ
QRRLNEALRN LISVCCEAYV DDIIIWGATD SDLAKNIRAV LTALRNSGFV CSPSKSKFFV
DSVSFLGHVI SPNHIGPDPK KVEALRAWPS PGCVKDLRSF LGLLQYLRKF IPHIATKTSV
LTALLPPNKT AEKAYESRKR QLAKGLPAER LESLSWVWKW TTSAQDAFEA LKEMVARITG
LSPLSHEAIL AGQTNLYLFT DASNTGLGAW LGTGLSPDNA QPIAYDSRSL TAAERNYPVH
EKELCAIIHA LKEWRPLLLG VPVHVMTDHA TLKWFFQQPN LSERQKRWLL VLADYDLQIS
HIPGATNVIA DAFSWLRNSD AHVNALTMMV LSPNTAFLDA VAEGYGQDPV MSIWREVDRC
HPGVRTAEVN GARGVRTVLT YEDRLCIPDV LTLREQCLQE CHDAMGHFGV EKTLELLRCK
YFWDGMASDV KDFVSTCPAC QTSKATTTKP PGLLHSLPVP PAKFSDIGID FVGPLPQSHS
FDYLIVITDR LTGWVALIPT VMTLTSSAFA QLYYDHWVSK YGVPQSIVSD RDKLFTAASW
RRLNSLLGTK LKMSTAYHPQ TDGISERSNK TVIQILRTWT DDQGRNWAAN LQRVAFAMNN
TIRRSTHHTP AELVFGKRLS LTPPLLPSTS ATDQSLAQPT ASEWDLAAQR MALEEGIARD
ELLLAKHRQS VQANKHRRPD PVYRPGDKVY LNTAEFRHEY KTATNRSAKF MPRWEGPFTI
LKAFPEQSLY ELDVPVTSTQ STPRRHVSRL KPYRESEQYH QHAVPRLLDH PAVSSPRILQ
ILEDRTLTPK GNHPKVYCKR LASDPVTHGH HFPQLLISSH VSICFRFFLQ VARVKLHPRS
LSQVTCFIFP LFLRAASQFT PAPFLQVARV KLHPHSPSQV TCIILLPLFL FLLSLPFLVL
SPSFLSDFSC HKPRCLPGFG RDWFRERRQS AWKG