Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL04160 |
Symbol | |
ID | 3254809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | + |
Start bp | 137655 |
End bp | 141899 |
Gene Length | 4245 bp |
Protein Length | 1414 aa |
Translation table | |
GC content | 56% |
IMG OID | 638253889 |
Product | retrotransposon nucleocapsid protein, putative |
Protein accession | XP_567971 |
Protein GI | 58261122 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0468988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGACG TCATCGATGA CGCCATGCTT GAGGCACGTT TCCTTCAGGT GCCGACACGC CCCCGCCACC CACCAACCGT CGCCAACGTC ATCGCTCCCG TCCCCGCGCC TCCACTCATT GCCAAACTCG CCCCCGCCCC CTCCACCACC ACGAAGGGCC ACACTACGCA ACAGCTCGAC TGGCTCGACC CCGCCAAACG GCTCCCCCTC GGTGACGCCG GCCGATCCGC CCGCGCCTAT CTTCAAAGCA TCAACGCGTG TTTCTCATGC TGCGTTGTCG GCCATCACCG CCTTATTTGC CCAACCCGCC CCCCCTCCAC CCCGCCCAAC GCCTCCGCGT CCGTGCCGGT CGCTAACCTC GTCTCCCTTG CCGACGACGA TGAGTCCGAC CACCACGGCG TTTTCGCTGT CGACCCTGTC ACAGACACTC TAGTACAGGA TGCCTCGTCT GCGCTCGCTG GGTCAGTACC TCTCATCATG GTCAATTGCC GTTTCAAGGC TGACGGAGAC ACTGTCCCAG CACTCGTTGA TTGCGGCGCT GGCATCAACG TCATTGACCG GGCGTACGCA GAGCAACAGG GATGGCAAGG ACGGCCGATT ATACCGGTGG GGACCAAAAT GGCAGACAAT CGGGCGGGTC CAGTCGTAGA CCAGGAGTAT GTAGTGGATG TAATCATTGG TGACACTACC TACAACGCTA CCCCATTCTA CGCCATGGCC CTTGGTCCAC GATACCGCCT TATCCTCGGT TTACAGTTCT GTCGTCAACA CCGCCTATTT GATGGGGCGG AGCGTTTAAA TCACCTCCTC AATGCAGGGG GGTCATCCTA TACATCGCTT GTGCAACTAC AACTCAACTC CATCACACCA GTCGAATCCC CGACCGTAAG CACTGAACGC CACTCCCACT CTGACGCCAT CCTCCGTGAA TTTGCCGACA TCCTTCCAGC CAATATCTCT GACGTCTCCC ACTACCCGCC CATTTGTTCG TCCACCTCCC AAGTCCGCCA CCGAATAAAC ATTCTTCCTG ATGCGATGCC TGTCGCTCGA GCTGGATTTC GAGTACCGTT AGCGTGGCGC GACACCCTTC GACAAGAAAT CGAGAAACAC TGTACCGCAG GCCGCCTCCG TCCATCCAGT TCCCCTTGGG CCGCCCCTGC TTTCCTCATT AAGAAAGAAA ATGGCAAATT CCGGTTCCTC TGCGATTTTC GCGGCCTCAA TAGTGTCACG GTTAAAGATC GCACCCCGGT TCCCAACATT GACGACATTC TCCAACGCGC CGCCCGTGGC AAGGTTTTCG CCAAACTCGA CCTTACCGAT GCATTTTTTC AGACGCTCAT GCACGAGCCC GATATCGAGA AAACGGCAAT TAGCACTCCC TGGGGTTTAT ACGAATGGGT TGTGATGCCG CAAGGCGCGT GCAACTGGCC GGCAACACAA CAACGCCGCC TCAACGAGGC TTTACGTAAC CTCATCAGTG TTTGTTGTGA AGCTTATGTC GATGATATCA TCATTTGGGG TGCGACCGAC TCTGACTTAG CGAAAAATAT TCGCGCGGTT CTCACGGCTT TACGTAACAG CGGGTTTGTT TGCTCGCCTA GCAAGTCGAA ATTTTTCGTC GACTCAGTCT CCTTCCTGGG CCACGTAATC TCCCCCAATC ACATTGGGCC AGATCCGAAG AAAGTCGAAG CACTACGCGC ATGGCCATCT CCTGGTTGTG TGAAAGACCT CCGATCCTTT CTTGGCCTTC TCCAATATTT ACGCAAATTC ATCCCACACA TCGCCACCAA GACGTCCGTT CTCACGGCTC TTCTCCCTCC GAACAAGACA GCAGAGAAAG CGTATGAATC CCGTAAACGT CAACTGGCTA AGGGCCTCCC AGCTGAGCGA TTAGAATCAC TGAGTTGGGT ATGGAAGTGG ACAACGTCGG CGCAAGACGC GTTTGAGGCG CTGAAGGAAA TGGTGGCACG TATCACAGGT CTGTCCCCCC TTTCCCATGA AGCTATCCTC GCAGGTCAAA CCAATCTCTA CCTTTTCACC GACGCAAGCA ACACCGGCCT CGGCGCCTGG TTGGGCACGG GTCTATCCCC CGACAACGCT CAACCTATCG CCTACGATTC CCGCTCTCTC ACCGCCGCCG AACGAAATTA TCCGGTACAC GAAAAAGAGT TATGCGCCAT CATCCACGCC CTCAAAGAGT GGCGGCCTCT ACTTCTCGGC GTCCCGGTGC ACGTCATGAC GGACCATGCG ACTCTCAAGT GGTTCTTTCA ACAACCAAAT CTGTCCGAAC GTCAGAAGCG ATGGCTACTA GTACTCGCCG ATTACGACCT CCAGATTTCC CATATTCCAG GGGCCACTAA TGTCATCGCC GACGCTTTCT CCTGGCTCCG AAACTCCGAT GCCCACGTCA ACGCCCTCAC CATGATGGTT CTCTCACCAA ACACAGCTTT CCTGGATGCA GTGGCTGAAG GGTATGGGCA GGACCCAGTA ATGAGCATTT GGAGGGAAGT AGACCGCTGC CATCCGGGTG TCCGCACAGC CGAAGTCAAC GGAGCACGGG GGGTCAGGAC GGTGCTGACA TACGAGGACC GGCTCTGCAT TCCCGACGTA CTCACCTTGC GAGAACAGTG CCTACAGGAA TGTCACGATG CGATGGGCCA TTTCGGGGTG GAGAAAACAC TTGAACTATT GCGTTGTAAG TACTTCTGGG ATGGTATGGC TAGTGACGTA AAGGACTTTG TCAGCACTTG CCCAGCCTGT CAGACATCCA AAGCTACCAC CACTAAGCCT CCCGGACTAC TACACTCATT ACCAGTTCCT CCCGCCAAAT TCTCCGACAT AGGCATAGAT TTCGTGGGGC CACTACCGCA ATCACACAGC TTCGACTATC TCATCGTCAT TACCGATCGC CTCACCGGCT GGGTCGCTCT CATACCAACA GTCATGACGC TCACGTCCTC CGCTTTTGCT CAACTCTACT ACGACCACTG GGTTTCTAAA TATGGGGTAC CACAATCCAT CGTCTCAGAC CGCGATAAGT TATTCACTGC TGCGTCATGG CGTCGGTTGA ATTCACTCCT GGGCACTAAG CTAAAGATGT CCACAGCATA CCACCCCCAG ACCGATGGTA TATCAGAACG ATCAAACAAG ACAGTCATCC AGATCCTGCG AACCTGGACT GACGACCAAG GCCGAAATTG GGCAGCCAAC CTACAGCGGG TCGCCTTCGC AATGAACAAC ACCATCCGAC GCTCAACCCA CCACACCCCC GCCGAGCTCG TTTTCGGGAA ACGCCTGTCA CTCACTCCGC CGCTTCTCCC CTCAACATCA GCTACGGACC AGTCCCTCGC CCAACCTACA GCCTCCGAAT GGGATCTCGC TGCCCAACGC ATGGCCCTCG AAGAGGGCAT CGCTCGTGAC GAACTGCTTC TCGCTAAGCA TCGGCAAAGT GTTCAAGCCA ACAAGCATCG TCGGCCGGAC CCGGTCTACC GCCCGGGAGA CAAAGTCTAC TTGAACACAG CTGAGTTCCG TCACGAATAT AAGACAGCCA CTAACCGTTC TGCGAAGTTC ATGCCCCGTT GGGAAGGCCC CTTCACCATC CTCAAGGCCT TTCCCGAGCA ATCACTCTAT GAATTGGATG TTCCCGTCAC CTCAACACAG TCGACGCCTC GCCGCCATGT TTCGCGCCTC AAGCCATACC GGGAGTCCGA ACAGTATCAC CAGCACGCGG TTCCTCGCCT ACTTGACCAC CCGGCTGTTT CCTCGCCACG CATCCTCCAA ATCCTTGAAG ACCGCACCCT CACCCCCAAG GGAAATCATC CAAAGGTCTA TTGTAAGAGG CTAGCCTCCG ACCCAGTGAC CCACGGTCAC CACTTTCCCC AGCTTCTAAT TTCGAGCCAC GTTTCAATTT GCTTCCGCTT CTTCCTCCAA GTTGCGCGCG TCAAATTGCA TCCGCGCTCC CTTTCCCAGG TCACGTGCTT TATTTTTCCC CTGTTTCTCC GAGCCGCGTC TCAATTTACT CCCGCTCCTT TCCTCCAAGT CGCGCGCGTC AAATTGCATC CGCACTCTCC CTCCCAGGTC ACGTGCATCA TTTTACTTCC GCTTTTTCTC TTCTTGCTTT CTCTTCCATT TCTCGTCCTA AGTCCGAGTT TCTTGTCCGA TTTTTCATGC CACAAGCCCC GGTGTTTACC GGGCTTTGGT CGTGACTGGT TTCGAGAGCG AAGGCAAAGT GCATGGAAAG GTTGA
|
Protein sequence | MMDVIDDAML EARFLQVPTR PRHPPTVANV IAPVPAPPLI AKLAPAPSTT TKGHTTQQLD WLDPAKRLPL GDAGRSARAY LQSINACFSC CVVGHHRLIC PTRPPSTPPN ASASVPVANL VSLADDDESD HHGVFAVDPV TDTLVQDASS ALAGSVPLIM VNCRFKADGD TVPALVDCGA GINVIDRAYA EQQGWQGRPI IPVGTKMADN RAGPVVDQEY VVDVIIGDTT YNATPFYAMA LGPRYRLILG LQFCRQHRLF DGAERLNHLL NAGGSSYTSL VQLQLNSITP VESPTVSTER HSHSDAILRE FADILPANIS DVSHYPPICS STSQVRHRIN ILPDAMPVAR AGFRVPLAWR DTLRQEIEKH CTAGRLRPSS SPWAAPAFLI KKENGKFRFL CDFRGLNSVT VKDRTPVPNI DDILQRAARG KVFAKLDLTD AFFQTLMHEP DIEKTAISTP WGLYEWVVMP QGACNWPATQ QRRLNEALRN LISVCCEAYV DDIIIWGATD SDLAKNIRAV LTALRNSGFV CSPSKSKFFV DSVSFLGHVI SPNHIGPDPK KVEALRAWPS PGCVKDLRSF LGLLQYLRKF IPHIATKTSV LTALLPPNKT AEKAYESRKR QLAKGLPAER LESLSWVWKW TTSAQDAFEA LKEMVARITG LSPLSHEAIL AGQTNLYLFT DASNTGLGAW LGTGLSPDNA QPIAYDSRSL TAAERNYPVH EKELCAIIHA LKEWRPLLLG VPVHVMTDHA TLKWFFQQPN LSERQKRWLL VLADYDLQIS HIPGATNVIA DAFSWLRNSD AHVNALTMMV LSPNTAFLDA VAEGYGQDPV MSIWREVDRC HPGVRTAEVN GARGVRTVLT YEDRLCIPDV LTLREQCLQE CHDAMGHFGV EKTLELLRCK YFWDGMASDV KDFVSTCPAC QTSKATTTKP PGLLHSLPVP PAKFSDIGID FVGPLPQSHS FDYLIVITDR LTGWVALIPT VMTLTSSAFA QLYYDHWVSK YGVPQSIVSD RDKLFTAASW RRLNSLLGTK LKMSTAYHPQ TDGISERSNK TVIQILRTWT DDQGRNWAAN LQRVAFAMNN TIRRSTHHTP AELVFGKRLS LTPPLLPSTS ATDQSLAQPT ASEWDLAAQR MALEEGIARD ELLLAKHRQS VQANKHRRPD PVYRPGDKVY LNTAEFRHEY KTATNRSAKF MPRWEGPFTI LKAFPEQSLY ELDVPVTSTQ STPRRHVSRL KPYRESEQYH QHAVPRLLDH PAVSSPRILQ ILEDRTLTPK GNHPKVYCKR LASDPVTHGH HFPQLLISSH VSICFRFFLQ VARVKLHPRS LSQVTCFIFP LFLRAASQFT PAPFLQVARV KLHPHSPSQV TCIILLPLFL FLLSLPFLVL SPSFLSDFSC HKPRCLPGFG RDWFRERRQS AWKG
|
| |