Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF03140 |
Symbol | |
ID | 3258200 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 928874 |
End bp | 933660 |
Gene Length | 4787 bp |
Protein Length | 1484 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257431 |
Product | retrotransposon nucleocapsid protein, putative |
Protein accession | XP_571377 |
Protein GI | 58268442 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGGTC CATCCACTCG TAGTGGTCGA GCTGTGGAAA AGGTCGAAAA AGGCAAGGAA GTAGAACATA CCCTCGATGA CGACCACAAT CCCGCGGGTT CATTCAACAC CAATCCTCAA TCCGATCCAT TTACCGCCGA CAACACATCA AACGACACCC AAACCCAACT CGAGATGATG CGCAACCATA TTGCAAGACT TGAGCAACAG AAGGAAGAGT TGACGCATAA GTTGGAAGAG AGCAAAGTCG AACGACAGAT GTTTGATAAT AGCGAAGATA ATGAGGGCAA AAACGAACAA GAATTGGATG AAGAAGACGA AGAACCTAGA TCAAGCCGCG ACCTGTCAGC GCAAACACCA TACCCAGAAG TTAAAAGGGA ATACAGCCGA CAACGCACCT CAGTACCCTC CAGTAGACCT CAAGAACCGA AGGTATCCCA ACCCGAGTAC TACCATGGGC AGTATACCAA GCTCTCAACC TTTATCACTC AAGTGACAAT GGTGATTACC CTCCAACCTT CTCGTTTCCC TACCGAGACC TCCAAAGTCC TATACGCCGG ATCTTTCCTC CGAGATACCC CATTCTTATG GTTCCAACCC TTCGTAACCA TCGATCCCCA GCCCAAGTTT ATGCTGGACT TCAAGAAATT TTGTGCCGAA TTAAGGAAGA ACTTCGGAGA TCCAGACGAA GAACAGACAG CAGAACGACA ACTAAACACT GTTCGTCAGC AAGGTTCTGT ATCCTCATAC CTCGCAACCT TTATGCGTTA TGCCACCTTG GTTCAGTGGA ACGACGAAGC GAAGAAGGCT TGCTTCTACA GGGGCTTGAA GGATGACATC AAAGACGAAC TTGCCAGACT ACCCAAAGCC AAGTCATTCA AGAACCTCCA AGACATGGCC ATCCGCATTG ATAGCCGTCG ATATGAACGA GTATTAGCAA AGCGAGACCA GCAACCAAAG GCGCCTTTCA ACGCCACCCG AAGCGACTAC ACCCGCACCT CTTACAATAA CAATCGCCCC AACAACTTCA GACGTTTCTC TGCGGCGAAT AGCATGCCGA TAAGGTCTAC CACTGCCAAC ACCACCTTCA ATAAAGAGGT GACGCCGGCA GTCAACCTGA GGGCTGCATT TGTCCCAAGT TCAGCTAGGA TAACCAGACG TGGACGTCTG ACTCCAGAAG AATATCAGAG GCGAAAGGAT CATAACCTCT GCCTCTATTG CGCCGACAAA AACCATCAAG TCGCCAAGTG CCCAGTGGTC CCCTCGCAAC AATCCAATAC TACTCTCCCT TCAAAAAACT AGATATGCTC TTGTCTGCCA AAGTCGAAGA AGGTAGCCGG GAACAAGAGC GTACTATCAA ACCGGCCAAT ACAGACTCCT GCGAATATCT CCAAACTCTC GAAGATAATA ACAAAAACAA CAAAAACCAA CTCACAATCG ACTTTCTCTT TCACAACAAT GTTTATCAAG CTTTAATCGA TTCCGGTGCC TCTACAAACT TCATCGACAA AAGATTCGTC CAGACCTTTA ACCTCAAAAC CACGAAAATA GAAGATTCGA TCCCATTATA CCTATTCAAC GCTGCGGGTC AGCGAACTAT AATTGAAGAA GAAGCCAACA TCCTGGTCAA CTTCCAGAAA CCATTCGGAC ACACCTTACT CCGACTCCTC ATAACCGACA TCGGCTCCTA TCCCATCGTC TTAGGTATCA CCTGGTTACA AGAGCACAAT CCGTCCATCA GCTGGGAAAC ACTTTCCATA CACCCACCTG TATCACAGAC GACGAGTGCC AACTTAGCCA TGGTCATCAC CAATGACAAA CCTCCAAAAG AAAACACCGA TGCCGAAATA GTACCTAAAG AATACCATCA ATATCTAGAT GTATTCGACA AGAAAAGCGC CGATACACTC CCAGAACATA GGTCTTTCGA CCACCATATC CCTCTCGAAG AAGGAAAGAA CCCACCTTTT GGTCCCATAT ACAATCTCTC CGAAACAGAA CTTGAAGCTC TCCGCGAATA CCTTGATGAG AATCTTAAGA AAGGTTTTAT CCGACCGTCC GAATCACCAG CCGGAGCACC CATACTCTTT GTCAAAAAGA AAGACGGATC GCTTAGGATG TGTGTCGATT ACCGGGGAAT CAACAAGATC ACCATCAAGA ATCGCTATCC TCTACCATTG ATCGCCGAAC TCCTAGATCG ACTCAAATCA GCCAAAGTAT TCACCAAGAT CGACCTGCGA GGAGCCTACA ATTTACTTCG CATTAAGGCA GGCGAAGAAT GGAAAACAGC TTTCCGTACT CGCTATGGGC ATTTCGAATA TTTGGTAATG CCGTTTGGCC TCACCAATGC CCCTGCATCC TTCCAACATC TCATGAACCA CAATTTCCGC GACTTGCTAG ACATATTTGT TATCATCTAC CTCGACGACA TCCTCATCTA CAGCCCAGAC TTGGAGACTC ACCAGTCACA CGTCATACAA GTCCTAGATC GCCTCCGCCA AACCCAATTA TATGTCAAAG CTTCAAAGTG CGAGTTCCAT CAAACCTCAG TAGAGTTCCT AGGTTTCGTT GTCAGCGACC AAGGTCTATC AATGGACACC AAGAAAGTAA AGTCTATCAC GGAATGGCCG ACACCTCGCA ATCTCCGTGA TACCCAATCC TTCCTTGGGT TCTGTAACTT CTACCGAAGG TTCATCAAGG ACTACTCTAG TATCGCCAAA CCTCTTATCG ACTTGACAAA GAAGGACTTA CCCTTTGTAT GGGAAGAACC TCAACGAACA TCTTTCGAAG CACTCAAAAA GAGTTTCACC TCTGTTGATC TCCTACGTCA TTACGATCCG ACCAAGCAAC TCATCCTTGA AACCGACGCC TCCGACTATG CCATCGCAGG TATCTTATCA CATGAAATCG ACAAGAAACT CGAACCAGTT GCTTTCTTCT CTCACAAAAT GTTGCCTGCC GAGTTAAACT ATCCTATTCA CGACAAAGAA ATGTTAGCAA TTGTTTCAGC ATTCAAAGAA TGGCGACATT ACTTCGAAGG TGCTAGAGAA ACCATTCGTG TCTACACCGA CCACAGAAGC CTGGAGTACT TTATGACTAC CAAGCAACTC AATCGACGAC AGGCGCGATG GTCTGAATTC CTAGCCGACT TTGACTTCAA TATCATCTAC CGACCAGGCG TACAAGGCAC AAAGCCTGAC GCACTCACCC GAAGACATGA TTATCATCCA CTCGAGAAAG GCTCCAGCCT TACTACTGCT GCCAATCCTC AGAATTTCCA GACTCTCCTT CGCCCTGGAC AGTACTTGGG TACTGCCACA ACCGGACTCG ATCGGTTGGA AATATCTTCG CCCATCAAGT CGTTGTTGAA AACCGGTCTA GAAACCGATG AATCAGCAAA ACCATTCTTG GACAAAGCCA ACCATCCCTC CGAAGCTCAC CCATATACTC GAGACGATGA AGGACTCCTC AGATATGGCG AATCATTCTA TGTCCCAGCC AATAACGAGC TACGCACCCT CGTCACGAAA GAATGCCATG ATGCACTCAC TAGTGGGCAT CCCGGACGAC GCAAGACTAT CCAACTCATC CGACGCCATT ACTGGTGGCC AGGCCTAAAA GGCTTCGTCA ATCACTACAT TGATTCCTGC GATCTTTGTT GCAGAACTAA GACAAGACGT CATCAGCCCT ATGGCGAACT CAAGTCTCTA CCCATTCCCC CATATCCCTG GTCATCTGTA TCGATGGACC TCATTGAACA ACTTCCCCCA TCACACGGCT ACAACACCAT CCTTGTGATC GTAGACCGAC TCACCAAGAT GGCTCTCTTT ATCCCCACAA CGACTAGCCT CAACGCCGAG GAACTCGCCC AATTATATGT CACCCACGTC TTCTCCAAGC ACGGGATTCC GACCAGTATT GTATCAGATC GTGGATCTGA ATTCACATCC CGCTTTTGGC GAGCATTCAC ACAACTCCTA CACATCGAGT TAGAACTCAG TACAGCTTTT CACCCAGAAA CAGATGGACA AACCGAACGA GTGAACCAGG TCTTAGAACA ATATCTGCGC CTTTATACCG ATTATAAGCA AAAGGAATGG GCACCGCTAC TCCCAGTTGC GGAATTCACT TACAACAATA CGCCCCATTC GTCCACTACC ATGTCCCCCT TCTTTGCCAA CAAAGGGTAC CATCCCAGGG CATCGTTTAC CCCCGATGAC AACGTTCCTA TTTTCAGCCC ACCTGCCAGA GCCTCCATCA CCGACTTGAG CAAGCTCCAC GAACACCTCA AGATAGAAAT GTCCAAAGCA CAAGAGAGTG CAGCACTACA GTTTGATAAG CACCGTGCCC CACTTCCCGA ATATACTATC GGCGACAAAG TCTGGCTATC TGCCCGTAAC ATCAAAACGA AACGACCCAC CAAGAAATTA GATCACCGTT ATCTCGGTCC CTACACCATT ATCGCGCGCG TTTCTTCCCA CGCGTATCGC CTTGAGTTGC CGAAATCAAT GCGTATCCAC GACGTCTTCC ACGTCCAATT GCTTGAGAAA TATATTGAGA ATGAGATCCC AGGGCGAACA CAAGTCGCAC CATCACCTAT CGAAGTCGAA GGTGACCTAG AATACGAAGT CGAGTGCATC CTCGATCATC GATTTTACCG AAAACGCCGC CAATTCCTTA TCAAGTGGCT CGGCTACAGT GCCGAACACA ACAGTTGGGA ACCCGAAACC GCTCTAGAAA ATGCTTCAGA GATTGTTGAT CAGTATAAGT CAACACACCG ATTATAG
|
Protein sequence | MSGPSTRSGR AVEKVEKGKE VEHTLDDDHN PAGSFNTNPQ SDPFTADNTS NDTQTQLEMM RNHIARLEQQ KEELTHKLEE SKVERQMFDN SEDNEGKNEQ ELDEEDEEPR SSRDLSAQTP YPEVKREYSR QRTSVPSSRP QEPKVSQPEY YHGQYTKLST FITQVTMVIT LQPSRFPTET SKVLYAGSFL RDTPFLWFQP FVTIDPQPKF MLDFKKFCAE LRKNFGDPDE EQTAERQLNT VRQQGSVSSY LATFMRYATL VQWNDEAKKA CFYRGLKDDI KDELARLPKA KSFKNLQDMA IRIDSRRYER VLAKRDQQPK APFNATRSDY TRTSYNNNRP NNFRRFSAAN SMPIRSTTAN TTFNKEVTPA VNLRAAFVPT LIDSGASTNF IDKRFVQTFN LKTTKIEDSI PLYLFNAAGQ RTIIEEEANI LVNFQKPFGH TLLRLLITDI GSYPIVLGIT WLQEHNPSIS WETLSIHPPV SQTTSANLAM VITNDKPPKE NTDAEIVPKE YHQYLDVFDK KSADTLPEHR SFDHHIPLEE GKNPPFGPIY NLSETELEAL REYLDENLKK GFIRPSESPA GAPILFVKKK DGSLRMCVDY RGINKITIKN RYPLPLIAEL LDRLKSAKVF TKIDLRGAYN LLRIKAGEEW KTAFRTRYGH FEYLVMPFGL TNAPASFQHL MNHNFRDLLD IFVIIYLDDI LIYSPDLETH QSHVIQVLDR LRQTQLYVKA SKCEFHQTSV EFLGFVVSDQ GLSMDTKKVK SITEWPTPRN LRDTQSFLGF CNFYRRFIKD YSSIAKPLID LTKKDLPFVW EEPQRTSFEA LKKSFTSVDL LRHYDPTKQL ILETDASDYA IAGILSHEID KKLEPVAFFS HKMLPAELNY PIHDKEMLAI VSAFKEWRHY FEGARETIRV YTDHRSLEYF MTTKQLNRRQ ARWSEFLADF DFNIIYRPGV QGTKPDALTR RHDYHPLEKG SSLTTAANPQ NFQTLLRPGQ YLGTATTGLD RLEISSPIKS LLKTGLETDE SAKPFLDKAN HPSEAHPYTR DDEGLLRYGE SFYVPANNEL RTLVTKECHD ALTSGHPGRR KTIQLIRRHY WWPGLKGFVN HYIDSCDLCC RTKTRRHQPY GELKSLPIPP YPWSSVSMDL IEQLPPSHGY NTILVIVDRL TKMALFIPTT TSLNAEELAQ LYVTHVFSKH GIPTSIVSDR GSEFTSRFWR AFTQLLHIEL ELSTAFHPET DGQTERVNQV LEQYLRLYTD YKQKEWAPLL PVAEFTYNNT PHSSTTMSPF FANKGYHPRA SFTPDDNVPI FSPPARASIT DLSKLHEHLK IEMSKAQESA ALQFDKHRAP LPEYTIGDKV WLSARNIKTK RPTKKLDHRY LGPYTIIARV SSHAYRLELP KSMRIHDVFH VQLLEKYIEN EIPGRTQVAP SPIEVEGDLE YEVECILDHR FYRKRRQFLI KWLGYSAEHN SWEPETALEN ASEIVDQYKS THRL
|
| |