Gene CNF03140 details


Gene Information

Locus tag: CNF03140
Symbol:
ID: 3258200
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 928874
End bp: 933660
Gene Length: 4787 bp
Protein Length: 1484 aa
Translation table:
GC content: 48%
IMG OID: 638257431
Product: retrotransposon nucleocapsid protein, putative
Protein accession: XP_571377
Protein GI: 58268442
COG category:
COG ID:
TIGRFAM ID:
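
Note: the coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that matches the listed Gene Length) and that the protein is followed by a single stop codon. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields above.
gene_len = span_length(928874, 933660)

# Coding length implied by the protein: 1484 codons plus one assumed stop codon.
cds_len = (1484 + 1) * 3

# Remainder of the gene span that is not accounted for by coding sequence.
noncoding = gene_len - cds_len

print(gene_len, cds_len, noncoding)
```

Under these assumptions the span length agrees with the listed Gene Length of 4787 bp, and the remainder not explained by coding sequence would be consistent with the record marking the gene as spliced.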


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGGTC CATCCACTCG TAGTGGTCGA GCTGTGGAAA AGGTCGAAAA AGGCAAGGAA 
GTAGAACATA CCCTCGATGA CGACCACAAT CCCGCGGGTT CATTCAACAC CAATCCTCAA
TCCGATCCAT TTACCGCCGA CAACACATCA AACGACACCC AAACCCAACT CGAGATGATG
CGCAACCATA TTGCAAGACT TGAGCAACAG AAGGAAGAGT TGACGCATAA GTTGGAAGAG
AGCAAAGTCG AACGACAGAT GTTTGATAAT AGCGAAGATA ATGAGGGCAA AAACGAACAA
GAATTGGATG AAGAAGACGA AGAACCTAGA TCAAGCCGCG ACCTGTCAGC GCAAACACCA
TACCCAGAAG TTAAAAGGGA ATACAGCCGA CAACGCACCT CAGTACCCTC CAGTAGACCT
CAAGAACCGA AGGTATCCCA ACCCGAGTAC TACCATGGGC AGTATACCAA GCTCTCAACC
TTTATCACTC AAGTGACAAT GGTGATTACC CTCCAACCTT CTCGTTTCCC TACCGAGACC
TCCAAAGTCC TATACGCCGG ATCTTTCCTC CGAGATACCC CATTCTTATG GTTCCAACCC
TTCGTAACCA TCGATCCCCA GCCCAAGTTT ATGCTGGACT TCAAGAAATT TTGTGCCGAA
TTAAGGAAGA ACTTCGGAGA TCCAGACGAA GAACAGACAG CAGAACGACA ACTAAACACT
GTTCGTCAGC AAGGTTCTGT ATCCTCATAC CTCGCAACCT TTATGCGTTA TGCCACCTTG
GTTCAGTGGA ACGACGAAGC GAAGAAGGCT TGCTTCTACA GGGGCTTGAA GGATGACATC
AAAGACGAAC TTGCCAGACT ACCCAAAGCC AAGTCATTCA AGAACCTCCA AGACATGGCC
ATCCGCATTG ATAGCCGTCG ATATGAACGA GTATTAGCAA AGCGAGACCA GCAACCAAAG
GCGCCTTTCA ACGCCACCCG AAGCGACTAC ACCCGCACCT CTTACAATAA CAATCGCCCC
AACAACTTCA GACGTTTCTC TGCGGCGAAT AGCATGCCGA TAAGGTCTAC CACTGCCAAC
ACCACCTTCA ATAAAGAGGT GACGCCGGCA GTCAACCTGA GGGCTGCATT TGTCCCAAGT
TCAGCTAGGA TAACCAGACG TGGACGTCTG ACTCCAGAAG AATATCAGAG GCGAAAGGAT
CATAACCTCT GCCTCTATTG CGCCGACAAA AACCATCAAG TCGCCAAGTG CCCAGTGGTC
CCCTCGCAAC AATCCAATAC TACTCTCCCT TCAAAAAACT AGATATGCTC TTGTCTGCCA
AAGTCGAAGA AGGTAGCCGG GAACAAGAGC GTACTATCAA ACCGGCCAAT ACAGACTCCT
GCGAATATCT CCAAACTCTC GAAGATAATA ACAAAAACAA CAAAAACCAA CTCACAATCG
ACTTTCTCTT TCACAACAAT GTTTATCAAG CTTTAATCGA TTCCGGTGCC TCTACAAACT
TCATCGACAA AAGATTCGTC CAGACCTTTA ACCTCAAAAC CACGAAAATA GAAGATTCGA
TCCCATTATA CCTATTCAAC GCTGCGGGTC AGCGAACTAT AATTGAAGAA GAAGCCAACA
TCCTGGTCAA CTTCCAGAAA CCATTCGGAC ACACCTTACT CCGACTCCTC ATAACCGACA
TCGGCTCCTA TCCCATCGTC TTAGGTATCA CCTGGTTACA AGAGCACAAT CCGTCCATCA
GCTGGGAAAC ACTTTCCATA CACCCACCTG TATCACAGAC GACGAGTGCC AACTTAGCCA
TGGTCATCAC CAATGACAAA CCTCCAAAAG AAAACACCGA TGCCGAAATA GTACCTAAAG
AATACCATCA ATATCTAGAT GTATTCGACA AGAAAAGCGC CGATACACTC CCAGAACATA
GGTCTTTCGA CCACCATATC CCTCTCGAAG AAGGAAAGAA CCCACCTTTT GGTCCCATAT
ACAATCTCTC CGAAACAGAA CTTGAAGCTC TCCGCGAATA CCTTGATGAG AATCTTAAGA
AAGGTTTTAT CCGACCGTCC GAATCACCAG CCGGAGCACC CATACTCTTT GTCAAAAAGA
AAGACGGATC GCTTAGGATG TGTGTCGATT ACCGGGGAAT CAACAAGATC ACCATCAAGA
ATCGCTATCC TCTACCATTG ATCGCCGAAC TCCTAGATCG ACTCAAATCA GCCAAAGTAT
TCACCAAGAT CGACCTGCGA GGAGCCTACA ATTTACTTCG CATTAAGGCA GGCGAAGAAT
GGAAAACAGC TTTCCGTACT CGCTATGGGC ATTTCGAATA TTTGGTAATG CCGTTTGGCC
TCACCAATGC CCCTGCATCC TTCCAACATC TCATGAACCA CAATTTCCGC GACTTGCTAG
ACATATTTGT TATCATCTAC CTCGACGACA TCCTCATCTA CAGCCCAGAC TTGGAGACTC
ACCAGTCACA CGTCATACAA GTCCTAGATC GCCTCCGCCA AACCCAATTA TATGTCAAAG
CTTCAAAGTG CGAGTTCCAT CAAACCTCAG TAGAGTTCCT AGGTTTCGTT GTCAGCGACC
AAGGTCTATC AATGGACACC AAGAAAGTAA AGTCTATCAC GGAATGGCCG ACACCTCGCA
ATCTCCGTGA TACCCAATCC TTCCTTGGGT TCTGTAACTT CTACCGAAGG TTCATCAAGG
ACTACTCTAG TATCGCCAAA CCTCTTATCG ACTTGACAAA GAAGGACTTA CCCTTTGTAT
GGGAAGAACC TCAACGAACA TCTTTCGAAG CACTCAAAAA GAGTTTCACC TCTGTTGATC
TCCTACGTCA TTACGATCCG ACCAAGCAAC TCATCCTTGA AACCGACGCC TCCGACTATG
CCATCGCAGG TATCTTATCA CATGAAATCG ACAAGAAACT CGAACCAGTT GCTTTCTTCT
CTCACAAAAT GTTGCCTGCC GAGTTAAACT ATCCTATTCA CGACAAAGAA ATGTTAGCAA
TTGTTTCAGC ATTCAAAGAA TGGCGACATT ACTTCGAAGG TGCTAGAGAA ACCATTCGTG
TCTACACCGA CCACAGAAGC CTGGAGTACT TTATGACTAC CAAGCAACTC AATCGACGAC
AGGCGCGATG GTCTGAATTC CTAGCCGACT TTGACTTCAA TATCATCTAC CGACCAGGCG
TACAAGGCAC AAAGCCTGAC GCACTCACCC GAAGACATGA TTATCATCCA CTCGAGAAAG
GCTCCAGCCT TACTACTGCT GCCAATCCTC AGAATTTCCA GACTCTCCTT CGCCCTGGAC
AGTACTTGGG TACTGCCACA ACCGGACTCG ATCGGTTGGA AATATCTTCG CCCATCAAGT
CGTTGTTGAA AACCGGTCTA GAAACCGATG AATCAGCAAA ACCATTCTTG GACAAAGCCA
ACCATCCCTC CGAAGCTCAC CCATATACTC GAGACGATGA AGGACTCCTC AGATATGGCG
AATCATTCTA TGTCCCAGCC AATAACGAGC TACGCACCCT CGTCACGAAA GAATGCCATG
ATGCACTCAC TAGTGGGCAT CCCGGACGAC GCAAGACTAT CCAACTCATC CGACGCCATT
ACTGGTGGCC AGGCCTAAAA GGCTTCGTCA ATCACTACAT TGATTCCTGC GATCTTTGTT
GCAGAACTAA GACAAGACGT CATCAGCCCT ATGGCGAACT CAAGTCTCTA CCCATTCCCC
CATATCCCTG GTCATCTGTA TCGATGGACC TCATTGAACA ACTTCCCCCA TCACACGGCT
ACAACACCAT CCTTGTGATC GTAGACCGAC TCACCAAGAT GGCTCTCTTT ATCCCCACAA
CGACTAGCCT CAACGCCGAG GAACTCGCCC AATTATATGT CACCCACGTC TTCTCCAAGC
ACGGGATTCC GACCAGTATT GTATCAGATC GTGGATCTGA ATTCACATCC CGCTTTTGGC
GAGCATTCAC ACAACTCCTA CACATCGAGT TAGAACTCAG TACAGCTTTT CACCCAGAAA
CAGATGGACA AACCGAACGA GTGAACCAGG TCTTAGAACA ATATCTGCGC CTTTATACCG
ATTATAAGCA AAAGGAATGG GCACCGCTAC TCCCAGTTGC GGAATTCACT TACAACAATA
CGCCCCATTC GTCCACTACC ATGTCCCCCT TCTTTGCCAA CAAAGGGTAC CATCCCAGGG
CATCGTTTAC CCCCGATGAC AACGTTCCTA TTTTCAGCCC ACCTGCCAGA GCCTCCATCA
CCGACTTGAG CAAGCTCCAC GAACACCTCA AGATAGAAAT GTCCAAAGCA CAAGAGAGTG
CAGCACTACA GTTTGATAAG CACCGTGCCC CACTTCCCGA ATATACTATC GGCGACAAAG
TCTGGCTATC TGCCCGTAAC ATCAAAACGA AACGACCCAC CAAGAAATTA GATCACCGTT
ATCTCGGTCC CTACACCATT ATCGCGCGCG TTTCTTCCCA CGCGTATCGC CTTGAGTTGC
CGAAATCAAT GCGTATCCAC GACGTCTTCC ACGTCCAATT GCTTGAGAAA TATATTGAGA
ATGAGATCCC AGGGCGAACA CAAGTCGCAC CATCACCTAT CGAAGTCGAA GGTGACCTAG
AATACGAAGT CGAGTGCATC CTCGATCATC GATTTTACCG AAAACGCCGC CAATTCCTTA
TCAAGTGGCT CGGCTACAGT GCCGAACACA ACAGTTGGGA ACCCGAAACC GCTCTAGAAA
ATGCTTCAGA GATTGTTGAT CAGTATAAGT CAACACACCG ATTATAG
 
Protein sequence
MSGPSTRSGR AVEKVEKGKE VEHTLDDDHN PAGSFNTNPQ SDPFTADNTS NDTQTQLEMM 
RNHIARLEQQ KEELTHKLEE SKVERQMFDN SEDNEGKNEQ ELDEEDEEPR SSRDLSAQTP
YPEVKREYSR QRTSVPSSRP QEPKVSQPEY YHGQYTKLST FITQVTMVIT LQPSRFPTET
SKVLYAGSFL RDTPFLWFQP FVTIDPQPKF MLDFKKFCAE LRKNFGDPDE EQTAERQLNT
VRQQGSVSSY LATFMRYATL VQWNDEAKKA CFYRGLKDDI KDELARLPKA KSFKNLQDMA
IRIDSRRYER VLAKRDQQPK APFNATRSDY TRTSYNNNRP NNFRRFSAAN SMPIRSTTAN
TTFNKEVTPA VNLRAAFVPT LIDSGASTNF IDKRFVQTFN LKTTKIEDSI PLYLFNAAGQ
RTIIEEEANI LVNFQKPFGH TLLRLLITDI GSYPIVLGIT WLQEHNPSIS WETLSIHPPV
SQTTSANLAM VITNDKPPKE NTDAEIVPKE YHQYLDVFDK KSADTLPEHR SFDHHIPLEE
GKNPPFGPIY NLSETELEAL REYLDENLKK GFIRPSESPA GAPILFVKKK DGSLRMCVDY
RGINKITIKN RYPLPLIAEL LDRLKSAKVF TKIDLRGAYN LLRIKAGEEW KTAFRTRYGH
FEYLVMPFGL TNAPASFQHL MNHNFRDLLD IFVIIYLDDI LIYSPDLETH QSHVIQVLDR
LRQTQLYVKA SKCEFHQTSV EFLGFVVSDQ GLSMDTKKVK SITEWPTPRN LRDTQSFLGF
CNFYRRFIKD YSSIAKPLID LTKKDLPFVW EEPQRTSFEA LKKSFTSVDL LRHYDPTKQL
ILETDASDYA IAGILSHEID KKLEPVAFFS HKMLPAELNY PIHDKEMLAI VSAFKEWRHY
FEGARETIRV YTDHRSLEYF MTTKQLNRRQ ARWSEFLADF DFNIIYRPGV QGTKPDALTR
RHDYHPLEKG SSLTTAANPQ NFQTLLRPGQ YLGTATTGLD RLEISSPIKS LLKTGLETDE
SAKPFLDKAN HPSEAHPYTR DDEGLLRYGE SFYVPANNEL RTLVTKECHD ALTSGHPGRR
KTIQLIRRHY WWPGLKGFVN HYIDSCDLCC RTKTRRHQPY GELKSLPIPP YPWSSVSMDL
IEQLPPSHGY NTILVIVDRL TKMALFIPTT TSLNAEELAQ LYVTHVFSKH GIPTSIVSDR
GSEFTSRFWR AFTQLLHIEL ELSTAFHPET DGQTERVNQV LEQYLRLYTD YKQKEWAPLL
PVAEFTYNNT PHSSTTMSPF FANKGYHPRA SFTPDDNVPI FSPPARASIT DLSKLHEHLK
IEMSKAQESA ALQFDKHRAP LPEYTIGDKV WLSARNIKTK RPTKKLDHRY LGPYTIIARV
SSHAYRLELP KSMRIHDVFH VQLLEKYIEN EIPGRTQVAP SPIEVEGDLE YEVECILDHR
FYRKRRQFLI KWLGYSAEHN SWEPETALEN ASEIVDQYKS THRL