Gene CNA00710 details

Gene Information

Locus tag: CNA00710
Symbol:
ID: 3253470
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 206616
End bp: 209983
Gene Length: 3368 bp
Protein Length: 946 aa
Translation table:
GC content: 50%
IMG OID: 638252404
Product: pre-mRNA splicing factor prp1, putative
Protein accession: XP_566496
Protein GI: 58258167
COG category:
COG ID:
TIGRFAM ID:
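
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported gene length, but not stated in the record). The function names are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(206616, 209983)
coding_len = min_coding_length(946)

print(gene_len)               # 3368, matches the Gene Length field
print(gene_len - coding_len)  # 527 bp not accounted for by coding sequence
```

The 527 bp difference is what one would expect for a gene flagged as spliced: it would be taken up by introns and any untranslated regions included in the gene span.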


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.547118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTTGCAGCTG GATTCTCTTC TTATTTTCTA ACGCCTAAAA CATCGCGAGT ATGTCCAACG 
TCGGCACAGT GAAGCATATT CCCAAGGAAG TGCGATACAA CTTCCTCAAC GTACGTATCT
CCTCAGTTTG TTGAAGCGGG ATCGGGAGCT AACAGTACCC ATAGATGGCC GCTCCGGCGA
GCTATGTTGC TGGTCTTGGT CGAGGGTAAG TGGCATCTGC GCCCAATTCC TGTGTGACTT
TAAACTGACA GACGTATAGT GCTTCAGGTT TCACAACTCG GTCCGATATT GGTCCCGCCC
GAGCTGGTCC CAGTGCAGAG GTTGTTGCGG AAGCTCAAGC TCGCCGTGGT GAAGAAGAAA
TCCCTGATCC AGATGCTTTC CAAGATCCGG ACGACGAAAG GAATCTGTTT GCTGGTACAG
TCTACGAAGC GGACGATGAA GAGGCCGACA GGGTGTGGGA CAGTGTTGAT GCCAGAATGG
ATGCGAGAAG AAAGGCCAGA CGGTGGGTCT TCTCTATGGA TATCTTGTCT CATGTTCGAT
ACTGATTAGA TATGCAGAGA CGCGGCAGAA GCGAAAGCGG CGGCTGAAGA ACGTGCTCGC
AATCCCAAAC TTCAAACGCA ATTTGCAGAC TTGAAACGAT CTCTATCGAG CCTCAACGAT
GCTGATTGGG ACGCCATCCC TGAAGCAGGA AACTTGACTG GAAAAAGGAG AAAGGCCAAT
TTGCGATTGG AGGAAAATCA GAATGGAAGA AGCTACAATG TCAGCGACAC TGTCATTGCA
GATGCCGTGA AGAGAAATGC CATGGTAGGA GAGTTGGATC CTGCAGAGGT GGGTAACCTA
GTAGTAACTC GTCATCGAAA CTAATCACTT CCACAGGCTG GTATTGGTAT CGATGGTACC
GAAACGGATC TTGTATCTAT CGGTAATGCC AGGGACCGAG TATTGTCGCT GCAGCTTGAC
CAAGTGAGTT TAGATTTAAT CGTGGGTTAT GGTATTAGTC ATTGACAGCT TTGGTTAGGC
CACAAGAGAC GCCTCAAACG GCTCTTCTAC CAGCATCGAC CCTAAAGGCT ATATGACCGC
TCTTAACAGT CAGATTGTTC AAACAGACGC TCAAATTGGT GATATTAAGC AAGCTCGCCA
GCTCTTGCAA AACCTCATTC AGTCTAATCC CAAACACGCC CCAGGATGGA TCGCCGCCGC
TTCCTTGGAA GTACACGCGA AGAAGATGGT CGCTGCCAGG AAGATTATCG CTGAAGGATG
TGAGAAGTGT CCGAAAAACG AGGATGTTTG GTTCCATGCC GCTGAACTCA ACACACCGGA
GAACGCGAAA GTTATCTTGG GTCGAGCTAT ACAGCACGTT CCTCAATCTG TTAAAATTTG
GCTCAAGGCT GCTTCTCTAG AAACAGACAT AAACGCCAAG AAGCGCGTTC TCCGAAAAGC
CCTTGAATTC GTTCCCAACT CTGTGGGGTT GTGGAAGGAG ACTGTCAACC TGGAAGATGA
TCCTGAAGAC GCCCGCGTTC TCCTTACCCG TGCTGTTGAA GTCATCCCCA ACTCTGTGGA
GCTCTGGCTT ACTCTGGCCC GTCTTGAAAC TCCTGAAAAC GCCAAGCAGG TTCTCAATTC
TGCGCGCAAG CGTATCCCTA CCTCTCACGA AATCTGGATC GCTGCCGGTA GGCTTGCTGA
GCAGTCACCT TCCGCCGTGG CTGTCAAGCC AGAGGTCAAG ATGGAGGACG AAGCGGAATA
CGAGGCTGAG CAAAGAAAGA AGCTTGCTCA GCAGGTCAAC AAACTCATGG CTGGTGCAGT
CAATTCATTG CGCAAGAATC AGGTCATTCT TTCGCGAGAA CAATGGTTGC AAGAGGCCGA
GAAATGTGAA CAGGACGGCT CACCTCTTAC AGCGCAAGCT ATCGTGAAGG CTACCATCGC
TCAGGACGTC GAGGAAGAAG ATAGGAGATC TGTCTGGATT GAAGATGCGG AGAGGGCGAC
AAAGGGTGGA TTTTACGAGG TCGCGAGAGC TTGTTACGCC GTCACTCTCG AGGCTTTTCC
TAATACTCCA TCAGTCTGGA GAAAAGCCGC CGAGTTCGAA AAGGCCCATG GCACACCGTG
AGTTACTTTA TGTAATTGCA AGGCCAGATG ACTGACAAAA GATTGTAGCG ATGCTGTCCA
AGAAATTCTC GCCCAAGGAT CCCAACACTG TCCTCATGCG GAGGTTCTCT GGCTTATGGC
TGCGAAAGAG AAGTGGGTCG GCGGCGATAT CCCCGGTGCT CAAGCCATTC TTGCCGAAGC
TTTCAAACAA AACGAAGATT CCGAATCTAT CTTCCTTGCT GCCGCCAAGC TAGCAGCTGA
GACCGGCGAG ATGGAGGCTG CTATCCAGAT CCTTGAGAAG GCCAAGGCAC AGGCAGACAC
AGAGAGAGTC TGGATGAAGT CAGCGGTACT GTTGAGGCAA TTGGGCAAGT TGGACGAGGC
TCTTTCAACC TTGGAAGTTG CAATCAAGAA ATTCGCTTCC TTTGACAAAT TGCACATGAT
CCGAGGGCAG ATCTACGAGT CCCGTAATGA GGTTGCGCTT GCGCGAAATG CATATGCTCA
AGGATGCCGA TCATGTCCGA AGAGTATCCC ATTATGGATC TTGTCGGCTC GTCTGGAGGA
GAAGGCGGGT GTGACGATCA AGGCAAGGGC ATTGCTCGAA AAGGCGAGGT TGCATAATCC
CAAGAATGAT GAATTATGGG CGGAAAGTAT CAAGATTGAA GAACGAACGG GCAGCCCACA
GCAAGCGAAA TCTGTCCTTG CTCGAGGTAA GTATCACTTT ATTTCTAGCC CATCCTGCGA
TTTACTAACA GCTTCATTCT ATAGCAATGC AAGAATGCCC CGCCTCTCCT CTTCTTTGGT
CCATGGCCAT CTTCATGGAG ACTCCTCAAC AAAGAAAAGG TCGTTCCGTT GACGCAATCA
AAAAGGCCGG CGAACATCCG GCCGTCATCT TGGCGGTCGC GAGAAACTTC TGGAGTGAAA
GGAAGATTGA AAAGACGAGA CAGTGGATGG CCAATGCTAT TACCGCCGAT GAAGATTGGG
GAGATGCCTG GGGTTATTGG CTGAAGTTCG AGAGGCAACA TGGAGAGAAA GTGAGCTGAT
TCTTGTTTTC AGTTTCTAAA AGGATATCTG GACTGATACA TGAATATATA GGAACGTCAA
GAAGCGGTCG TTGAAAAATG CATCGCGGCA TCACCACGCC ATGGTCCGGT ATGGCAGTCG
GTATCAAAGG ATTTGGCCAA TGTTGGCAAG TCTACAAAAG AGATACTGGA GTTGGTCGCG
GACAAACTGG AATAATGTTT TAGTATAGTG CCAGTTCTTT TGGTATGTCA TGATCACAAT
ATGTCCCG
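
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any length or GC calculation. The sketch below shows one way to do that. The helper names are hypothetical, and the GC computation is a plain G+C fraction, which may differ from how the database derives its rounded GC content value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "GTTGCAGCTG GATTCTCTTC TTATTTTCTA ACGCCTAAAA CATCGCGAGT ATGTCCAACG"

print(len(clean_sequence(first_line)))  # 60 bases per full line
print(gc_percent(first_line))
```

Passing the full multi-line sequence to `clean_sequence` should give a string whose length can be compared with the Gene Length field.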
 
Protein sequence
MSNVGTVKHI PKEVRYNFLN MAAPASYVAG LGRGASGFTT RSDIGPARAG PSAEVVAEAQ 
ARRGEEEIPD PDAFQDPDDE RNLFAGTVYE ADDEEADRVW DSVDARMDAR RKARRDAAEA
KAAAEERARN PKLQTQFADL KRSLSSLNDA DWDAIPEAGN LTGKRRKANL RLEENQNGRS
YNVSDTVIAD AVKRNAMVGE LDPAEVGNLA GIGIDGTETD LVSIGNARDR VLSLQLDQAT
RDASNGSSTS IDPKGYMTAL NSQIVQTDAQ IGDIKQARQL LQNLIQSNPK HAPGWIAAAS
LEVHAKKMVA ARKIIAEGCE KCPKNEDVWF HAAELNTPEN AKVILGRAIQ HVPQSVKIWL
KAASLETDIN AKKRVLRKAL EFVPNSVGLW KETVNLEDDP EDARVLLTRA VEVIPNSVEL
WLTLARLETP ENAKQVLNSA RKRIPTSHEI WIAAGRLAEQ SPSAVAVKPE VKMEDEAEYE
AEQRKKLAQQ VNKLMAGAVN SLRKNQVILS REQWLQEAEK CEQDGSPLTA QAIVKATIAQ
DVEEEDRRSV WIEDAERATK GGFYEVARAC YAVTLEAFPN TPSVWRKAAE FEKAHGTPDA
VQEILAQGSQ HCPHAEVLWL MAAKEKWVGG DIPGAQAILA EAFKQNEDSE SIFLAAAKLA
AETGEMEAAI QILEKAKAQA DTERVWMKSA VLLRQLGKLD EALSTLEVAI KKFASFDKLH
MIRGQIYESR NEVALARNAY AQGCRSCPKS IPLWILSARL EEKAGVTIKA RALLEKARLH
NPKNDELWAE SIKIEERTGS PQQAKSVLAR AMQECPASPL LWSMAIFMET PQQRKGRSVD
AIKKAGEHPA VILAVARNFW SERKIEKTRQ WMANAITADE DWGDAWGYWL KFERQHGEKE
RQEAVVEKCI AASPRHGPVW QSVSKDLANV GKSTKEILEL VADKLE