Gene CNB01640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB01640 
Symbol 
ID3255930 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp488074 
End bp491080 
Gene Length3007 bp 
Protein Length699 aa 
Translation table 
GC content47% 
IMG OID638254814 
Productpre-mRNA splicing factor, putative 
Protein accessionXP_568833 
Protein GI58262846 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGATGCCTC CTCTCCAGTT CTGGAAGCCC GGTACAGCTG CTCCAGGTGG GTTCTCCTAG 
CATTGGACAC TATTAATGCT CAACAGGCAG CTCTCTTGAT AGAGAATCCG AGAAAGACGG
TTCATTACTC CCCATTAACA CATCACAAAA TGCCCATCTC AGTTTGGATG CCCAACGACA
AAGATTACCC ATCTACAAAC ATCGAGAGAA GCTGCTATGG TGTGTTGAGA AGTACCCTGT
GGTGATCGTC GTTGGCCAAA CTGGCAGTGG AAAGTCTACA CGTATGTCGT TCTCCTGTTC
TCCGCATGCG GCTGCCAGGG CTAAGCTGAA CGCCCTGTAG AATTGCCGCA GTATCTCCAT
GAGGCAGGGT GGACAGGACA AAATCATGTT GTAGCTTGTA CTCAACCTCG TCGGGTCGCT
GCTACCAGCG TTGCCACGCG AGTAGCCGAA GAAGTGGGCA GCATCCTTGG TGACGAGGTA
GGATGCTTTC TGGCTGCCGA CGCAACTGCT CAACTGACTT GTATATAGGT TGGATATACA
ATCAGGTTCG AAGATTTATC TCATCCCACA CGTACCAAAA TAAAGTATAT GACGGATGGA
ATGCTTTTTC GAGAAACCAT GATGGATCCA TTACTGAGCA AGTACAGTGT CATTATGGTG
CGTCATTTGC GTCGGTTGAT GGGCAGAGCT CACGTCGCTT CGTGTAGATT GATGAGGCGC
ATGAAAGGGG AGCATATACC GACCTCTTGC TAGGTTTGCT CAAGAAGTAC GTACAGCCAG
GATTTCCCCT ACCTCGCTGC TGACAAAATT CTTCCAGGAT CATGAGAAAA AGGCCAGAGC
TGCGAGTCAT CATCTCCTCG GCCACCATCG ATGCGTAAGT TTTAATCCCA CGGTGTTCTA
GGCATATACT GATATAAGCT CCCTCAGAGA GGATTTTTTA GAGTACTTCA ATACAAACGC
AGACGGTACA GATCGGTCAA AAGACGATGC CATCAGTACG CCGTCTGCGT TGTCAGATTA
GTTCACAAGC GCTGATCCAT TCTTAGTTGT CAGCTTGGAA GGTCGGATGT TCCCAGTAGA
AGTCTGCTAC CTTAAGGAAC CATGTGCAGA TTATACCCAG GCGGCAGTCC AGACAGTCTT
TGACCTACAT TTGAGGGTAT GTTTCCATAT ATTAGCCAGA TACACGAGGC TAATTAGAGC
AGGAACCACT TGGAGATATC CTGGTTTTCC TTACTGGGCG GGAAGAGATT GATCAAGTAA
TACAAGAAGT TGCAGATAGA TTATTATCGT GAGTCCCACT CTGCGGCATT TTTAGCTAAT
CACGTCTCAG TCTACCCAAA GCCGCACCCA AACTACTCGC TCTGCCGCTT TATGCCACTC
TCCCACCCGA AGAACAATCA CTTATTTTTG ATCCTCCTCC GCGCGATACC CGAAAAGTCA
TCTTCTCAAC CAACATTGCT GAGGCGAGTG TTACCATTGA TGGTATAAAA TACGTTGTCG
ACTCTGGCTT TGTCAAAGTA GGTCTTTTAG CCTTTCATTG ATCCTTGACA CTGATTTCTT
GTCAGATCAA AACATACAAT CCTAGAACAT GTATGGACGT CCTCACCACC ACCCCTTGTT
CACTTGCCTC TGCAAATCAG CGTGCCGGCC GTGCCGGTCG TACATCTGCT GGCAAATGCT
TCCGCCTTTA TCCCGCTTCC ATTCTACCCA CCACCAACCC ATCATCACCC ATGCCACTGA
CTACCCCTCC TGAGCTCGTG CGCTCCGATA TATCGCTTTA TCTTCTTCAA CTTAAAGCTT
TGGGTATCGA TAATCTCGCC AAATTCGATT TTATGAGCCC CCCGCCGAGC GAGATGATGA
TCAGGGCGTT GGAATTCCTT TTCTGTTTGA AGGCCATAGA CGATGAAGGA AGACTGACAA
GACCCATGGG TGAAAGGATG GCAGAGGTTC CTCTGGATCC TATGATGGCT GCTATTGTAA
GTGTCCACTG AAAAGATTCG TTTGATTGGA TACTGAAGAT ACTTGGCAGC TGTTGAATTC
CCATGAGTTC CGATGTGGAG AAGAGATTCT GACTATTGCT GCCATGACTT CAGTGCAGGT
AAGCTCTTCC TATGCCTTTA CCATTGTATC CACTGATTTA TGGATGGTAG AACGTGTTTA
TCACAGCTCA AGGAGGAACG AAGGCTACAA TGGCTGAACT GGAGAGGCGG AAATTCACAG
CGGAGGAAGG GGTGAGTAAC AACTTTATTG TCTGTTGAAA TTCGATGCTC ACTTTTGATT
GTAGGATCAT TTGACGTTAT TAAATGGTAC GAATGTCCCA TTTTTCACTA CCTGTACCTT
GGGCTGACAC TTTCACCGTA GCGTACAACG CTTTCGCTCG ATATGGTCAA AACAACAAGC
AATGGTGTGG TAACCATCGT CTTAATTACA AAGCTCTCTC GCGTGCCATG TCAATCCGAA
AACAGCTCAA GAAGTATATG GAGAGGTTCA GGATTCCGAT CGTCAGTTGC GAAGGTGACG
CAATCAGGTT GAGAAAATGC TTGGTTTCTG GTTACTTCAA GGTAAGCAAA TTCCCTGGAC
TTAGGCGGTT TCTGTATGTG ATTCGGAATC TGAAACGATG GCTCTACAGA ACGCGGCCAA
GATGATGCCA GACGGGACAT ACAGATCAGC TCGTGAAAAC GCACCTTTAC ATGTCCATCC
TTCATCGGTC ATGTTCACTC GTCAACCTTC CACAGGTTGG GTAATATATC ATGAAGTGGT
GGAGACAACG AAGAGTTTTA TGAGGGATTT GACGGTCATC AATGAGGACT GGCTAGTGGA
ATTAGCGTAA GTGCTTGCGG AAAGGTTATC CGATGTGGAT GCTGACAGTG ATCAATACTA
GGCCTCACTT CTACAAGTTC AAGGGAGGCG GATTGAAGCA GCATTTCTGA TTGCTGAGAA
AAAAAAGGGA GGGCCCTAGG CGATGTACAC TGTCCACTTG TCTACGTTGT AGCACAATAA
ATGCAAA
 
Protein sequence
MPPLQFWKPG TAAPGSSLDR ESEKDGSLLP INTSQNAHLS LDAQRQRLPI YKHREKLLWC 
VEKYPVVIVV GQTGSGKSTQ LPQYLHEAGW TGQNHVVACT QPRRVAATSV ATRVAEEVGS
ILGDEVGYTI RFEDLSHPTR TKIKYMTDGM LFRETMMDPL LSKYSVIMID EAHERGAYTD
LLLGLLKKIM RKRPELRVII SSATIDAEDF LEYFNTNADG TDRSKDDAII VSLEGRMFPV
EVCYLKEPCA DYTQAAVQTV FDLHLREPLG DILVFLTGRE EIDQVIQEVA DRLLSLPKAA
PKLLALPLYA TLPPEEQSLI FDPPPRDTRK VIFSTNIAEA SVTIDGIKYV VDSGFVKIKT
YNPRTCMDVL TTTPCSLASA NQRAGRAGRT SAGKCFRLYP ASILPTTNPS SPMPLTTPPE
LVRSDISLYL LQLKALGIDN LAKFDFMSPP PSEMMIRALE FLFCLKAIDD EGRLTRPMGE
RMAEVPLDPM MAAILLNSHE FRCGEEILTI AAMTSVQNVF ITAQGGTKAT MAELERRKFT
AEEGDHLTLL NAYNAFARYG QNNKQWCGNH RLNYKALSRA MSIRKQLKKY MERFRIPIVS
CEGDAIRLRK CLVSGYFKNA AKMMPDGTYR SARENAPLHV HPSSVMFTRQ PSTGWVIYHE
VVETTKSFMR DLTVINEDWL VELAPHFYKF KGGGLKQHF