Gene CNC00790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC00790 
Symbol 
ID3256540 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp222743 
End bp224767 
Gene Length2025 bp 
Protein Length584 aa 
Translation table 
GC content52% 
IMG OID638255296 
ProductRNA-binding protein, putative 
Protein accessionXP_569380 
Protein GI58264448 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.253852 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGC ACTCTGCACA CAAGGGCAAG GCCCCAGTCA CAGACAAGAA CAATAAGCGC 
AAGAGAGACG CACAGGATAG CAGCCAGGGG GACGCACCCG TCGCTCTCTT TGCTGCTGTC
AAGGATACCG AGTTGGACGA CATTTTTGCA AAGAGCGTAA GTGTCTTGCC CTTTTTTTTT
TCTGCAAAGC GATCGATGCT CATATACCGA CAATCCAGAA TGCCTTTTCT GCACCTGCAC
CTGTTGCCAG TACATCAAAA GCTCATCTCC AGACTACATC TGCTCCCGCA GCCAAAAAGG
GCAAGAAGAG CCCTTCACCC GAAATGGAAG AGGACGTGCG AGTAGACGAT CAGGATGAGA
GCGAGAGTTC CGAGGAAGAC GAAGATGGAG AGGACAATGA GGCCGAGCTT TCCGAGATTG
ATGAAGAGTT GCCGGACGAC GACGACGAGG ATGTTTCTGA TGCCGAGAGT GACAGTGAGG
CGGCCAAGGC CAAGGAGACT AAAAAGACAA AGAAGAGCAA GCTCGGCAAG TATGTGCCTG
CTGAGGAATC GGTACAGGAC AAGAACAGGA GGACTGCGTT CATTGGTAAC CTTCCTATCG
ACGCTGCAAA GTCCAAGGTG AGCTCTATTC TTTGCTTCCC CCCTTTTCGT TTGGATACGC
CAATCTGATA ATCGAAACGC AACAGTCGAC ATTGAAACAA CTCCGAGCCC ACATCATGTC
ATTTGTCCCA TCTGCCAAGA TTGAATCTCT CCGATTCCGA TCTGTTGCCT TTGCCACCCC
TACCGCTGCG CTCCCCACTG AGGATCCTGA GAAGGACGCC AACCAGCGTG CGAAACGAGA
AAAGGAGCGT GCTGCCGCTT GGAAGGCCAA GCAAAACGCT GATGGGGAGG ATGCGGAGCT
TGACAAGGCC AAGGTGTTTA TCGATGCCAA GGGAAAGAGA AAGGTCGCTT TCATCAAAAA
AGACGTGCGT CCCTGATACT AGCTACCCAT CATCTCGCAT ACATAACATT GGCTGACAAG
GTTATTCATA CAGTTCCACT CTGAGATTGA CTCTTGTAAC GCCTATGTCG TCTTTGCTTA
TCCCCACCCT GACCGAGCTG CCAATGTCGC CCCGATCCTC GACCCCTTTG AGGCTGCCGC
CAAGTTCATC GCGTCTGCGA ACAGCAGTAC CTTTTCCGGG CGTACGATCC GCGTCGACTC
TGTCCGCTTA CCTTCTTCTG TCGGTCTCGC CGGCGCGTCC ACTTCCCTGA GCAAGCGGGA
CGCATGGCTG CCTAGTAACA CGGATCCCAA GAAGAGTCTC TTTGTGGGCG GTTTAGACTA
TGCGGCCAAG GAGGAGGATG TCAGGGTGTT CTTTGAAGAG TTGGTCAAGG CTGAGAGGGG
TGCGAACAAG GAGGGGAGCG GAAAGTGGGT TACTGGTGTG AGGATTGTGA GAGATAAGGA
GACTCAGCTC GGTAAAGGTT TCGGTTACGT TCACTTTGCT GTGAGTATAT TCCCGAGGAA
AAGAAGAAAA AGGGTCTTAT GCTGATATTC AATAAAATAG GACCGAGAAA GCGTGGAAGA
GATTCTCGCA ATGGACGCTA AACAAATCAA ATTCGCCAAA CGAACGCTTC GTGTGCAGCC
ATGCAAGACC ATCCCTACCG CCAACACTCT TCAAAACACT ATCAAAAAGA TCGCTGCCGG
CTCTGGCGGC GCTTCAAAGG ACAAGACCAA GAAAGCTTAT GTCCGACCGG GTGTCATCCC
CAAGGGCGAC CCTGCGCTCG GTGACAAGCT CAAGAACCTG TCCAAGGAGG AACGAAAGAC
TATCAAGAGC TCTGACGCTG ATCGGCAGGC GAGGAGGTTG GCGAAGAAGA AGGCCAAGAT
GTCGTTGGAG AAGGACAAGG CGAAGGGTGC GGTCAAGTTG ACGTTGACAA AGAGCGAGAG
GGAGAAGACG AGCGCGTCAA AGAAGCCCAA GGCGAAGAAG GGGAAGAAGA GGGCGCCGTC
TGCGGTTGCA AAGATGAAGG GTTCAAGGGA GTAGACGTGT GTATT
 
Protein sequence
MAKHSAHKGK APVTDKNNKR KRDAQDSSQG DAPVALFAAV KDTELDDIFA KSNAFSAPAP 
VASTSKAHLQ TTSAPAAKKG KKSPSPEMEE DVRVDDQDES ESSEEDEDGE DNEAELSEID
EELPDDDDED VSDAESDSEA AKAKETKKTK KSKLGKYVPA EESVQDKNRR TAFIGNLPID
AAKSKSTLKQ LRAHIMSFVP SAKIESLRFR SVAFATPTAA LPTEDPEKDA NQRAKREKER
AAAWKAKQNA DGEDAELDKA KVFIDAKGKR KVAFIKKDFH SEIDSCNAYV VFAYPHPDRA
ANVAPILDPF EAAAKFIASA NSSTFSGRTI RVDSVRLPSS VGLAGASTSL SKRDAWLPSN
TDPKKSLFVG GLDYAAKEED VRVFFEELVK AERGANKEGS GKWVTGVRIV RDKETQLGKG
FGYVHFADRE SVEEILAMDA KQIKFAKRTL RVQPCKTIPT ANTLQNTIKK IAAGSGGASK
DKTKKAYVRP GVIPKGDPAL GDKLKNLSKE ERKTIKSSDA DRQARRLAKK KAKMSLEKDK
AKGAVKLTLT KSEREKTSAS KKPKAKKGKK RAPSAVAKMK GSRE