Gene CNA05460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA05460 
Symbol 
ID3253284 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1463402 
End bp1466321 
Gene Length2920 bp 
Protein Length558 aa 
Translation table 
GC content47% 
IMG OID638252866 
Producthypothetical protein 
Protein accessionXP_566868 
Protein GI58258911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATATATAGT CTTACAACCA AATATCCTAT ATGGTATAAT AGTACTTCTT CTCCGTCGGA 
CACCGTAATC TTTACACGCT CTCTAAGTCA TATATAAGTG AGTTGTTCAT CCTGCGTCAA
CGACTGTCCT TGCTACCTCT ATCCCCGGAT TCTCGCCCGC CATATATGTA TTTCCACTGA
TAAATCCCAC AGTCACGCTT CCCTTCGGCT CTGCTCTGCT TGCACAGTTT GGATCTGACA
ACAATTTCCA CGCAGCTTCT TCGTCTCTAT AATCCGATAT TTATCGAAGG TGAGTCTGCG
TGAAACTTCT GCAGAAGGTT GGTCGTACAT AGTTGTCATG GTAGCTCTGT TGTGGCCATT
GGTATTCTTT CTTCAAGGCT TAGCCAAGAG CCAAGAGGGT TGAGCTGCCT TGTACCGCGC
TTGGTTCGAG GGGCCAGGCC CAAAAAAGGG AAGGGCAATA ATACTGGTCC AGCCTCGGTC
TTGAGCATTT CTGATGCTAG CATACGTTGC AACAGTGTTT TATTGGCATA ACGGATGGGG
AAGAGTTCTG ACATCGAAGT TAGCAATATT TTTTATCTCT CCCTGTTCTG CCTGTTTACC
GCCCCTACGT GCTAGGTCAC GCAAAGAACT GGATTCAAGC CTCCTGCCGT CAAACCGTGC
CGAACTATTA CGTCAAATCA GCTCTTATAG TTCAATTTTT AGATTGTTCT ATTTTGAGAG
AATTGCCCTA AGGACTCCCC ATTCACGTGT CAATGTTCCC CTCAACAGGC AAAGACTCGA
TGCTCATGGA GATTGCCGGC CCAACGTCCC CACTTTCAAC CAAAAAAAGG AAAAGAATGG
ATCAAACATC CAGCCATCCA TTACCAACAA ACCATGAGGT CGACATGCCC GATATGGCAT
CCAGAACTGG TCGAGAGGGA TGGTAAGGCC GAAGTTGCCT GATCACGGCG GAGAATGTGA
CTTACAAATT CGGTGGAATA GGACATACCA TCTTGAAATC TTACAAGAGC CTCTAAGGGC
CAGGGCTTGT GGTTTTGGTA ATAAGGTGAG CTGTCCACCA TGGTTTTAAA ACTAAACAAC
TCAGGATCGC AGACCCCTTA CGCCCCCTCC TATTATCCGA TTGTGGATTC AAGATGCTTT
AGGAAATCAA GTGGATCCCA AGTAAGTCTG CCGAAACGCT ATTTATTCAT GAAGCATGCT
GAACGTACTC TTAGCATAGT TGATTCCAAC ACCATAATCC TCCAGGTCGA CTTATGTTCG
GCTGACGGGC ACGAAGGAAG AAACGTGGTA AGACATCCAG TGGGACCTGG CAGTGTTCCT
GCAGTTGTCT CAGTGAGCGA AAATGTTAGG TCAGGTCATG TAGATCCTTC GACTCTTCCA
ACTACATCCA CAGCGGTGTC TGAACAGCCC TATAGGAATA GTAAGCATCT CCAGATAAAA
GCAAAATATG GCCTGCTCAC AACTCCTCAA ATTACAGGTA GCATGGACTA TTCTTCATGG
CCTGAGCAAT ATTCCTCGGA ATCTTCCGAC CGGACCACTC CGGGGTGGAC TTATCAAGAT
GCCTTTGTCT CTTCACCACG CATTGCCGGA ACTCGCCCTC CCATGGCTCG ACGAGTACGC
ACTCCCACTC GACCTTCCAC TGCGCCCTCC TTACGTCCTC CGGCATGGAG CGTTGATGTC
TCTCAACGGC CATTGTATGA GGAAGCTCTT CCTCCGATTT CTGCACTTGC AGAAAATATT
CGCGACCCCT CTGGGCGCCC CTGGTTTCCC GACCCCCACA TTAATACGCA AAGGCCGACA
AGTAGCAGTT CTTTGCGGAG CCGCCCTCAC ACATCCCACT CGACTGATTT GAGTACAGCT
CCCACCGACT ACTCATTTGG GAGACCTACG ACCACAAGTT CCACCGGTAG CTGGCATCTA
TCGGCCGATT CAGAGTATAA GGGTTTCGCA CTTGAAAACC AAGCTGCTGA AGGCTCAAAA
CCTGGCCCCG GAGCAAAATC CAATGATCCT AACGTACCTA TATCATCCGC GGAAACTCAA
GCAACATCAC CTGAATCTTT TCTTCCCGGT TCTTTTACTG ACCGCTTCAC TCAAAACGAC
TCGCCCCGTG TACCATACTC GTACCACAAC TCCCATTATC GGTCGGATGT CGCGCCGGGC
AAGGAGCGGG ACTCTTTCCA TAGTCCAGAC CTAGATACTA TGAAGGAAAC CCGCTGTTTC
GCACCAGCAA GCGTTGCGTC GACTTTGGTT TTGGTTGGGA AACGTCACAC CCCTTGTAAC
AAACTAAAAG ATGAGCATGG ACGGTTAGGT CTATTTTTCT TCGCCACGGA TCTGGGAGTG
CGGACAGAAG GGAGATTTTG CCTGCGAATG AAGATCATGG ATCTAAGCCT GTACGTATCA
AAGTTCAAAA ATGATCCAAT TAACATCTTT TCCTCAGCTT CTTACGGCCT CCTAACCCAG
GCGATAGCAC GCCCATATTG GCGGAAACTA TCAGTCAACC AATAGAAGTG TAGTAAGTCC
ATTGAAAAGC AGGACCTATT TTGCCGGTGA ACTAATTTCG ACTACTGTCG TTGTGCTTGT
GTACGACAGT TCTGCAAAAC GCTTTCCTGG AGTCATACCT ACTACTAAAC TCACTCGTTT
GTTTGCTGCT CAGGGAATCA AATTGGCTGT CAGAGAAAGT CATAAACAAA AGCACCGCAG
CAAAGAGAAC GTAGATATGG ATGTTCAAGA TGAAATCGAG GAAGATGAAG ATGGTCAATG
AGACATGGGG GATTACGATT GTGCGAGTTG CGCCGTGCGA TGGGGTGTTA AGAAATTAGT
AATATTGTAG AAGTAATATA CCGACGTTTA ATCGCAAATT CATACTGTTC AGATAATTAA
ACTTTCGATA ACCAGGGTAA ACATGAAGGC AGGGCATAAT
 
Protein sequence
MFPSTGKDSM LMEIAGPTSP LSTKKRKRMD QTSSHPLPTN HEVDMPDMAS RTGREGWTYH 
LEILQEPLRA RACGFGNKDR RPLTPPPIIR LWIQDALGNQ VDPNIVDSNT IILQVDLCSA
DGHEGRNVVR HPVGPGSVPA VVSVSENVRS GHVDPSTLPT TSTAVSEQPY RNSSMDYSSW
PEQYSSESSD RTTPGWTYQD AFVSSPRIAG TRPPMARRVR TPTRPSTAPS LRPPAWSVDV
SQRPLYEEAL PPISALAENI RDPSGRPWFP DPHINTQRPT SSSSLRSRPH TSHSTDLSTA
PTDYSFGRPT TTSSTGSWHL SADSEYKGFA LENQAAEGSK PGPGAKSNDP NVPISSAETQ
ATSPESFLPG SFTDRFTQND SPRVPYSYHN SHYRSDVAPG KERDSFHSPD LDTMKETRCF
APASVASTLV LVGKRHTPCN KLKDEHGRLG LFFFATDLGV RTEGRFCLRM KIMDLSLFLR
PPNPGDSTPI LAETISQPIE VYSAKRFPGV IPTTKLTRLF AAQGIKLAVR ESHKQKHRSK
ENVDMDVQDE IEEDEDGQ