Gene CNN01490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN01490 
Symbol 
ID3255518 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp432474 
End bp435801 
Gene Length3328 bp 
Protein Length876 aa 
Translation table 
GC content50% 
IMG OID638254564 
Producthypothetical protein 
Protein accessionXP_568619 
Protein GI58262418 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCTC CAGCCGAAAA GCAGTTCAAG GCAAAAAATT ACGGCCCCTA CGAAGCCACC 
TCACCAAAGT CTTTGAAACC TCCATCGCTC CCCCACAAAC CGAAGTCGAA CAGCTCGAGG
CCATCACAGC CGCCCACACC CACAGACACG ACCACGGATG CTTCCGGGCA ATCCACGTAC
AAGGTTGCCA GAGCAATTTC CAGTTGTACT CGATGTAGAT CACGAAAGCA AAAATGTGAT
GGTAAGCTTC CAGCATGTAG TGCATGCGAG AGAGCCAAAG TGGAATGTAT CGGCTTTGAC
GCGATAAGCA AAACAAACAT CTCAAGAAAG TGGGTTATAA CTTTTATATG GGTTGTATCC
AAGGATGAAC TGGAACTGAC ATCCTAGAAT AGCTATCTGC ATCAGCTTGA ACAAGAAGTC
GCCTCCTTAC GTGCCCAAGT CGCTGCTCTC ACTTCAAACG ATCCCATTGG TCGCACGAAA
AAAGGAATTG CTGCCAGTGC CTCTCAGGCG CTTTCGGCCT ATCGCTCTGA CTTCCCTATT
GATCCATCCC TCCCCGATGA TCGATCCCCA CACTCTTCAT ATGACTCTTC CATCCCTGTT
GGATCCTCGT CCTATTCTTC TCAACATCAA CGACGACTTT CGGATTTCCC CTTTTCATCC
ACCCCCGCAA ACGACTCACC CCGGTCGACT CTTGCTCATG CGTCTACGTC AAGGCCGCAT
CCTACAGGTC AAGCTGCGAG TCTTCATGCA ACATCCCTTA CTAGACTGGT ACATGATGCT
GCGCTCAGAA CGGGCCATGC AACCAACGGA GGCGGGTCGT TAAACCCATC TGGTTCCAAC
GCCTCAAGTA GTGATCGTGG GTCTGTCCAT GGTGGAGTCG ATTCGCCCAT TTTAGGGATT
GATCGTGAAG AGCCTCGTAC CTCGCCTGAC ATGGCGTTTA CGCCTCTTTC TGGCCATAAG
CGTCACCCGA GTGCATCACC TCTTGCTGCT TCTGCTACGC TTTCAGTGTC GAGTGGCGGC
AGACCCAAAC GCAAAATGAC CATTCCACCT TTGCCGCCTC AGCCAGCTGT CGAAAGGCTT
GTTGCAGCCT ATGTCGATTT TGTCGGTGTT ACGGGGCCTA TCATCCATAT TCCGACTTTG
GGCAAGCAGT TGGTGAAGAT TCGTGAAGGA ACGGATGTGG AGGAAGGCGA CATCTTTGTT
GTAATGATGG TGCTCGGTGA GTCTTTGTGA GGCTTCACAC GACTGAAGAT TCCTTCTCAC
AAATATTAGC GTTGAGTACT ATGGCTTCTT CGAGGTTTGT CGACCCGCCA GAAGAATTGC
GAGCATGTTC CGAGGCTTTC CACGCTGAAG CATTGAAACA CCTTGATGCT GTCTTCGAGG
AACAAAGTTA TGGTATGTTC TCGTCTTATG ATCTCTCTTC TCCCATCTAA TATCACTTTG
CAGTCAGTCT CCAAGCTATT CTCCTTCTAG TGTGGTATTC CCTTCTCAAT CCCGATAAAG
GCTCCATTTG GTTCCTTGTC GGCCTTGCCA CTCGAGCATG TGTCGATCTC GGATACCACA
ACGAACATAA TGTTCAGGTC GATCAGCTCG ACGCGCTTGA ACTGGATATG CGACGAAGGC
TGTTTTGGTG TACATACAAA ATGGACCGGC TGTTGTCACA GTCTTTGGGG AGACCGCCTA
GCATCCCCGA TGGTTTCATC AATGTCCCTG TAAGTTACCT TGCGAGCGAT CGTTGAGGGG
ATACAGCTAA CCATGGTCCA AGCTTCCATC CAATCTTCAC GATATCGACA TCCACGCTGG
GCATTACGGC GCGCTCATTG GCGAACCATG TTCCTACAAA GCTGTTTTCC TCCATACCAC
CAAGCTTCGT CAGCTCCAGT CTGAAATTCT TTTCAACACT TACGGTGTGC ATGGGTCAAC
TGGAATATTA CCGTCGGATG AATGGTTTGA TGATTGTTAC GGCAGGCTGA AACATTGGCT
CAAGACTGCT CCGGAGCCAA GAGGAACTGT TTCGACAGAA GGGTTTGAGC TCTCGTTCCA
CAGTAAGTGA TTAATGGCAA GCGCACGATT CTATTCACTA ACGTGAGCAG ACTCATGCTT
GTTATTATTC CGCCCTTCGC CTGGATGTCC TCGACCAGGA AGAAAAGCCC TCGCGACGGT
ACTCACGAGC TCGGGATATA TCATCCGGAT CTATAGGAGA ATGCAGCTGA ATAATCGAAT
CAGTTGGCTC TGGATGACCG TAAGTAACTC TCGAAAGACG CTATCTAAAA GTGCACCGCT
GACATGGAGC AGAGTCACTT TTCATTCATG GCAGGGTTGT CGTTCCTGTA CGCTTATTTC
AATTTATACA GTATGGGTGG AGGGCATGAT GTGCCTTCGA TTGATGACGC GATGATGGAT
ATTGAATCGT GTCTCGGCGT GCTCGAGTTC CTTGCTCGTA AGTTTTACAT CGACAACGAG
TCTGCTTCCA AACTCATGTA TCTCATAGCT CGAGTGCCCT CGGCAGCTGC ATGCCACGAC
ACTATGCGTA CTTTGTCTCA GGCGATCTTC AAACAGCTCT CCGATCTTGA TCCCCCTCAT
ACTACAGTCT CGTCATCCCC CAACCGCCGT TCCTTGCTTC GTGGTGCGCC AATCTCCTCA
TCTCGAGAAG ACCCTCTTCC AAACGTTTTC CCTGCGCCTT TGCCTGCTGT GGCTCTACCG
TACGAGCTTT CGTTACTCGA CAACTTGTTC CGAAACCCCA TGGCCTCGCA TAACAAGGCG
TCCGACTATA CCAGTAACAA GTTGTGCGGC AAGCGCAAGG CCGAAGCTGG GGGGTATCAC
CACTCTACAC CCTCAAGTAT GCCCTCAAGG GCATTCCACT TGGCTGGCGA GTATCCTGCA
TCGCAGTCAG TTTCTGGCCA TGGGAATGGG CCTACTGCAG ATCCTAGTAC TGTGATGGGT
ACGTTTGGTA TGGCCGCTCA CCTGAATTTG CATCCGGAAA GCAACTTGTC CGTATTCCCG
CCTTCTAGCG GTCCAAATTT GTCGACGTTG GCCAACACAG CTGCTGCCGC TGCCGCGGGT
TCATTCGCAC CAGCTGGGGA AGGATCTAAT GGAGCCATGC TGAATGTGGA TAATGAACAG
AATGGGTTTG ATATATTTAG CTTTTTGATG GACGATGAGG GAGGATTGGG AACGAATGGG
TTTTCTTTGG ATGTGCCTGC GGATTTCTCC TTGTGGGGCT AGGCTGGAAC TGGGCTGCGG
ATAAGAATCA TGTATAGGTG TATCAATGTA TTAGTAGTAA TTATAGACGT GTTCTTGATT
TTGGGATTTT GTAATCATGG ATGGAGTT
 
Protein sequence
MKAPAEKQFK AKNYGPYEAT SPKSLKPPSL PHKPKSNSSR PSQPPTPTDT TTDASGQSTY 
KVARAISSCT RCRSRKQKCD GKLPACSACE RAKVECIGFD AISKTNISRN YLHQLEQEVA
SLRAQVAALT SNDPIGRTKK GIAASASQAL SAYRSDFPID PSLPDDRSPH SSYDSSIPVG
SSSYSSQHQR RLSDFPFSST PANDSPRSTL AHASTSRPHP TGQAASLHAT SLTRLVHDAA
LRTGHATNGG GSLNPSGSNA SSSDRGSVHG GVDSPILGID REEPRTSPDM AFTPLSGHKR
HPSASPLAAS ATLSVSSGGR PKRKMTIPPL PPQPAVERLV AAYVDFVGVT GPIIHIPTLG
KQLVKIREGT DVEEGDIFVV MMVLALSTMA SSRFVDPPEE LRACSEAFHA EALKHLDAVF
EEQSYVSLQA ILLLVWYSLL NPDKGSIWFL VGLATRACVD LGYHNEHNVQ VDQLDALELD
MRRRLFWCTY KMDRLLSQSL GRPPSIPDGF INVPLPSNLH DIDIHAGHYG ALIGEPCSYK
AVFLHTTKLR QLQSEILFNT YGVHGSTGIL PSDEWFDDCY GRLKHWLKTA PEPRGTVSTE
GFELSFHRHD VPSIDDAMMD IESCLGVLEF LAPRVPSAAA CHDTMRTLSQ AIFKQLSDLD
PPHTTVSSSP NRRSLLRGAP ISSSREDPLP NVFPAPLPAV ALPYELSLLD NLFRNPMASH
NKASDYTSNK LCGKRKAEAG GYHHSTPSSM PSRAFHLAGE YPASQSVSGH GNGPTADPST
VMGTFGMAAH LNLHPESNLS VFPPSSGPNL STLANTAAAA AAGSFAPAGE GSNGAMLNVD
NEQNGFDIFS FLMDDEGGLG TNGFSLDVPA DFSLWG