Gene CNG01960 details


Gene Information

Locus tag: CNG01960
Symbol:
ID: 3258739
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 550111
End bp: 552205
Gene Length: 2095 bp
Protein Length: 432 aa
Translation table:
GC content: 47%
IMG OID: 638257814
Product: conserved hypothetical protein
Protein accession: XP_571891
Protein GI: 58269470
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID:
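
The gene length listed above agrees with the start and end coordinates if both are treated as 1-based and inclusive. That convention is an assumption here, and the record itself does not state it. A minimal sketch of the arithmetic, with `gene_length` as a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(550111, 552205))  # 2095
```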


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GAGACCTATA TAAAACCGTG ACCTTCCAAC TCGATTGTTC CCCGTATCAA AAATAAGACA 
CTTGCAGGAC AACAAAGTGA AGGAAACAGT AATCATAATT CACGAAAAAT GATGTCTTCC
ACTGCTACTT TAGTTGACGA AGACTTTTCT GTTGGCACAC TCAAACTCAA CGGCGCTCAT
GAGGTTACCG ACGTTCAGGT TAGCTCTTCT CCCGAAGTAG AGAGGACGAA GAAGAATGAA
AGCGTTGCTG TTGACCCATT CAACTACGTG GTGAGTTGGT TAATTCATTA TTTCTCTCTC
TGATACCCTA CTATCATTTG GTGGCCGACC TTGCCAATAA ATTGCTGACA CAGCGCGACG
CATCATTTAG GGAGAAGTCT TGGGCGCCGG TCCTGGAGCA GACTACCCAT ATGCCGAGTT
TTTGCGTAAG TCGATCCCAT TCATACAGTT GCAAAGGAAA GATCGATCTC CGAATACTAA
TCATACATGA CGGCATCTTG GACAGCCCAC AATCCTCCCC GTACTCAATC CGACCCTCCG
CTCCCTTTCT TTGACATTGA AGATCGCGGT CACCGTGCCG ATCCTAACGT GGCTAGACTT
CGCGCTTTCG TCGAGGCTAG GGGTGGCAGG TTGAAAGATA TGTTAGTGGC GATTGGGACT
GTTGTTGAAG GAGACGTGAA GCTGGAGGAT CTCGGAGAGG CTGAAAAGGA TGATCTGTAA
GTTTTGATTA GATTCTCCTT GCTGGACACT CGGAGTTACA TCCTGACCAT TCAACTATCA
GTGCTTTGCT TGTTGCACAA CGGGGTGTAG TCTGTACGTT GCTGTCACCC CTTATTCTTT
GATGTCTGAT CATAGTTACT GATGACTTCA TGGTGCAATA GTTTTTAGAA ATCAGCAATC
TATGACTATC GAACAGCAAC GTGAACTCGG GAAGCACTTT GGTCCTCTCC ACAAGCACGC
CACCTATGCA ACTCCTCGTC GAGGTGACTT GGATGATGTT GTTGGTGAGT TCAATCCGTT
TGTTTCTCAG TTTGCTGTTT GCTGATTTGC ATGCGCAGTT GTCTATTCTG ATCGAGACTC
TAGGCCGGAC CTTTATGCCT TCTCTCGAGC TGAGCTTTTC CACTCTGATG TGACCTACGA
GGTCCAACCT CCTGGGACTA CCATGTTGCG TCTTCTGACC ACTCCTGAGG TTGGAAATGA
TACTCTTTGG TCCTCTGGGT GAGTATCTCT GCAGCATTCA ATGCTTTACA TGCTCTGACG
ACGGTACAGT TATTCCGTTT ATTCTTCTCT CTCCAAGCCT TTCCAACAAT ACCTCGAATC
TCTCTCAGCC ATCCATTCAG GATTTGATCA AGCCTCCTCT CGAACTAACT TCTCCAAGAT
TCCACGTCGC GAGCCTATTG AGACTATCCA TCCGGTTGTC CGTGTCCACC CTGTTACTGG
TATGAAGTCG GTGTTTGTCA ATCCTGGTTT TGTCACTAGG TTGGTAGGCG TACCGAAAGC
AGAAAGTGAC ATGGTTCTCT CTTTCTTGAA GGACTGTTTT GCTCAGCAGA CTGACGCCAC
AGTCAGGTGG AGGTGAGTCA CCATGTGCTG TTGCTCCTCG ACAGCCTGGT TTGTTAACAT
TGTTTGGATG GTAGCTGGGC GCCTGGAGAC GTCGCAATCT GGGATAACCG TAATGTCAAT
CACTCGGCTA CGTTTGATGC CTACGTAGGT TTTAATGATC GCTATTTGAC CCCAGCTCAC
CAGCAGCCCA GCCCTCCCTA CGACACGGTC TCCGAGTCAC TGCGCACGGC GAAAAACCCC
TTTCTGTGGA GGAGTATGAG GAGATTTATC AAAAGCCAGC CAAGGACTGG CTTGAGGAAA
GATTCAAGAC ACTTGGTATA ACTGGTCCTG CCCGAGATGA CGGGAAAACC AAGAAGAAGG
CTTTCAGGGA TTAGGAGTGC AGAGACGCCA AGCGAGAAGA AGGGGATACG CATAGAGGCC
TCAACGGCTC AGAATCATCA TTTGGCATGG CCTTAACGTC GATTACCAAT AAAAATACAG
TGTTTTAATA GTTGTTACAT ATACAAATGG AAAATTGTGA TAGTATTCCT TTTTT
 
Protein sequence
MMSSTATLVD EDFSVGTLKL NGAHEVTDVQ VSSSPEVERT KKNESVAVDP FNYVGEVLGA 
GPGADYPYAE FLPHNPPRTQ SDPPLPFFDI EDRGHRADPN VARLRAFVEA RGGRLKDMLV
AIGTVVEGDV KLEDLGEAEK DDLALLVAQR GVVFFRNQQS MTIEQQRELG KHFGPLHKHA
TYATPRRGDL DDVVVVYSDR DSRPDLYAFS RAELFHSDVT YEVQPPGTTM LRLLTTPEVG
NDTLWSSGYS VYSSLSKPFQ QYLESLSAIH SGFDQASSRT NFSKIPRREP IETIHPVVRV
HPVTGMKSVF VNPGFVTRLV GVPKAESDMV LSFLKDCFAQ QTDATVRWSW APGDVAIWDN
RNVNHSATFD AYPSLRHGLR VTAHGEKPLS VEEYEEIYQK PAKDWLEERF KTLGITGPAR
DDGKTKKKAF RD
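
The sequences above are printed in groups of 10 residues, 60 per line, so the spaces and line breaks need to be removed before lengths or base composition can be checked against the Gene Length, Protein Length, and GC content fields. A minimal sketch of that cleanup follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and GC content is assumed to mean the percentage of G and C bases over the full gene sequence, which the record does not define.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GAGACCTATA TAAAACCGTG ACCTTCCAAC TCGATTGTTC CCCGTATCAA AAATAAGACA"
seq = clean_sequence(first_line)
print(len(seq))         # 60 bases on a full line
print(gc_percent(seq))  # GC percentage of this one line only
```

Passing the whole gene sequence block to `clean_sequence` instead of one line gives a string whose length can be compared with the Gene Length field, and the same function works for the protein sequence block.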