Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG01960 |
Symbol | |
ID | 3258739 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 550111 |
End bp | 552205 |
Gene Length | 2095 bp |
Protein Length | 432 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257814 |
Product | conserved hypothetical protein |
Protein accession | XP_571891 |
Protein GI | 58269470 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGACCTATA TAAAACCGTG ACCTTCCAAC TCGATTGTTC CCCGTATCAA AAATAAGACA CTTGCAGGAC AACAAAGTGA AGGAAACAGT AATCATAATT CACGAAAAAT GATGTCTTCC ACTGCTACTT TAGTTGACGA AGACTTTTCT GTTGGCACAC TCAAACTCAA CGGCGCTCAT GAGGTTACCG ACGTTCAGGT TAGCTCTTCT CCCGAAGTAG AGAGGACGAA GAAGAATGAA AGCGTTGCTG TTGACCCATT CAACTACGTG GTGAGTTGGT TAATTCATTA TTTCTCTCTC TGATACCCTA CTATCATTTG GTGGCCGACC TTGCCAATAA ATTGCTGACA CAGCGCGACG CATCATTTAG GGAGAAGTCT TGGGCGCCGG TCCTGGAGCA GACTACCCAT ATGCCGAGTT TTTGCGTAAG TCGATCCCAT TCATACAGTT GCAAAGGAAA GATCGATCTC CGAATACTAA TCATACATGA CGGCATCTTG GACAGCCCAC AATCCTCCCC GTACTCAATC CGACCCTCCG CTCCCTTTCT TTGACATTGA AGATCGCGGT CACCGTGCCG ATCCTAACGT GGCTAGACTT CGCGCTTTCG TCGAGGCTAG GGGTGGCAGG TTGAAAGATA TGTTAGTGGC GATTGGGACT GTTGTTGAAG GAGACGTGAA GCTGGAGGAT CTCGGAGAGG CTGAAAAGGA TGATCTGTAA GTTTTGATTA GATTCTCCTT GCTGGACACT CGGAGTTACA TCCTGACCAT TCAACTATCA GTGCTTTGCT TGTTGCACAA CGGGGTGTAG TCTGTACGTT GCTGTCACCC CTTATTCTTT GATGTCTGAT CATAGTTACT GATGACTTCA TGGTGCAATA GTTTTTAGAA ATCAGCAATC TATGACTATC GAACAGCAAC GTGAACTCGG GAAGCACTTT GGTCCTCTCC ACAAGCACGC CACCTATGCA ACTCCTCGTC GAGGTGACTT GGATGATGTT GTTGGTGAGT TCAATCCGTT TGTTTCTCAG TTTGCTGTTT GCTGATTTGC ATGCGCAGTT GTCTATTCTG ATCGAGACTC TAGGCCGGAC CTTTATGCCT TCTCTCGAGC TGAGCTTTTC CACTCTGATG TGACCTACGA GGTCCAACCT CCTGGGACTA CCATGTTGCG TCTTCTGACC ACTCCTGAGG TTGGAAATGA TACTCTTTGG TCCTCTGGGT GAGTATCTCT GCAGCATTCA ATGCTTTACA TGCTCTGACG ACGGTACAGT TATTCCGTTT ATTCTTCTCT CTCCAAGCCT TTCCAACAAT ACCTCGAATC TCTCTCAGCC ATCCATTCAG GATTTGATCA AGCCTCCTCT CGAACTAACT TCTCCAAGAT TCCACGTCGC GAGCCTATTG AGACTATCCA TCCGGTTGTC CGTGTCCACC CTGTTACTGG TATGAAGTCG GTGTTTGTCA ATCCTGGTTT TGTCACTAGG TTGGTAGGCG TACCGAAAGC AGAAAGTGAC ATGGTTCTCT CTTTCTTGAA GGACTGTTTT GCTCAGCAGA CTGACGCCAC AGTCAGGTGG AGGTGAGTCA CCATGTGCTG TTGCTCCTCG ACAGCCTGGT TTGTTAACAT TGTTTGGATG GTAGCTGGGC GCCTGGAGAC GTCGCAATCT GGGATAACCG TAATGTCAAT CACTCGGCTA CGTTTGATGC CTACGTAGGT TTTAATGATC GCTATTTGAC CCCAGCTCAC CAGCAGCCCA GCCCTCCCTA CGACACGGTC TCCGAGTCAC TGCGCACGGC GAAAAACCCC TTTCTGTGGA GGAGTATGAG GAGATTTATC AAAAGCCAGC CAAGGACTGG CTTGAGGAAA GATTCAAGAC ACTTGGTATA ACTGGTCCTG CCCGAGATGA CGGGAAAACC AAGAAGAAGG CTTTCAGGGA TTAGGAGTGC AGAGACGCCA AGCGAGAAGA AGGGGATACG CATAGAGGCC TCAACGGCTC AGAATCATCA TTTGGCATGG CCTTAACGTC GATTACCAAT AAAAATACAG TGTTTTAATA GTTGTTACAT ATACAAATGG AAAATTGTGA TAGTATTCCT TTTTT
|
Protein sequence | MMSSTATLVD EDFSVGTLKL NGAHEVTDVQ VSSSPEVERT KKNESVAVDP FNYVGEVLGA GPGADYPYAE FLPHNPPRTQ SDPPLPFFDI EDRGHRADPN VARLRAFVEA RGGRLKDMLV AIGTVVEGDV KLEDLGEAEK DDLALLVAQR GVVFFRNQQS MTIEQQRELG KHFGPLHKHA TYATPRRGDL DDVVVVYSDR DSRPDLYAFS RAELFHSDVT YEVQPPGTTM LRLLTTPEVG NDTLWSSGYS VYSSLSKPFQ QYLESLSAIH SGFDQASSRT NFSKIPRREP IETIHPVVRV HPVTGMKSVF VNPGFVTRLV GVPKAESDMV LSFLKDCFAQ QTDATVRWSW APGDVAIWDN RNVNHSATFD AYPSLRHGLR VTAHGEKPLS VEEYEEIYQK PAKDWLEERF KTLGITGPAR DDGKTKKKAF RD
|
| |