Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG03750 |
Symbol | |
ID | 3258888 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 1051236 |
End bp | 1053050 |
Gene Length | 1815 bp |
Protein Length | 427 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257998 |
Product | conserved hypothetical protein |
Protein accession | XP_572080 |
Protein GI | 58269848 |
COG category | [S] Function unknown |
COG ID | [COG3268] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATACTTTTCA TTTATATACC ATGTAGCCAT GAGTCTATAT CATACTCATT CATCTCTGAC CCAACAACTC AGCTTATTGA ACGAAAATGG CCCAAACAAC AAAGCCGGTA CTCGTCATCT ATGGCGCGAC AGCGTACACC GCTCAGCAGT TATTTACGTA CCTTGAGGAG CACCCTGAGG CAGGAGACTT TGACTTTATC CTTGCTGGTC GTAACCAAAC CAAGCTTGAC AAGTTGAATG GGAGTCTCAA GATACAAAGA GAGGTTATCG CCTGCGAGTT GAGTGACGAG GAAGGGGTTG AGGCGATGGT CAAGAGGGGA AATGTCATAG TCAATTTCGC TGGTAAGTAA CTCTTGATTT CACCTTATGC ATTCCAAAAT GTTGATCTAA TTGTTAGGTC CTTACCGATG GCACAATGCT GAAGCCATAA TTCGGTTCGT TTATCTCTTA CCTTGGCCAA CACCACTTCA CAAAGACTGA CCAATACCAA AGTGCATGTT CCAAGGCTGG AAAGCACTAC ATCGACCTTT GCGGCGAATC TGCATGGCTG GCCAAAGACA TCATTCCAAA GTACCATTCG ATCGCCAGCT CCACGGGGGC CTGTATTGTT CCTTCCTGTG GTTTTGATTC TGTTCCTTCG TGAGCAACTT TATCTTAGTT GATTGGAATC AAGCTGATTG GTTGATCTAA GAGACCTGAT CGTGCATCTT GCTAATCAAA CCCTCCAAAC GGTTAGACCC GGATCCACAT TGGCTGACTC GACCTCCATA TTCAAAGTGA AAGGCACCAT CAGCGGTGGA ACTGTCCAGT CTATGATCAC CCTTACCGAG CTTCCCAAGG AAGAGCGAAG GGCCGGTGAA TTCACCCTCT GTCCTGGTAG TAAGTCGATT CCCCTCTTCG GCGACCTCAT CAAGCTCAAT TGTGACAGTC CAACTCCCGT CCACACCTCC TGCGCTTACC TTCTCCCTTC CTTCTACCCC TCTTACTCCC GCTCGTTTTG CATCCTTTTT CTTCATGTAC GTCTACAATC GAACTGTCGT CCGCCGTTCC CAGTTTCTCT CTGGCGCCTT GTCTACAAAG TCCGGTGGTA AGGTTATGAA GTACGCTGAA GGCTTAGACA TTGGATATGG TAAATTCGGG TCAGCCTTAG CAACCATCGG GATGATGGTT TTCGGAGGCC TGTTTTTCGG TTTTAAATGT GTAAGTAAAG AATTTCCTGA CATGTGTAAC GTATCTAACT CATTGCCGCA GCTTAGGAAC ATAATCCTTC GATACTTGCC CAAGCAGGGA GAAGGAGCTC CCCTGGAGTA GGTCATGCAC CTGTGACAAC ATGCGACCTG TGAATGCTAA TAACTTACGA TCGTAGGCAG CTGAAAGCTG GTCACTACCA AGTCACGAAC CTTTCCACTG AGGAATCCTC CGCTCCCGAC CACAAGCCTG TGAAGATCCT CACAAGGTTC GACGGTGAAG GCGACCCTGG ATACCTTAAC ACTTGCTGTA TGTCCTTTTC AACTTCTTAC TATCCGCATC GCTAACAACA TCTGACAGAC TTACTTGCTG AGTCCGCCTT GGCCCTGGTT CTGCCTGCCC CCAAGGGCAC TTCCCGTCCA CCTCTAGCCA AGGCTGGTGG CCTTTTAACC CCTGCGACAG CTATGGGTGA TGTTCTTATT GAGCGATTGA GAAAGAGTGG CAAGTTCCAG ATTACCAGCG AGGTGTTGAG TGAGGAGAAG AAGAAGGATA TCTAATTTCT CAATATTCTT TTGATGAGGA GTTTTAGGTT TTCCACATTC TATGTTTTAC ATATGGTAAT AATAT
|
Protein sequence | MAQTTKPVLV IYGATAYTAQ QLFTYLEEHP EAGDFDFILA GRNQTKLDKL NGSLKIQREV IACELSDEEG VEAMVKRGNV IVNFAGPYRW HNAEAIIRAC SKAGKHYIDL CGESAWLAKD IIPKYHSIAS STGACIVPSC GFDSVPSDLI VHLANQTLQT VRPGSTLADS TSIFKVKGTI SGGTVQSMIT LTELPKEERR AGEFTLCPGI QLPSTPPALT FSLPSTPLTP ARFASFFFMY VYNRTVVRRS QFLSGALSTK SGGKVMKYAE GLDIGYGKFG SALATIGMMV FGGLFFGFKC LRNIILRYLP KQGEGAPLEQ LKAGHYQVTN LSTEESSAPD HKPVKILTRF DGEGDPGYLN TCYLLAESAL ALVLPAPKGT SRPPLAKAGG LLTPATAMGD VLIERLRKSG KFQITSEVLS EEKKKDI
|
| |