Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG04340 |
Symbol | |
ID | 3258552 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 1226566 |
End bp | 1228522 |
Gene Length | 1957 bp |
Protein Length | 544 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258058 |
Product | transcription factor iiia, putative |
Protein accession | XP_572108 |
Protein GI | 58269904 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0235343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACCTCC ACTTTTTCGC GGTAGGGGGG GACCCTTCAG GCAACAGTTT TCGTATTCTA AATCCTCGAC ATCTTCGCTC TGCAACTTCA GTTATAACTG GCTCATCTCC GCATAGGATG ACGCTCGTCC TAAAACCACC GTCACTGGGT GACCTTATGG TATTTGAGGA CTTTCTCACT CATCCTACTC ATCACCAACC CAAGTCCATG GCTGAGAGAA AACATGACTG GCCAAGTCGG ACGGAAAAGA GATACAAATG TGCCTATGAA GGATGCGATA GAGCTTACAC CAAACCATCA AGACTTGCTG AGCATGAGAT GACCCATCGA AATGAGGTGC GTATCGTTTG TGCCCGACTC TGTAATTAGC TGACGGAATA CCCAGCGACC CTTCGCCTGT TCACAATGCC CTCGGACATA CTTCAGAGAA GACCACCTCA AAGCCCATGC GCGCACCCAT TCGAACGTAA AGATCAAGCC CTTCCCGTGT ACCCGTGAAG GCTGCAAACA GTCTTTCTGG ACTGCATCAA AACTTCGTCG ACATGAAGAG GTTCACGATA AGGACGGTGC CTATCCTGTG AGCTGAAGCA CTAGCGTCTA TATATTGTCC GATGCTGACG CCGGCCACAG TGTGACAAGT GTGAGGCTGC GTTCAACAAG CACCACCTTC TCCGAGAACA CGTCGCTGTA GCCCATATGC CTCCCGGTAC CAAACCTTTC ATCTGCACTC ATGAAGGTTG CAGCGCTTCT TTTGCAACGA AAGCTCATCT CAAGAATCAC GAGAAGACCC ACGACGGTGC GTTACCTTTA GTTGTCATTC CCTTGCCATA CTGATATAGG CTTTAGAAAG ACGGTATATT TGTTCTCACC ACGATCATGG CGAAGATTTC CCCAAATTTT CCAAGTGGAC AGAACTTCAA AAGCACATAT CTACGGAACA CCCGCCCACA TGTCCTCATC CTGAATGCAA CGGTCGAATT TTCAAGAACA ATCAACGTCT GAGGGATCAT TTGCGTGTAC ACGCGGATCA ACAGGCCGAC AAAGCTGCGC TTGCGGACCG ACGCGAGGAA GAAATGCCAC AATTGCTGTT AGAAGGGTTG GGCAAAAGTA GAAAGAAGAG GAAATCTTTT GCGCAACGGG AGGCAGAGGA CAACGGACCT AGGAAGTTGA GAAAGATTCT CAACGGTGAT GCTGGAAAAG ATTGGGCTTG TGAGCATGAG AGCTGTGACA AAAAATTCAA ATCTGTCAGT TCCTTTCTTC ACTGACTTCA TCTTTGCTGA CTATCTTGCA GCGATATGCT TTAGAAACCC ATATCAAAGT TGTCCACCAA AACATCCGTG AACATGTTTG CCCCCGTGAA GGGTGTGGCA AAGCGTATGC CTACAAAACC AATCTGAATC AACATCTCGC CAAACATAAT TTATACGCGG GACCTTCGAA AACCGCGACA TCTGAAAGCG GGATGTTGAC TGGGATGGTC AAGGAGATGA GGAGATTCAT CTGTCCTGCA TGGGCATTAG GCGTCTTTCC AGAGAACGGG GATATGATTG TCACTCCACC GCGGCCTGAG CTTCTTACGG AGGACAATAA TGGTAATGAA CAACTTCAAT CTATCGTCAG ATGCAGAGAC CCTGCACCGG AGTCAACCAC TACCGCCAAC ACTTCAGCAA TACCAACACA ATCGACACCC GAAGATTTGA TTGGCAAGAG GTGTATCCTT CGATTCTGGA GAGTGTACGA CGTTCGACGG CATCTCAAGT CAGAGCACCG TGTCGAGCTT GACGATATGC AAATTAGGAG ATTGTTGCTC AGCACTGGTC AAACCGGGGA ATAGAATGTT AGAATGTAGA TGGATAGAAG GAAAGATGGG ATAATGTATT AGTAGTAGAT ATTACTCGTA TTGTTGTAGT TACTGGCTAG TAGGATTTGT ATTTCCGACT TTTTGCTGTG GTTTTTT
|
Protein sequence | MYLHFFAVGG DPSGNSFRIL NPRHLRSATS VITGSSPHRM TLVLKPPSLG DLMVFEDFLT HPTHHQPKSM AERKHDWPSR TEKRYKCAYE GCDRAYTKPS RLAEHEMTHR NERPFACSQC PRTYFREDHL KAHARTHSNV KIKPFPCTRE GCKQSFWTAS KLRRHEEVHD KDGAYPCDKC EAAFNKHHLL REHVAVAHMP PGTKPFICTH EGCSASFATK AHLKNHEKTH DERRYICSHH DHGEDFPKFS KWTELQKHIS TEHPPTCPHP ECNGRIFKNN QRLRDHLRVH ADQQADKAAL ADRREEEMPQ LLLEGLGKSR KKRKSFAQRE AEDNGPRKLR KILNGDAGKD WACEHESCDK KFKSRYALET HIKVVHQNIR EHVCPREGCG KAYAYKTNLN QHLAKHNLYA GPSKTATSES GMLTGMVKEM RRFICPAWAL GVFPENGDMI VTPPRPELLT EDNNGNEQLQ SIVRCRDPAP ESTTTANTSA IPTQSTPEDL IGKRCILRFW RVYDVRRHLK SEHRVELDDM QIRRLLLSTG QTGE
|
| |