Gene Information |
Locus tag | CNG04450 |
Symbol | |
ID | 3258924 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 1266933 |
End bp | 1269739 |
Gene Length | 2807 bp |
Protein Length | 789 aa |
Translation table | |
GC content | 52% |
IMG OID | 638258069 |
Product | forkhead transcription factor 3 (freac-3), putative |
Protein accession | XP_572127 |
Protein GI | 58269942 |
COG category | [K] Transcription |
COG ID | [COG5025] Transcription factor of the Forkhead/HNF3 family |
TIGRFAM ID | |
|
|
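The Start bp, End bp, and Gene Length fields above are consistent with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption inferred from the values, not something the record states. A minimal sketch of the check, using a hypothetical helper named `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates.

    `gene_length` is an illustrative helper, not part of any database API.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1266933, 1269739)
print(length)  # 2807, matching the Gene Length field
```

Strand does not affect the result: a gene on the minus strand spans the same number of bases as its reported coordinates.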
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.198611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGGAG CTTTGACTAA GTCATACGGT AACCCCTCTC CAACACGTAC ATCCCCTCAA AAAACATCAT CATCCCGCTT CTACCCCACT CCTGCCTCCA TGTCGCCAAA CACCACTTCT AGCTCCTCCA CCCTCAACTA CCCTCCCTCT TTTGACCCTC ACCACACCTC TTCCCGATAT CCGTTGACAA CTCATTTTGC CCCTTATTCC CCCTACCGAC CTGGCGAATA TGGTCGGTCT GAACGGGGTT TGGCAAGTGA GAGGACGAGA ATCGAGAGGA AATTGTTTGC GGATGGGCAA GAGGAAGATG GGGAGGTAAG AAGAGGTAGA GGAAGCTCGC CTTTGGCGCC GGCATTCGAA GCAAAGATGG TTTTGCGAAA CGGTGCAAGC ATTGATTTGA TCTCTTGGTG AGTATCTGTC TCCCCCCTTG GACCTTTTGA CTAGAGCTGA CTTCCCACTC AGGCTCCGAA CAAACTTCCA TCATGTCCCG CCGCCACACA CACCTCTAGT TCCGTCCATG CCGCTTAATG GCATCCGTCA CCTCATCCTT GAACGTTTTC CCCGCGCGCC TGAAGCCCAA GAGATCCACA AGGCCGTCCT TGCCGCTTTC CCCCATTCTC AGTGGGACTA TCCCCCTCCT GAATCGTCTG AGCCTCCTAC CATCCGAGGT CTTATCTGGC ACGGGAAAGA TATCGTCAAC GATGATGAAC TCGATGATAG ACCCCGCTCG GCGCCCAAGG ATGTTGCAGC GCATACCCGC AGTAGACTAT CGGGAGATAT CATTACGACC GCAAACTCAA ACTCGACATC GAAAGGAAAA CAGCCGTCAA GGAGCCCGAC GCTATCTACG CTTGTTTCAC CCATCTCTAG CTCAAAACGA CACTTGCCGG ATACCCCTGC ACGTTCAGTG CTGGAAGAAT TTGCAGAGAT TGCCACATTG GCGGAGAAGA CACCTGTTAG CCGTACGAAA TCCTTACCCG GAGAACCGCT CGCAGCATCC CCGGAGCAAG AAGTAGCGCA TCTTTCCCAT AACCCGTTTG AGCAAAGTAT GCTTCTCAGA GGCCGAGGAA AACGGCGAGC AAGCCAATCG CCTGAGCGAG GACATCGTAG ACGCGCAAGT ACACCAGACA AGCTTCACGG CTTGTTGGCA GCTGCCGAAG CAGTGGAAGG ATCACCCATC ACTTCTGTTC TCGGCCACAA ACGTCGAAGG ACTATCGGCG GTCCCGCACC GGCAAGGGAG ATCATGTCTT TTCCTCGACG GGCAATGTCT TCTCGGGGAA CCATGTCACC TCCTCCCACT CGCGGACTGG CGATACTACC CCACGTCGAA GAGAATATCG ATTATCTCGT CCCGTTAAAC GATACTGCCT CTGCCGGTTC AAGGATCTCG GAGGAAGATG CAGCTTCAAA TGCGTTACCT TCAATTGCGG GTGCGTCTAC AGCGAGAAGG GCACCGACGT CGTCTTCGAC GGCATCGCAA TCATCTGCGA TTTCGCACCA TCTTGTGTCT GCGCCGATTG CAGGGTCTTC TGGGTCTATG CACTCTTACC CACACGACCA CATCCGTGGT CACGGCCACT CCCTTTCCCA TTCTCACTCG CATCCCCACG CCCATCACCA TTCCTCATCC CAAACCCAAC AGTTTTCGCC TAACCTCCCT CGCGTACACG CTCCCCGAAC AGGCGGCGGC GGCCGTAAAG TCAATGAACT CCCTACTGAA GGCGAACGAC CCGGATACGA CTGCAAGCCG CCCTATCCGT ACCATGAAAT GATTCGGCAT GCGATTGAGA ATGCGCCTGA 
TAGGAAGCTA CAGTTGAATC AAATTTATGC AAGTATCGCG GAAAGGTTTC CGTTTTTCAA GACGTTGGAT GAGAAGAAGA CGGCTGGGTG GCAGAATTCG ATTAGGCATA ATCTTAGTTT GAAGTGAGTC TTTTTCCTCC TTCTCTGTAA ATCTGTAACA GGTATAGAGA ACTGACGGTA TCACTTTCGC TTCTTATAGG AAAATGTTTG TAAGAGTTAA CAAAGTCGAT GGAGTACCAG ATGACTCGGG CGGTAAAGGC GGTTGGTGGA CAGTCATACC TGGTGTACCA GACGAAGGCC GACCGGGACG AAAAGCTAAA GCGCGCAAAG CCAAATTGGA GAAGGAAGCA GCATCAAAGG AAGCAGGCTC GCGTGTGGGG AAGGAGAACG ATGCCAGGGG CTTGGGTATG GGTGGAAGGA TGGGGATGGG AATGGGAAGT GTTTTGCCGC CGCCGGATGG ACATGCGCAA CTACCGCCGG GTGTGGCCCC TATAAGCTCT GGAGCAGGTA GCGAGCTAGG TCAGAATTAC GCGCGCAGTA ACGGTAATGG TCATAGTCAT GGTCATGGTC ATGGGCAAGG ACTAGAGTCC GGACAAGGGC AGGGACAGGG GGCGTTGCAT GAGAAATGGG TAGAAGGGAA TCGGGGACAA GAGGGCTCGG TTGATGAACT AGAGGATGAT GAGCTTTACG GCCAGCCATG AGTGTGAGGA GGATGAAAAG AGATTAGGGG GCCGGAGGAG AAATTTTGTT TTGTGAAGGT TCCTTGTAAG GGCGGATAAC AACAAGACAA ACGGAAAAAA GACAAACAAT AAAAGACGTG TTTATTTCTC CTTTCCTGAA CAACCGTTTT TGTTGAGTGT ATATACTGAT TTTACTTGTC ATCATTCTTC CAGTTTATTT TTTTCTCATC TTCCTTTCTC TTTTACATAC ATCTCTTATC TCTCCTCCTC TTCAGCCGGA AACAAAATGA GCAAGGGCTT TCTGCTGTAG GTGTAAGGTA TTTTTTG
|
Protein sequence | MPGALTKSYG NPSPTRTSPQ KTSSSRFYPT PASMSPNTTS SSSTLNYPPS FDPHHTSSRY PLTTHFAPYS PYRPGEYGRS ERGLASERTR IERKLFADGQ EEDGEVRRGR GSSPLAPAFE AKMVLRNGAS IDLISWLRTN FHHVPPPHTP LVPSMPLNGI RHLILERFPR APEAQEIHKA VLAAFPHSQW DYPPPESSEP PTIRGLIWHG KDIVNDDELD DRPRSAPKDV AAHTRSRLSG DIITTANSNS TSKGKQPSRS PTLSTLVSPI SSSKRHLPDT PARSVLEEFA EIATLAEKTP VSRTKSLPGE PLAASPEQEV AHLSHNPFEQ SMLLRGRGKR RASQSPERGH RRRASTPDKL HGLLAAAEAV EGSPITSVLG HKRRRTIGGP APAREIMSFP RRAMSSRGTM SPPPTRGLAI LPHVEENIDY LVPLNDTASA GSRISEEDAA SNALPSIAGA STARRAPTSS STASQSSAIS HHLVSAPIAG SSGSMHSYPH DHIRGHGHSL SHSHSHPHAH HHSSSQTQQF SPNLPRVHAP RTGGGGRKVN ELPTEGERPG YDCKPPYPYH EMIRHAIENA PDRKLQLNQI YASIAERFPF FKTLDEKKTA GWQNSIRHNL SLKKMFVRVN KVDGVPDDSG GKGGWWTVIP GVPDEGRPGR KAKARKAKLE KEAASKEAGS RVGKENDARG LGMGGRMGMG MGSVLPPPDG HAQLPPGVAP ISSGAGSELG QNYARSNGNG HSHGHGHGQG LESGQGQGQG ALHEKWVEGN RGQEGSVDEL EDDELYGQP