Gene Information |
Locus tag | CNL04500 |
Symbol | |
ID | 3254753 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 249294 |
End bp | 251510 |
Gene Length | 2217 bp |
Protein Length | 702 aa |
Translation table | |
GC content | 54% |
IMG OID | 638253921 |
Product | hypothetical protein |
Protein accession | XP_567999 |
Protein GI | 58261178 |
COG category | [K] Transcription |
COG ID | [COG5576] Homeodomain-containing transcription factor |
TIGRFAM ID | |
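
As a quick consistency check on the fields above, the following Python sketch recomputes the gene length from the start and end coordinates and compares it with the protein length. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 249294
end_bp = 251510
protein_length_aa = 702

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one stop codon follows the last amino acid.
coding_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that do not encode protein
# (intron and/or untranslated sequence).
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)
```

Under these assumptions the span works out to 2217 bp, matching the reported gene length, and 108 bp are left over beyond the 2109 coding bases, which is consistent with the gene being marked as spliced.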
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCATT TCAGTCACGC CCAGCACATT CACGTCGTCT TGTTTTTTAT TATTATTATT GCCTTCATCT GTTTTCCTCA TAATACCATT CTTCCTGAGC CCACCATGTC CGTCTGCGGG AACTCGCCCA CTCCCCCGCC ACCTCCGCCC CCCAACGCCA TCCACATGCA CCCGCCACAG GTCACCCCAT CCGCCGCCAG CCACACGTCC CACCCCACAC ACCACACACC AGACGCCAGA GCCCAACAGT ACTCTATACG TCGCGCGTAC TCAACCCCGT CCATTGCATT CCCGCTCCCA CACCAAGCAC CTCCATCCTC TGCGCTTACA CATGCATCTT ACACGTCGAA CATGTCCGAG AACATTACAC GTTACACCCC GGGTGGTACG CCCCAGTCAA GCGCGTCGCG TCCAGGCAAT GGCAACAGCA AGTTTCCACC TAGCTTTGAC TCGCCCACAC CACATGGCGC GAGATACAAC TCTGCAGTCA GGATGGATGG CATTGAGGGC GCTGAGATGC TCGAGTCTGG CGATCTTGCA GGAATTCAGC CCCCAGCGAC CTTTCCCTTG CCCGAATACC CTGCATCTTT TATGCGCGCG GAGCCGGTGC CCACGGAGGA AACGGAAGAG CTCCAAAGGT TTCTTCCCAA TGATGGTAGT GTGAGAAAGT ATCGATATGG TGGCGCGTCC TGCAACCCAT GGGACTACAT GCTTGGTGAT GTCCCTGATG CCGACTACGA CCACCCTTTG TCTTCACGAC CAGCCAGGTA CGGACCTGAC GCTAAGCACG GATGTAAGGT TAGAAGACGG TTTACAAAGA GAGAGCTAGA GGCTCTCGAA GTGCTCTGGA GTATTGCGAA AAGCCCCAGC AAGTATGAGA GGCAGAGGTT GGGCGCGTGG TTGGGTGTAA AGACGAAACA TATCACGGTC TGGGTAAGTA GACATGTCCT ACCATACACA TATTTTCATT GCTGACTATC TCACAGTTCC AGAACCGAAG GCAGGAAGAA AAGAGGTATT CACGCGACGG TCATCATGAC GCTCCTCCTC CATCTCGATC CAACCGTGGC ACCTTTGACC CTGTCACCGG TAAATGGCGT CCCGTACCCG CATCCTGCAT CTCCGGCCTC CAACCCCCAC CCGATGATAA GATTGCAGTC GTCCGTGCCA TTAGTCTCGG TGACGTGACA CGTGATATGT GGTTAAATAA GTATCCTTCG TCGTCTGGAC GCGGCATGAC TGCGTCGGCC AGGGTCAGTC CCACACCCAT GAGCCTCGCG GCCACCTCAA GAGGAAGGAC TATCAAAAAT GCGCACACCA CCCCTTTGCT GCCTCGTAAT CAATCAAGAT CTCTCGATCA AGTGCTCCAG GCGCGCGAGA GCAGTTTTGG GACAGGTGCG CAGAAGCGAT ATCGCAGAGG CAGCGGTGAA TTTGTTATCA AGGGAGAAGG TCAAGATAGG ATCAAAGAGA TCCTGTCGCT CATGCCAAGT GATCCACCCA GCATGGGGCT TGCAGAGTCG GATGTCGAGG AAAGCGATGA GGATGATGGT GGGATTGATG AGGATGTCGA GAAGAGGAAA AGGGCAAAGC AAGCCAAGGC ATCTAGCACC TTGGCAGGTC TCGGCAGGGC AACACCTTAC GATGTCCTGG CCTCCAGCTC CAGGGCTAAG CTACTCGCAA AACCCATCAG TGAGCACTCC AGCCGCAACC CTGTTCTTAG CCAACTCAAC CCAAATCTCT CCCACTTTGC CCCGCCCTCC AACCTTCGAA AACACACCCT TGAATCTGTA GCTACAAACC 
AACCTACAAA GCGACATCGT TCACGCGTCA CCGGTCATCC GAATCCCAAC TTTCATGCGG GTTCTAGGAC CAAGGACTTT AACAGATCGG TTAGCACCTC TGCGCTCCCT CGTTCGTCGC GAGTGGCGGA AGGCCAAGGA TCATCATCAT CATCATATCT TCCCTCACAA CTCAAAACAC CGAACCTTGG GTACACGCGT TCCCATTCTG TCTCTTCCAG CAGTTCCCGG GTGATTACTC CCGAAGATGT CAAGGATAAG CGAGAAGGCC CGCAAATGCG GGGGCAGGCT AGGGGGCAGG AGAAGGATCA AGAGGTCATT GGTGCTGCGG AAATGTTGTT ACAGTTGTTT GGCGGGTCAT AGATTGTTCA CGATAATAGT TATCACTGCT TATTTCTTAT ATGATCTTAA GGGAACC
|
Protein sequence | MNHFSHAQHI HVVLFFIIII AFICFPHNTI LPEPTMSVCG NSPTPPPPPP PNAIHMHPPQ VTPSAASHTS HPTHHTPDAR AQQYSIRRAY STPSIAFPLP HQAPPSSALT HASYTSNMSE NITRYTPGGT PQSSASRPGN GNSKFPPSFD SPTPHGARYN SAVRMDGIEG AEMLESGDLA GIQPPATFPL PEYPASFMRA EPVPTEETEE LQRFLPNDGS VRKYRYGGAS CNPWDYMLGD VPDADYDHPL SSRPARYGPD AKHGCKVRRR FTKRELEALE VLWSIAKSPS KYERQRLGAW LGVKTKHITV WFQNRRQEEK RYSRDGHHDA PPPSRSNRGT FDPVTGKWRP VPASCISGLQ PPPDDKIAVV RAISLGDVTR DMWLNKYPSS SGRGMTASAR VSPTPMSLAA TSRGRTIKNA HTTPLLPRNQ SRSLDQVLQA RESSFGTGAQ KRYRRGSGEF VIKGEGQDRI KEILSLMPSD PPSMGLAESD VEESDEDDGG IDEDVEKRKR AKQAKASSTL AGLGRATPYD VLASSSRAKL LAKPISEHSS RNPVLSQLNP NLSHFAPPSN LRKHTLESVA TNQPTKRHRS RVTGHPNPNF HAGSRTKDFN RSVSTSALPR SSRVAEGQGS SSSSYLPSQL KTPNLGYTRS HSVSSSSSRV ITPEDVKDKR EGPQMRGQAR GQEKDQEVIG AAEMLLQLFG GS
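
The relationship between the two sequences above can be illustrated with a minimal Python sketch that strips the 10-character spacing, computes a GC fraction, and translates the first 30 bases of the gene sequence. It assumes the standard genetic code, since the record leaves the translation table field blank; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Assumption: standard genetic code; the record does not state a
# translation table. Codons are ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three 10-base blocks of the gene sequence above.
gene_prefix = "ATGAATCATT TCAGTCACGC CCAGCACATT"
protein_prefix = translate(gene_prefix)

print(protein_prefix)  # MNHFSHAQHI
print(round(gc_fraction(gene_prefix), 3))
```

The translated prefix agrees with the first ten residues of the protein sequence. Because the record marks the gene as spliced, translating the full gene sequence in a single frame would not be expected to reproduce the whole protein, so only the leading bases are used here.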