Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND00370 |
Symbol | |
ID | 3257317 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 106426 |
End bp | 109249 |
Gene Length | 2824 bp |
Protein Length | 693 aa |
Translation table | |
GC content | 47% |
IMG OID | 638255974 |
Product | transcription factor, putative |
Protein accession | XP_570381 |
Protein GI | 58266450 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAACATCCTC TGTTGTCTTT TAGAGCCTAC ACTGCCCACG AACAGTGATT ATCACGCACA AGCTGGAAAA GTATGTACAC CACTGCGTGC GAGGCATGCC GCAAGGTGCG AATGAAGTGA GTATTTTGGG CGTCCCCGGT TATGAGTGCT TCCGCTCACG GATCCATTCA GGTGTATCCG TCCTTCAAGA GGATATGACA TGTCTGAGAT ATGCGAACGA TGTCGATCCA CAGGCATCGA GTGCATCACT GTAAAGCGTC GGGTAGGAAG ACAGCCGGGA GTAAAGAATC GCAAACGTAA AGCGGATTCT ATCAGTGAAC AAACCGGAGC TTCTTCCGCA AGCAACAGTG AAGCACAGAT TCCGAGAGAT GTGGATCATC TTCCAAATCC GCTACATGTC TTGGCTTCGG AAGCTATCAG AAGGCATTTT ACACCAGAGT ACGCTATGAG GATCGTGATT CATTTGATAT ATAGTTCGCT AATCGAATTT CTTCACAGGG CAGAAGAGGC AACCGCTCAG GACAGCTACC ATATACGCAG CAGCAAGAGT ATCTTCCATA GGTACTCAGA TTGGGCTGAA AAGATTCAGC CGGAGGGAGG GAAGGAGGCG ATCATGCGTC GACTTGATTC ACTATTATCA AAGAAGCCCA TAGAAATGTC CTCGGATGTT GACGAGCCAT CAGTCTTTTG TGGAAGGATT GACGTGAGCA TACTCCTCTA TTTTCTCCAT CTCGCGTTCG TTTCTGATTA TTATTCAGAT GGCGAGGCCA GATGCATCCC CCGAGCACGA TGTTATCTCA CTGCAGATTG TCAGTCTCGC TGAAGCGCAG CATCTTTTCG ACTCGTATGG ATCCTTTTCA TTCGATAATG TTTTACTCGT TTGCTTAATG TTTGCACAGG TTTATGGAGC TCATCACCAA TGGATCCATG TACTTTGACC CACGCCTTCA CACCCTCGCA TTCGTCCGAT CCCGCAGCTC ATTCCTCCTT GCCGTCATTC TCGCTATCGC AAGCACCTAC AAATCCATTT GTCCCTCTGC TCGGCTGCAT ACTCTCCTCA TGAACCACGC ACACCGACTC GAGGCTGTAG TCCGGAATAA TCACCTCAAG TCAATTGAGA TTATCCAAGG TCTCTTGCTC TTGGCGAGTT GGATGGAAAT ACCGTCTACT CTGGCGAGAG ATAAGACGTG GATGTTTGTG TCATATGCGT TGGCGCTTGT TGTTGAGCTC CGATTAGATA CTGCACTGCC ATACTGCGTT CAAACGGATC CATTATACGA CAAGAGTAAT CACGACCTAC TGGTTAGGAA TGCACATCGG GTATGTTTCC TCATGTATAT ACACGATCGG GTAAGTGACA AGAAGGCGTA TTTTTCCTTT TTTTTGAAGG TATACTGAAG TGGTAATAGA ATATGGCGAT GGTGGCGGGC CGCCATCCGA TATTCCGAGA TTCTGCTTTA GTGTCACCCG ACTCATTAGC AAAATGGGGA AAACATCCTG TAAAGTTGAA TCTCCTTATC AAAACGACGG GGTTTGTCTG ATGGCCAATT AATAGCTTGC GCACCGTTTC GATGCGGCTA TCTGTGCATC TGTATCGCTG CGAAAGCTCG TAGTGAGTCC ATAATGCTTT CGACAATTAT TGTCTTTGTA TTGATCTTGG CGAAACAGAC GAGCGCGCAT GCCCGGTTGA GTACCCAGAA TTATCCTGAT TTCGCCTCCG GCGAAGAATA CATTGACCGT TCAATGGCGG AATGGCGAAG ACGATGGTCT TATGAAATAC AAAGTTTGTC GGGCCCGGCT CGGCATCTGC GCTGAAACAC TCAATTGAAA CTGACAAACC GGATACAGGT ACCCATGAAT ACGACATTAT TGCACGCTTT TCAGCTTTTG TGCTCGCACT GACACTGGTC AAGAAGCGAC AGCTTACAGG GCAAATTGAA AGAGAAGCAA GGAGGGCATG TGAGGTACTC GCGTTCGATG TCGTATGTGC TGCCATACAT CACTATAAGA CATGGAAAGG TCTTTTGAAT TCCGCGACAT TCGAGTAAGT CGATGCCTTT ATGATGGACC TCCTGACCAT CTGGATTTCA GCACAAGCAT GGTTGCGTTT TGCGCCATAT ACACTATTCA GTCAATTAAT CATTCTGCCT CACCGTACCT TTCAGATCTT TCGCTGCTAC GCTTGGCTAC TGTCCACGAG CTTATTGGCG AACTGGAAAC TCAAGCGGAA GCAAGGCATA CAGTTGACAT CCCGGGGTAC TTTTCTGTTG TTGACGCCAT GGCTCGTCAA TTGTCCCGCA ATATGCGCCT GTTGCTTTCA AAGAAAGAAA TTTACCAAGC GCCTCATTCT GAGACCTCGG AAACCCATTC TTCGACATAC AACCCCCATA CAAACTTTAT TCATACGCAT CCTCATCTTC ATCATCTACA AGACCCTCAC CACTATACCA ACAACCAATT TCCACAATTC GACGAGGTCG CACAGTTCAT GTTTACAGCA GACGATGGGG GATTACCGTT CATGGGCGAT TGGGGCTTGG AGGGGCTGTT GCCAGATATG GATTTTGGGA TAGACACAGG GTATTCGGGG AGTGAGTCTG GAGGGACGTC ACATGAGATG ACTCAAGGGG TGAACATGCA GCATATGCTG AATCTAGGAT GTGAGTGAGA TGACCATCCT TTCTACAAGG TGTGCTGACA GGAGATTATC ATAAAGGATC ATAAAGGATG TTGTAAATGG CTAGTGTTGG ATCAATGCGT ATTGTCTTGG TGGGAAAATG TTTTGATGTT TTGATGTTGT AATGTTATCA TCAGCGCTTC AAAG
|
Protein sequence | MYTTACEACR KVRMKCIRPS RGYDMSEICE RCRSTGIECI TVKRRVGRQP GVKNRKRKAD SISEQTGASS ASNSEAQIPR DVDHLPNPLH VLASEAIRRH FTPEAEEATA QDSYHIRSSK SIFHRYSDWA EKIQPEGGKE AIMRRLDSLL SKKPIEMSSD VDEPSVFCGR IDMARPDASP EHDVISLQIV SLAEAQHLFD SFMELITNGS MYFDPRLHTL AFVRSRSSFL LAVILAIAST YKSICPSARL HTLLMNHAHR LEAVVRNNHL KSIEIIQGLL LLASWMEIPS TLARDKTWMF VSYALALVVE LRLDTALPYC VQTDPLYDKS NHDLLVRNAH RVCFLMYIHD RNMAMVAGRH PIFRDSALVS PDSLAKWGKH PLAHRFDAAI CASVSLRKLV TSAHARLSTQ NYPDFASGEE YIDRSMAEWR RRWSYEIQST HEYDIIARFS AFVLALTLVK KRQLTGQIER EARRACEVLA FDVVCAAIHH YKTWKGLLNS ATFDTSMVAF CAIYTIQSIN HSASPYLSDL SLLRLATVHE LIGELETQAE ARHTVDIPGY FSVVDAMARQ LSRNMRLLLS KKEIYQAPHS ETSETHSSTY NPHTNFIHTH PHLHHLQDPH HYTNNQFPQF DEVAQFMFTA DDGGLPFMGD WGLEGLLPDM DFGIDTGYSG SESGGTSHEM TQGVNMQHML NLG
|
| |