Gene Information |
Locus tag | CNF04390 |
Symbol | |
ID | 3258455 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 1271564 |
End bp | 1276348 |
Gene Length | 4785 bp |
Protein Length | 667 aa |
Translation table | |
GC content | 49% |
IMG OID | 638257557 |
Product | conserved hypothetical protein |
Protein accession | XP_571398 |
Protein GI | 58268484 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
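The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length) and that the coding sequence ends in a single stop codon. The function names `gene_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
length = gene_length(1271564, 1276348)
cds = coding_length(667)

# Prints: 4785 2004 2781
print(length, cds, length - cds)
```

Under these assumptions, the 2781 bp not accounted for by coding sequence is consistent with the gene being flagged as spliced, although the record does not say how that remainder divides between introns and untranslated regions.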
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.483072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGTACTGC GAAGGGTTAA TACGGAAAGT AGTACAGCGA AAGCAGCAGC GGCCACTTTT CCACGCTCTT TACAACGTCC CGCACAAAGG AGAATAGTGG AGCGTCGAAA GGGACGCAGG GGTTTATCTC TACTGCACCT CCAGCACGCT GTCTATCAAC CATCGTTGTA AATACATTCG CTGTGAATAC ACGCTCAGGC CATCTGTGTG AGTGACTGTG ATCTGTGAGT GATTGTGTTT AAAAATGTCG GATGGTCCGT GCGCCTGCTA GCCTTCGAGA AAGATTTGCT GAGGTGTCAC CTTTTATTAT TTACTAACAT TAACAAAAGC TCATGTCCGC CCACGGCCAC CCCTCCACCT CCTCCCAGCA GCCGCACCTC TCCAATATCC TCGCACCAGC CAGCCAGCCA GCCAACACTG AGCGAACGGA GAGCGAGGTA CGAACCAGAA ACAGTGCTGC GGGGAACAAG AGATTGACTT TGCCTGCTGT CCAGATCCAA GCAGATGATG GTGGCGGAGA GGGCGAGCAA GGCCAGAATG CAGACCGGAG CGATAGTAGC AAGGGTAAGG AGAAGGATAA GGAGTATGAC AAATGGATAC AGGGGATTGA CTATGCTTAC GAATATGTGC CTGTAACTCA GGTGAGTGGG ATACACATTC TCGTCACAAT CGCATGCTGA CTCTTATCCC AGCGACGTCA AGGAAGAAAG AATATGTAAG TGTCTTGCTG TCGTGGTGGC TTGGGTTCCT GTTCCTTTCT CTGTTTTTTT CCTCTCTTGG ATTGGGCTTT GTTATGAATG GCTTCGCTAG GTGATGGACG CCAAATCTAC CTTCTCTCTG GAAGCTTTCA ATGAACGGAC GATTATTACC ACATAGACTT GTCAATTTGG CCTTGCTCGT GAGTTCAAGT CCAGTTTGCT TCTCCAGAGG ATATTCGTTG CTCATCTACC TCGAGGTCAC GCGGGACTGC TTTTTATCAT TGTCACTCTT TCACACCTTA ACTTGTTTTG TATCACAAAT TATCAATTTT GAATTCTCGT CGCGGTTAAA AGCGTATTCG GCAATTCTTA TTTGCACTGC GTTGTTTTCA CCATCGCGGA CCTCATTCTC TTACCAGGAA AGCCGGCTCT GAAAGAACGG GAATCCTACG TTTTCGCCTC TGCTTCTATT ACTTGTTCAT TAGTCACTCA GTGTCACCGG CTTTCCCTGT CAACTTATAT ATCATGACCT GCGTACATTC GCAGAAATCT ACAGCTTCGC CCGGTTTCCT TTAGACTGCA AGATCCTCTG GACTACGATA TTAAAAGTAT TCGAGGCAGC TAGTTCTGAC GATGGGCGAA TTGGGAAACT GGAGGACTTT CAAGAGACTT GTCGTCGTAT TCTCCACAAG CTGAGAGCAA GATGTGGGTC TCAGTTCCAC AGAGATTACC TGGGAAGATT ACTGATAACG AATCAGACTG TCGAACGCTG TTGAGGGCTC TTCCTTTGCG CAATCGGATC CCAGCCCTTT CCATTTGTAA GGCTCGCACA AAGCTTCCCG ACTACACTAT AGTGTCTGCC CTTCACAAAA ATGAACCAAA TGAGATTCTT CGCCCGAGGG TCATCTAGCA TACCACAATC CACTCGTACG TCTGACGAAA AACAATACTT CGCGGCACTA TATTGGGCTC TCACATTACG AGTGCTCGCC TTTATTTGGA TCGGAGTCCA TTTGCTGTGA AGTCAACTCA AACCGTCCAT CGGAAATTCG CGCGATGGAC AATGGTCTAT TTGTGCAGAT CGTTTCAAAA GCCCGCAAGA 
TGTTTTGTGG GATGTCACTT TCCCTCTCCT CTTCAGCCCG GCGAAAAGAG GTGGTGGGGT GATAGAGGGT TGGAGGATTG GATGAGGGCT GTTCGGTTAT TATTCTGAAC CTGACAATCC CACGGTGGAT CTGTCTTAAC ACTTAAAACC CAAGGATGTT TTATGAAACA TGTTTGCATG TAGGTGAGTA TGCATTCTGA GGGTTGAATG TGATCATTCT ACTAATGTTC AACCACAGAA CCACATCGCC AAGGCTCATT ACCTCTTCGC CTGAAACTTC TACACCCACA GCATGTGGAT TCGCGGACAC CGTTCAAGGA TTTTGTCCAA CGTTCGATCG CCGACACCTA TCAAAAATCA CCGGACACCT TTGATCTCCA CATACACACA ATTGTATACA CAAAAACAGT CACTCCTACG TCCATTGTCC TTTTTCTCGC TTTATCAGCC CTATACTTTC CCCTGCATTG CCCCGCATCG CAGTAGTCTT CCCGATCTTA TCGCTCCTTA TTGTTCTTTT AGGGTCCTTT TGAACACACA TCATCCTTGA TAGTTTCGTT CATCTATTAC GAAAGTACCC AAGGACGTTG TTCTTTGTGC TCTTTGTGTA TCGCTCGTCT TTTCGTTGGT CGAAGTGCTG TAAAGAGTGA TTTCCAAAAA GTCTTCAGGC TTGTGCCTTT AATTACCCAT TCCCTGCTTT ATTTTGCTTG TTCTTTTATT TCCTTTTTAT TCTTCTTCTT CTCTTGCCTC ATGGTCCATC ACTCACAATT AACGTCGAAT GGGGCTGATC AGATGTCTTT TTACGTCTAC CCACCACCTC CACCGCCCGG CTATCAACTT GTCTACGTAC CGCCCTTTCA CACTACCCAC TTACCACGCC AACGTCCTTG TTCAACTTCA TCACTGCCGA CTACACCTTC CAACCCTTAC CCTCCCCATC TCCCCACGCC ATCCTACACG CCAATGTCAA TACGGCACTG TTCCTCGGTT GCGACGGACC ATGATTGTAT ACTCAGAACA TACGAGCTTA CCATGCGGCA ACAGCCTGTA CAGGCAAGGA TGTGTGGGAT TGGAGATAAA TGTGAGTTTT GTGCGGAGAA GCACTAAGGG AGGGGGACGG GATTGCTAAC TGGCTTGAAT AGCCGATCGA CGGCCAGTGG ATCCCACGCC AATCATTCAG CTGAAGGTTA TCGACCCGCA AGGCGATGAT ATCACCTCAA TCGATCCTCA GACTAGGCAG CAGATCCGAA GGCCGTCCGG TTCTGAGGGC ATGACATATA TGCAAAGTAA GCCCAATTAA TCCTTATACA CCTCAATCCG GCTTGTCTCA TGCTGAAGTT TTATACGCAG ATCCATACTA TTTCTTATTC GCTTGCCTTG TCGGCGGTGA GGAACAGGAG GACGAGCTTC ATGTTATTGA TGATGGCAAA ACCCGATTCC TAACCGGGAC ACCTGTTTCT TCATTGTATC ATCTTAAGGA CTTGGATAAT TCGGATGCGG CCTTCTTTGT GTTCCCTGAT CTTGGTGTGA GGAAAGAGGG GAGATACAAG TTGAAATTGA CGTTGTTTGA GATTGTCGAG TGAGTTATCT CAACTCACCC TACCGCGCTG GGCATGCTGA TACATGTCGC TCGCAGTCAA GAAGTATATT ACTGTACCAC AATGTTCACT TCTACATTCT CTGTGTACTC AGCCAAAAAG TTTCCAGGAA TGTCAAGTAC GTCGCTCAAT GTCAAGAAGA GATGATTGTT GACGATGTAA TGCAGAAGCT ACAGATTTAT CGGTATCATT CGCCGAGCAG GGCCTCAAAA 
TTCGAGTCCG TAAAGATCCT CGCCAACGTT AGTTCGATGC TTCTTAGTGG CATGTGAGTT TCGGCTGATG CTGATCAGCA GCTGCGCGTG CAACGGCATC TTCAAGTGGA AAATCGAAGC GGAAGAGTGA TGCGCACGAA TCAGAAGAGG AGGCGCCAGC GCAAGGGCAG TCCAAGAGGA CCCGAGGGCT TTCGATGGGT TACCCCGGAC ACGAACAACG TTTCCACCCT CCTCCAGGCG AACACCATTA CTACCCTTAT CCTCCCAACT ATCCTCCCCC GTCCGGTTAT TATCCTCCAT CAGGTCATCC CATGCCTCCC CCACCACCAC CTAGTGCATA CGACCCTTAC CGCTCTCATA TGTCCCGTAG CGGATCGCCG TACTCTTACC ACCCACCTTA CCCCCCTTAC TCTGCCCCAG GATATCCACC ACCTCCATCA AGTAACTATC ATGTCAACTC TGGCCGTCCG TCTACCAATC CAGAATCAAG TCCAAGTAGA CATCGCCACT CTCTGCCTCC TCCGGGATAT GTTCTGGACC CCCGCCCAAT TCCGTTGCCT TCTTCTCATG GCCACCATGC GTACCCAACA TCATCTGGTA TGGGGTACCC GCCATACCCA ATGCATAGCG AGTCCTATGA CCGAGCAGGT GTGCCATGGG CTCAGGAGCG CCGTCCAACA TTGCCGCCGC CACCACTTCC AGAATCTGCG ATGGGCAGAC CTGGGTCTCG AGGAACTGGT CATCATTCGA CCCATCCTTC TTCTCTTCGT CAAGGGGTTT CTCCACTCGA AAGCCCAGGA AGACTTCAGT ATGCTCAAAG TCCCGATATG GGATCATACA GACCATTATC GTCTGCTAGC ACCAGAGAAG GCAGGGATGG GCGAGAAAGG GAGAGGGAAC GTGAACTACA TAGGGGATCG CCCGTCATAC TGCCTCCGAT CACTCGATCA CCAGTTCTTA TTTCACAGGG TCTGCCATCC ATCACCGACC CTCGAGAGTC TCGGGAATGT CTGAATCAGA GTGAGAGCAG GGCAAGCGTG GGTGGAGATT CTGGGTCTCG TCCAAGCAGT TCAGGCAAAA ACAAGATGGG ATTGGGCAAC CTGCTGGGTT GAGAGAATAC TGAAAAAGGA GCTGTGTCAT TTATTGTGAT ATAGTAGAGC AAATAGAACA ATGAT
|
Protein sequence | MSAHGHPSTS SQQPHLSNIL APASQPANTE RTESEVRTRN SAAGNKRLTL PAVQIQADDG GGEGEQGQNA DRSDSSKGKE KDKEYDKWIQ GIDYAYEYVP VTQRRQGRKN ITYELTMRQQ PVQARMCGIG DKSDRRPVDP TPIIQLKVID PQGDDITSID PQTRQQIRRP SGSEGMTYMQ NPYYFLFACL VGGEEQEDEL HVIDDGKTRF LTGTPVSSLY HLKDLDNSDA AFFVFPDLGV RKEGRYKLKL TLFEIVESIL LYHNVHFYIL CVLSQKVSRN VKYVAQCQEE MIVDDVMQKL QIYRYHSPSR ASKFESVKIL ANVTAARATA SSSGKSKRKS DAHESEEEAP AQGQSKRTRG LSMGYPGHEQ RFHPPPGEHH YYPYPPNYPP PSGYYPPSGH PMPPPPPPSA YDPYRSHMSR SGSPYSYHPP YPPYSAPGYP PPPSSNYHVN SGRPSTNPES SPSRHRHSLP PPGYVLDPRP IPLPSSHGHH AYPTSSGMGY PPYPMHSESY DRAGVPWAQE RRPTLPPPPL PESAMGRPGS RGTGHHSTHP SSLRQGVSPL ESPGRLQYAQ SPDMGSYRPL SSASTREGRD GREREREREL HRGSPVILPP ITRSPVLISQ GLPSITDPRE SRECLNQSES RASVGGDSGS RPSSSGKNKM GLGNLLG
|
| |