Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB00850 |
Symbol | |
ID | 3256039 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 243159 |
End bp | 248136 |
Gene Length | 4978 bp |
Protein Length | 640 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254736 |
Product | expressed protein |
Protein accession | XP_569092 |
Protein GI | 58263364 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.622601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTGCACAGA TTTCCATCTT ATCGACATCT ACTACCCTGT CATTTTCAGC AGTACAATAC CATACCACAT AGATGCCTGG AACACCACGG GTACATCACA GGACAGACCT ACATATCTAG ATGGCCACCC AACTGGCCAT GGACTGGACG GACTTTCACC ACTCTCAGTG GCCGAAGCAA CAAGATAGTG GCGGCCTTGT ACCGCCCACC GACGAATGGG ATCTTAATGA CGTCCTCAAC ACGGAAATGT TTGGCTCCTC AAGTACTCTT GCGGCTTCAT CATCGGAGCA AGAGCCTCGA CCATCGTCAA GTAACAGCCT TCCACTGAAC ATATCTGAGC CCCTGGGCTC CAATGATGCC CTGTTCCAGT CATATATTTC GCAGAGCACG AAAATGTTAG GCGGGATGGG AGAGACTGAA CAATTATTGG AAACGGAGCG TGCATTTCCT GGAGATGGAG CATTGCCCTC TTCCATTACA AACCCTCGTG GTCGTGCCGT TACCGCAGAC GCCCCTACCC CGTCGTACAA ACCGGCGTCG GCATCTGGAG AACCTCCTTT GAGGAGAGGG ATGTGCGTAT ATTGAAGATC GATATGCTCA GTTGAGGAGC TGACATTCGA GTTTAATTAG GGCTTGTATG TTTTGCCGGC GAAGGAAACT TGTCAGTGAC TCATAGAGCA AGATAGAGGA TAGTAGTTGA CCACTGACGA TAGCGGTGCT CTGGAGAAAA GCCAACATGT GCGAGCTGTG TCAGACACAA CCAATCATGT GAATACCGGG TAACTGAAAA TTTTTCAAAG GTTCGATCCC GAAAGAGTAG GTTGATGACA TACCGCTGGA CCATAGCTAA CGCAATGTCC CAGATGGAAT CTATGAGGGC AACCATAACC CAAATTATCA CTATTCGCAA CGACCTATCG CCAATTTCTC AGTTCCTTTA CCTCCTATTT TTGGTGGCCT TTTATCTTAT GCTCCTGCAC AGCACGAACA ACCCGGATTT GATCCAGTTC CTTCCAACGC TTCTTCTTCC ACCCCTTCCG TTTCGGCAAT ATTCAATTCC GCCTCCCAAG CTGTACAGCC TTGCCCCTCA CCCTCCACAA ATACCGACCA TCCTCCACAG TCTTCGGCAC CGGAAACCGT TAGCTTCTTT CCGGAGAAGG AAGAGGAGGA CGCCGTTGAG CGTTTAACCG AACGACTAGG AGAATTTCTG TTTAGTCACA AAGGGAGTGA AGATACCTCT CGAGCTGTGG ATACCCCCAC TCTGAGTAAA GAAACGGGGT GGCAAAAGAA AAGGAAGACG GGGAAAGATG GCAGATGGGC GACGGATGGG AAGCCGACGA GCGAGAATGT AAGACGGGCT GGAGTTCTGC AGACGACGAT AGAGTCTGAT GGGTTGAGTA ATGAGACTCG TAATGCTTTG TGAGTGAGAT GTGACTTCCC GATCCTTGCC AGCTAACCAA TGTCCCTAGG CTGGAGTTCT TCCTGTCAAG CCCATACAAG TTCTTTGACA TGAATATCCC TCGTTTGCGA TATCGCTTGT CATTTACCGA TAAGAGAAGA CCGGCGTTGG CACTGTTGAA TGCGATGGTA AGTTTGAGAT TGACATTGAA GGATGTTTAA CTCATCTAAT GCCCAATGTT TAGTATCTAT GGGCCGCGCG ACTTTCTGAT GTGCCCAACT CTGATGCAAT GGAGCGACAT TTCTTCACCA AAGCTTGTCA ACATCTGGGG TCTGCCACCG CAAGCGATGA TCAACTCATG GACACTATCA GAGCTGCTGC CCTTCTAAGC ATCTACATGT ATACCCGAAG CCGCTATCAC GAGGTAAGCA ATCGAGCACT AGTGAATCTT ATGTGCTGAT CGCTGGGACG TGGGGATTAG GGCTGGTTAG TAGCAGGTGT CGCTGTTAGA CTGGTACTTT CCTCGGGTAT ACATCAAATT TCTTCTCTCA CTTTCCAGCC TTCTCCCTCC GAAGATCCGC TACTGCGTAA CAGAGTTCAT CTTCTTTCGC CTCCTAAAGA TCCCATAGAG CTGGGGGAGC GCGTACATAC TTTGCAAGTC CTTTAATAAT TATTCAATTC CTAATCTAAC AATGCCTTGC AGTTGGATTG TGTTTATAGT GGAGCGAGCA GGGGCGTTAG CGACTGGGTT TCCATCTACC ATACGTGACG AAGATATCTT GACGCCTTTT GGCAAGCCTC TGAGCGACAT ATCATCCGTA AGATGCTGAT GCATTGAAAG ACGAGGGCAC AGTGGCTAAT GAGCTGTAGC AAAACGTCAC TCTACATGAT GATATCACTA TAACGGACAT GTATCACGAT ACAGGCGACT CGAATCCCAA AACGATTTAT ACACACAAGT ACACTCAACC TCTAACGCAT CATTGGATCT TAACTGATTT ATGGTTTAGT TCACAATGGA TCCAGGCATT AGCAATACTC GAACGGGCTT CAAAACTAGC TTTCCTCAAA CCTTCCCGAG ACTCTGATTA TACAAAGGCT TGGGTGGAAT ACACAAACGC CCTACGTTCC AGCAACGCTC AAGCATCTCC CCCTTCGCCG CCACCTGTTT ATCTTAATCA ACCGAAGCAC AGGAACCCGA GAGAGTACCG AGAATGTCTG CTGGCTTTGG ACAATCTTCG TAAAAATTTG GGTGTGGAGG GACTATCGCC TTTAGAGAGG AAGAGGATGG CCGATGCCAT TGGAGCGCAA TTGGTTATCC CGCCCAGGAC AATCCATTTG GTAAGTGAGG ATCGTTTTCC CTCAAGGAAT TCTTTTCTCA CTGCTACATT GCAGCATCAT CACTTTGCAG CGACTGAGCT GCTGCTTCAT GACATCAACT GTACAGATGC TGATAATAGC GAGGCTATGA AAGCTGCTCG TCAGAGTGTC GATCTGATTA GGTGTCTGCC TCAAACTGTA AGTCCCCAAC CGTTATCTTT TCATATGTAT AGTCTCGGCG AGCTGACCTC GTCTGGTATC AAAGACGTAT CATCGCTTCG ACGGGGAGAT TTCAGTAGTA TGGTGTTTGA TTGCAAAGTG TATGATCAAG GAGTTGGGGC GTTTAAAGCG AGCTGGAGAT GACGATGGTA AGTTGAAATA TACATAGATT TCTGTCGCCT AATCTGAGCG AATATTTGTG CGAAGCAGCT TGTGAACTAG TTTCTGAGGA CGTCGACGTC ATCATTAACG AACTACATCA TGTTGGAGAA GCGTTACATA TGTCGTAAGT GTGGTCCATA TCAAAAGAAC GATCAGTTCA ACTTATGTCT GCTTAGTCGT ACACAAGCGA AGGCGATGGA AGAGCTCAAA CGTGTAGTTC TGTCAAGCCA TAAAGAGCCA ACTTCACCTG TCGGAAATGC CTAGGGGAAA TCAGAGGTTT ACGAGAGAGC CGTGGATGTG CTGTATGTGT GTTCCTGATA TCTGTGTCTT TACTCCAGCG GACAACTCAC ATAAGGTTTC TGATAGAGAG TCGGTATGTA TGTCTTGCAT AATCCTTCCA CTGCTTGTAT GCATGTACTT TAGATGATCT GTTATTCCTA CTCGCGCTTG AGTATTTCAA CTGCGCTCTG CGGTTACTCT GAACCCGGCA ATTGGCAGTG AGACACCCAT CAACTGCTGT TCGCCGTCGG GACAGAAACG GGCCCGTGCA CTCAGCTGTG TGTGCCCCTG GGGTTTTGCC ACCCTGTATA GGTGGTTTTA CAAGAGCGCT ACCATCACGC GGGCGGTGCC AGATCGATGG CCCAGTACTC GGCACAAAGG ACCCTGACAC TGGACCATAC TGGACGGGGA CGCTGTTCGA ATCTCGTTGG AGACTGGTGA AGTGGATTTC TGAAGCAAAA GATTTCTGAC TGCCACTGTC TTGCTTTGGT AATCATGATA CCAAGCATTG CAGCAGCACA CCACGGTCCT AGCTAAAGGA CCGGCCTGGC ACGCATAGAA CAGCGATGCA CACGATGCAC CTGCTGTCGC GAACCTTGCC AGCACCAGTC GAGAGACTCG CCCGCTCGCG TGTAGGGTTT TCCCAGTTAT TCGGTTATAT AGCTTGGGTT TCCCTTTTTG CTTCCATCGC TGCTATGGAT CCAAAGGAGA GACTCGCGGT GGAAGCCTCC AGCACGTTCG CATTGACGTC AGGAAGCCGT TATCACACAC ACCTCCTTTG GCTCCTATAT ATTGGCCCGC TGGAGAGGGC TGTGCCGACG TACTCACTTG TCTCTAATCG CCACTGACAG CCATATACCC ACAGCCATGT TTGCGCCGAG GATATTCGAA AGCCCTGCGA GGAATATGCA CGTAGGTCGA CTATCTTGAT GCAAAGTGCC TGGCTCACCT TGTTTGCGCA AGTGGCGAAA CTATCAGTAC AACGGTCCTC TGACTCCAGA CATGGAAATG GCGTTCAAAA AGGAAGCTTC TCGAGCTCTT TCTCGCAGTA TGATGATGTT CTCTAATCAA ACGCGGTTCA GTCAACGACG CTGACGGGAG AGATAGTAGG CTCTGGATCT CTTGGTTTCT CTGTCTCGAC CTCTGCCACT ACCGTAAACG GTGTTACAAC CGGTATGACC GAGTTTCAGG TGGGCAAGGG AACGGCCTGG ACGGCAGAGG AGACGGCCGT GCTAGATGCA AAGTTTGCAG CGCTGGCGCC GAACGGGAGG ATGGTCGGGT AAGTCGGTTT GGTTATGTCC AACGTAACAT CAAAAAGTTC CGTATAGTGA ATTATTGACT TGGCCCAGTC CGAATGGGCG GTCCTTGATC ATGACTGGAT CATCGCAACA ACCAACGATG AGCCGTGTTG AAGATACTCC TCGAGCGATC GAAGCTCCTC CTCCGGTCCA CCATGACACA CAGGACTATA CCCCTTTGCA CCACGACCAA GTCTCACACC GCTATGCCAG TCCGTCGCCC TCAAAGCATT CCAGACGGGG CGCTCCCTAC GCTCCAAGCG TGAATGCTGC CCCTCCGCAA TCTGTTCAAC CTTTTTAC
|
Protein sequence | MATQLAMDWT DFHHSQWPKQ QDSGGLVPPT DEWDLNDVLN TEMFGSSSTL AASSSEQEPR PSSSNSLPLN ISEPLGSNDA LFQSYISQST KMLGGMGETE QLLETERAFP GDGALPSSIT NPRGRAVTAD APTPSYKPAS ASGEPPLRRG MACMFCRRRK LRCSGEKPTC ASCVRHNQSC EYRVTENFSK VRSRKNGIYE GNHNPNYHYS QRPIANFSVP LPPIFGGLLS YAPAQHEQPG FDPVPSNASS STPSVSAIFN SASQAVQPCP SPSTNTDHPP QSSAPETVSF FPEKEEEDAV ERLTERLGEF LLEFFLSSPY KFFDMNIPRL RYRLSFTDKR RPALALLNAM YLWAARLSDV PNSDAMERHF FTKACQHLGS ATASDDQLMD TIRAAALLSI YMYTRSRYHE GWLVAGVAVR LVLSSGIHQI SSLTFQPSPS EDPLLPSPPS PPPVYLNQPK HRNPREYREC LLALDNLRKN LGVEGLSPLE RKRMADAIGA QLVIPPRTIH LHHHFAATEL LLHDINCTDA DNSEAMKAAR QSVDLIRCLP QTTYHRFDGE ISVVWCLIAK CMIKELGRLK RAGDDDACEL VSEDVDVIIN ELHHVGEALH MSRTQAKAME ELKRVVLSSH KEPTSPVGNA
|
| |