Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF03810 |
Symbol | |
ID | 3258037 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 1114570 |
End bp | 1116972 |
Gene Length | 2403 bp |
Protein Length | 619 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257500 |
Product | hypothetical protein |
Protein accession | XP_571338 |
Protein GI | 58268364 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.369935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCTC GTACTCGCAC ACGACACCAC CCCAGCGCCG GCACCCGGCC TCGTCCGCCG CTTATCCAGA CAAGCCAGGC GGCTGAGCCA GCTGCCGGTG CGTATCTCGT GCTAGCAGGA GCGCAGTGCT GACGCCGTAA TAGATGGCCG GAGAGAAGAC AAGAGCGCAG AGTGTGAGTT ATGCGCTAGC GTACGTAGTT GACCCGCGAC TGACTTTACC CAGCCAACCC TTCAAGACCG CCCACCCCGG CCCCAACCTC CTCCAGGGCG TCCAACCCTC TTCCGGACCT TTCTCGCGTG GCAACCGCCG CTTATACAGC TTCTATCCTT GCCCGCCCAC CTCCGCCTAC TATCTACGGT CGGCCTGGTA CTCGTCCATT CTCTCTCGAT CCGTTGCCGT TCACCGACGA AGATATTCCG TTGCCCAAAC CGTACAACTC CAATTACCCG CATCCGATTA CCGCATACGC CTCTCCTGGG GTCAATCCAG AGACTGTCAA AGGATCAGGC TATAACAAGC TGCTGCAGGA TATCGTCAAG GAGGATGTAA AGAGCGCTCT CGGAGATATC CTTTACGACG CTGGCTTCAG AGATCTGTCT AATTCAGTCT ATTTGCTCTT TTCTTACGAA AGCATGGGGC AAGGAAATGG GATTCCGGAG AGTCTCTGGA GCAAATTTGA GAATGTCAAC AATGTGAGGC TGCCAGCCGC CGCGGCTGTC ACTGCGTCGG TCGTTGCTCA GGTCGAAGAA ACCCCTCCCA GATTTACTCA GTCTCAACTC AACTCAACCT GCAGCAGCGA CTTTAATGCT CATCGCACTG TGGAAAACAA CCAAGCGATG GGACGCACTC TCAGACTGTT ATCCGGCGAG TCGATTTTGT CGCCGCCTTA TAGCCTGATC CAGACACAGA CACACACACA GCCGAGAGAC AGGGAAGCAA TGGATGCTTG GAAGACTGAA ATATGCGCGG CTTGGGAGGC TACCGGACGC TGCAGATATG GATCTAGCTG CCAGGCAAGT AACGTTCGTC TCCCGCAACT CGCATGCTTG TTGACTGGTT CCGGATTTAC ACAGTTTGCT CATGGCATTG AAGAGCTCAA GCTTACTCGA CAGTCACTCA TCATACGTGG CCTCGCCCCC CAATCTCCTC CAACACCTTC TGATATACTC TCCCCTATTT CTCCACATCG TTCATCCGTC TCCCGCTATC CCGTCACTTG CAGGTCTAGT CAGACCCAGA TCTACCATCC TGCGATCATC TCGGGGTGCC CTTATGTCAT CAAGGCTAGT GATAGGAGAA TGTCAGTTCC ACATTCGCAG CTGAGTAGGG TGGCAGAAGA CGAGCTGCAA TTTAACGATT TGGACTTGGG ATTCAGGCGC CTGTCAGATG TGTCATCTGG CCCTCCTTTC GCCAACACTC AAAACCTCGG TGTGCCGTCA TCACGTCCTC GTTTCGACCC CCTTCCATCC AAATTCGCTC CTTCACCTGG TGAGGAATAT CAAGGCTACC TATTCCCTCC CTGCAAACCC ACGTCCCTTT CCTCTGACTC TGCTTCATCC ATCACATCAT TCAATATTGG CCCGCTCTCA GCTCAGGAGA AACAAAGAAG ACTGGTTTCC CAACCAAGTA ACTTGACATT GTACACTTCG TCGTCATCTT CAACTGAATC TGTCAGTGGT GGATCGAGGT TATCCATGTT CTCGGCCTTT GACGATGGAT TGGGCGAAAG TCTGGTCACC CCAATCGAGG TTGGATGGGA GAACAATTAC CTAGACTCCG CCTCTGGCTC ATCAAGCTCT AAAACAGGGC TCGACATCCA GTCAAGTAAT AGCGGGACTG GCGATGAGAT CGGTTTCGAA GCTACTGGAA TGAGAAAAGC CGGCCTAGTC AAGTCGGGAA GTATGGGCAG TGTGGGCATG ATGGGGTTAC CGGCCCATTC GAGCTTGCCT AGTATGGTGG GGCCGAAGGT GTCAACTACT TACGAATTTT CGAGTGGCAA TTCCATCTGG CGTTGACCTT GCTCTGCCTT TTTCTCAGCT TCACGAAAAG AGTCATGTGT ACTTTTCAAA AACGATCTGT GTCAAGTCAT GCCTGTGATC CTTCTTCCAC ATCTTCAGCT TTTGTTTATT GTGGGATTTT GTATATCTGC GCTGAGTTGT CGGTCCTTTT TTTCTCGCCT TTGCAAAAAT AGACACTTTA ATAAAAGTGC AGTAATGCTG TAGCTTCAAT CCGATTGACA GGTTTTGGCA TGATGTTATT GATAGGCAAA AGAAGTGGTT GTGTCTGATA GCTTTTATGT TCATGTCAAG TCTAGGTACT GGCGCCTTGG ATCTTATTAA TTATACACGC TCATACTTGT TTTTCCTTTT CTATCACTTC GTCTGTACCT GCTACAGGTG CTT
|
Protein sequence | MSARTRTRHH PSAGTRPRPP LIQTSQAAEP AADGRREDKS AESNPSRPPT PAPTSSRASN PLPDLSRVAT AAYTASILAR PPPPTIYGRP GTRPFSLDPL PFTDEDIPLP KPYNSNYPHP ITAYASPGVN PETVKGSGYN KLLQDIVKED VKSALGDILY DAGFRDLSNS VYLLFSYESM GQGNGIPESL WSKFENVNNV RLPAAAAVTA SVVAQVEETP PRFTQSQLNS TCSSDFNAHR TVENNQAMGR TLRLLSGESI LSPPYSLIQT QTHTQPRDRE AMDAWKTEIC AAWEATGRCR YGSSCQFAHG IEELKLTRQS LIIRGLAPQS PPTPSDILSP ISPHRSSVSR YPVTCRSSQT QIYHPAIISG CPYVIKASDR RMSVPHSQLS RVAEDELQFN DLDLGFRRLS DVSSGPPFAN TQNLGVPSSR PRFDPLPSKF APSPGEEYQG YLFPPCKPTS LSSDSASSIT SFNIGPLSAQ EKQRRLVSQP SNLTLYTSSS SSTESVSGGS RLSMFSAFDD GLGESLVTPI EVGWENNYLD SASGSSSSKT GLDIQSSNSG TGDEIGFEAT GMRKAGLVKS GSMGSVGMMG LPAHSSLPSM VGPKVSTTYE FSSGNSIWR
|
| |