Gene Information |
Locus tag | CNH02280 |
Symbol | |
ID | 3259176 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 491594 |
End bp | 493561 |
Gene Length | 1968 bp |
Protein Length | 450 aa |
Translation table | |
GC content | 46% |
IMG OID | 638258259 |
Product | conserved hypothetical protein |
Protein accession | XP_572400 |
Protein GI | 58270488 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.380416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTTCCGAC GGATTATTTG TTTTCGCCAA ATAGGAAATC ACTTGCCAAA ACTACCGTAA GCACATCTGC CCATTGTTCC GCAATACTCA TTCCCTTCAT AAATTCACCA TCTATAGGCG GCGTAAACCT AAAGGGTGCG CCAGTAGGCA ATAATGAGTG ATTGTGGGTA TATCAGGATA CTGAATGAAA ACATCCCACT TACGTCGCTC ATAGCCGGCA CATTGTCCTT GCTTGCGGGC ATCGTCTTTT TAGTGTTATG GACGGTTGTA TGGTCTATAT GTCTGCTTGG CTGGCGAACC GCGTAAGTTT CTGTTGCGGA ATTGTACTCG TCTAATTTAT CATTCAGGCG CATACGGTAC GCTCACCCAA ATATTCCCTC ACGTCTTTCC AAGCTGCCAG TATCGTCTGC TCCTGGTGTC ACCATCATAC GGCCTCTGTG TGGACTCGAT CAGAATTTAT ACAACACTCT TGAAAGCGTG ATGAAACTCG AGTATCCCAA GTTCGAAGTG ATATTTGCCG TTCAAAACGA AAAAGATGAG GCGCTTCCAG TCGTAAATAT GGTCATGGAG AAATATCCAG AGGTGGAAGC CAAAGTAATC ATAGGTGAAT ATTCTCTTGA TCACTCGATG AGTATTCTAA CCATATCATA GATTCACGCA AGGTTGGGGT GAATCCCAAA GTCAACAACC TGATGACTCC CTTCCAAGAA GCCAAATATG ATATGTTATG GATTCTTGAC TCGACATGTT CTGTCCTCCC CGGTACCCTC GGTCGCTCTG TCGAAGCCTT CTTCTCCAAT ACAAGCAGTA CCGCATCACC TTACGACCCA GAATCATCTC CTCTTCTGTC GATCTCGGAC GATGTAAGGA AGCCGCCGGT AGCTGGGGAA GTGGGTCTAG TACATCAAGT GCCCATAGCG GTTTGCTATC AGAAGACATG GGGAAGCCTG ATTGAACAAG CTTACCTTAA CACCACGCAC GCGAAAATGT ACCTTGCTAT CGTGAGTGAC TGAATATCTC CCATGTCTGA GCTCGATCTG ACTTTTGTGT ATTAGAACGC AACATCCATC GACTCTTGCG TCGTCGGGAA ATCCTGCATG TATTCACGCG ATAACATCTC TCACCTAACG ACTCCCTCAC CATCTCTGCG CTCTCTCCCC GATCCACCGA GCGGACTCGC CGGATTCGGT CCTTTCCTCG CAGAGGACAA CATGATCGGT CTCTCCCTTT GGCACGAGCT CAAACTCAAA CATTCAATGA CTTCGGATGT TGTCCTGGAC TTTATCGGGT CGCTCTCTGT GAGAGAATAT GTCAACCGTC GGATCCGCTG GATCCGAGTC CGAAAGAAGA TGACCTTGGC GGCCACTTTA CTTGAACCAT TGACGGAGTC AATCATCTCT GGACTATATG GTGCTTGGGC AATCTCTCGG TTATTGGGAG GCAATATCCT TCCTTTATTC TTACTACACA TGGCAGCTTG GATCTCTGTG GATCTCTCAA CCAAACGGGC GCTTGAGACG AACATCAAGG GCATAGGACC TCCAGAAAAC AAGGTCACGT TCTTGATGGC TTGGGCAGCA AGAGAGTGTC TCGCATTGCC TATATGGATT TTGGCGATGA CAAGCTCTGA AGTCGTATGG AGAGGGCAAA AGTATAAAAT TATTGATTCC GGTGAGTTGA TCGATATAAC ATGACATGCT ACATCCTTTG CTGATGATGT TTATAGGAGA AGCAATTCGT TTGGATGACC GAAATTGATT ACGACATTGC ATTGATATGA ATTTTGGTTT TCAAAAGCTC 
TGCATGTTTA CTGCATATGT GTGGATGTCC TTCGTCGTTT AACAGAAATT CTTGTTCCTC AACAGATGTA AACGCATCCT GAGAGCTGAG CCAGCACTCT CTGGTTCGAT AAATAATTAA CTTTTGACTA CTTTAAATGG TAACGCATTG TTGATTTACA GAATGGCA
|
Protein sequence | MSDSGTLSLL AGIVFLVLWT VVWSICLLGW RTARIRYAHP NIPSRLSKLP VSSAPGVTII RPLCGLDQNL YNTLESVMKL EYPKFEVIFA VQNEKDEALP VVNMVMEKYP EVEAKVIIDS RKVGVNPKVN NLMTPFQEAK YDMLWILDST CSVLPGTLGR SVEAFFSNTS STASPYDPES SPLLSISDDV RKPPVAGEVG LVHQVPIAVC YQKTWGSLIE QAYLNTTHAK MYLAINATSI DSCVVGKSCM YSRDNISHLT TPSPSLRSLP DPPSGLAGFG PFLAEDNMIG LSLWHELKLK HSMTSDVVLD FIGSLSVREY VNRRIRWIRV RKKMTLAATL LEPLTESIIS GLYGAWAISR LLGGNILPLF LLHMAAWISV DLSTKRALET NIKGIGPPEN KVTFLMAWAA RECLALPIWI LAMTSSEVVW RGQKYKIIDS GEAIRLDDRN
|
| |