Gene Information |
Locus tag | CNJ03320 |
Symbol | |
ID | 3254063 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | + |
Start bp | 1041103 |
End bp | 1042969 |
Gene Length | 1867 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 49% |
IMG OID | 638253481 |
Product | hypothetical protein |
Protein accession | XP_567411 |
Protein GI | 58260002 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0624249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGATGGTTA CCCTTATTCT CTAATCATCA TTCATGCTGT TCAGCGGACT GTCCTCCCCA CATATTCACG ACAGAGCGCG TGGCATACTG ATGAGCGGTA TGTACCTTTT AGCACAAATT GCTGCATTCC TTGGTTGGTT ACTGGGGTAC GTCTTTTTCT CTCTTGCTTG GCGTGTCGGC GCCTAACCTC CATCTTGTCT GCAGATCCCA GACTAGCCGA TCATTGCCAC CGCTCTCCAT TACTTCTCAA TCAGAGCTCT CATGTGGCTC CATATCGCAT CCCATCACGT CCGGATATGT GCATCTTGAA GGCGGGAACT TGTTCTTCTC CCTGATTGAG GCCAAGGAAT CTGCGAGAAA TAACGGGCTG GTGATTTACT TTGAAGGTGG ACCGGGAGGA AGTGCGATGG ATTACCCTTT CCTTGGTGCC GGACCATGTC AGCTTACTCC CGAAGGAGGG ACTATACTCT CGCCTGCACC GTACCCATGG ACAGATCATG CCAATCTCTT GATCCTCGAA TACCCGATTC CTACCGGATG GTCTTACAAC ACTACTTCCC ATATACCACA TGATTCGTCT GCTGGCGCTG CTGAAGACTT TGATGACTTT CTTCAAGCTT TGCTTCACCA TTTCCCTCAG TTTGTTCATC AACCATTGAT CATTAGCAGC TTGTCATATG GAGGAACAAC GGCCGCTCAC ATGGCGTCGA CTGTGCTACG TAGGAACCAG AATGCAGGGA TGTTTTCCAC CAGGATCAAA AAGCATATTG ATCAGTTAGT ACTGGGTAAT CCCTTTGGCG ATGTCATGAC GGTCATGTAA GTTGATCCCT TCGTAGTCTG CATTCGTAAT CTAATCAAGC CAACAGATAC CAAAACCTTC ATCATCTTTG CTACACCCCT CCAGCAATCC TTCCTTCCTC CTCTTGTGAA ACACTTGAAT CCTACCTTAC CCCATGCCTT GACCGTCTTG CATTCCTCAC GTCTGAATCC ACCTCGCATC TTTCCACTCG TGAGCTCCGA CGTGAGGCTG CTAAATATTG CGCACCGCCA TTCGAGTTGA CTTGGAAGGA CGCAAGGGTA GACAGGTATG ACAGTCGCAA ACCGCCTTGC TGGCCAGTAG ACACTTGCTG GTGGTGGAAC GATAGTTTGC GAGCTTTGAT GAACAGCGAT GAGATGAAGG AAATCGTAAG TGTACATTTG CATACCAACA TTAGATCGCT CACTAACTCG AGCGGCAGTT TGGTGTCCCG TCTCATTTGA CTTGGAGTTT CCTGGGCCTT TCTAGCTTGT ATTTCTATCT TAATGCGGAC AAGTAAGTTC CATTCTCGCC TCAAGGGATA CGTGCTAACA CGTCATCTCA CAGTATGCAA GCTGCACACC ACCTCCTCCC TGCCGTCATC GACGCTGGCA CCCGCATTTT TGTGTACAGC GGTATGAATG ACACTATCCT CCCATATGAA GGTTCACTCG CCTGGGTACG CCCAAAATGT TCTCTTGCCA CCTTGCAACT AATATTTTTG TTTAGATGTC CCGTATCCCT TCTTCCCAAC TTTCAGCATT CCGCCAAACC CCCATCACCA TTCCCCCATC GGCGGAGCCA TCAGAAACAG CATTCAGAGG TATCGTCCAT AATCCCGGAG GCGCCGTAAC GCTGTATGGT TTCCCAGATG CTGGGCACAT GGCGCAGGTA GATCAGCCGA CGGTGGTTTG GAAGATTTTG GAGAATGCTG TGAAAGGGGA GAACTGGAAT CCACTTGAAG GGTGGTGGTA ATGGGAATGT CAAATGTATG AACATGCTCG 
GCCCGCAACC CACTTTTGGA CTGGATGGGG AGTTTTGTTT TATAGAAGCC GGATAGAATA CATGTCA
|
Protein sequence | MLFSGLSSPH IHDRARGILM SGMYLLAQIA AFLGWLLGSQ TSRSLPPLSI TSQSELSCGS ISHPITSGYV HLEGGNLFFS LIEAKESARN NGLVIYFEGG PGGSAMDYPF LGAGPCQLTP EGGTILSPAP YPWTDHANLL ILEYPIPTGW SYNTTSHIPH DSSAGAAEDF DDFLQALLHH FPQFVHQPLI ISSLSYGGTT AAHMASTVLR RNQNAGMFST RIKKHIDQLV LGNPFGDVMT VIYQNLHHLC YTPPAILPSS SCETLESYLT PCLDRLAFLT SESTSHLSTR ELRREAAKYC APPFELTWKD ARVDRYDSRK PPCWPVDTCW WWNDSLRALM NSDEMKEIFG VPSHLTWSFL GLSSLYFYLN ADNMQAAHHL LPAVIDAGTR IFVYSGMNDT ILPYEGSLAW MSRIPSSQLS AFRQTPITIP PSAEPSETAF RGIVHNPGGA VTLYGFPDAG HMAQVDQPTV VWKILENAVK GENWNPLEGW W
|
| |
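The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon. The helper names `gene_length` and `unspliced_coding_bp` are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 1041103
END_BP = 1042969
PROTEIN_LENGTH_AA = 491


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def unspliced_coding_bp(protein_aa):
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


length = gene_length(START_BP, END_BP)
coding = unspliced_coding_bp(PROTEIN_LENGTH_AA)

print(length)           # 1867, matches the "Gene Length" field
print(coding)           # 1476
print(length - coding)  # 391 bases not accounted for by coding sequence
```

Under these assumptions the 391 bp difference is consistent with the record marking the gene as spliced: the genomic span would include introns and possibly untranslated sequence in addition to the coding bases.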