Gene Information |
Locus tag | PICST_85710 |
Symbol | TRP2 |
ID | 4841010 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 788613 |
End bp | 790325 |
Gene Length | 1713 bp |
Protein Length | 514 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640392325 |
Product | anthranilate synthase component |
Protein accession | XP_001386741 |
Protein GI | 126140438 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
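The length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the record. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the coding sequence is one codon per residue plus one stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 788613
end_bp = 790325
gene_length_bp = 1713
protein_length_aa = 514

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: coding nucleotides = one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Anything left over inside the gene span is non-coding
# (intron and/or untranslated sequence; the record marks the gene as spliced).
noncoding_bp = span_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # 1713 1545 168
```

Under these assumptions the span matches the reported Gene Length of 1713 bp, and 168 bp of it does not code for protein. That is consistent with "Is gene spliced | Yes", although the arithmetic alone cannot say how the 168 bp split between intron and untranslated sequence.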
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | AACGAAGTCT TTGTAAATAT CAACATGGTG GTAAGTCTAT TTGAGGAAGT GGGAGAATGA CCAGCTGTAC CATTGCGTAT AACAGAAGAA AAAGTACTAA CTTTTCCAGG CGCAAATCCA GCCTTCGTTC GAGGCTTTAA AGGCCATCGT AGACTCTCAT CAGGAGTTGG AGACACAGCA CAATATGTAT CCCATCTACC AGTATGTAAA CAACAACGAC TTGTCTGTTC ACCAGGCTTA CTTGAAGTTG GCCAAGTTGA ACGACAAGGG TCGCTCACCT TCTTTCCTCT TTGAGTCGGC TGTTAAGGGT GACACTGTAG ACAGATATAG TTTCATCGGA GTCAACCCCA AGAAGGTCAT CAAGACAGGA GACGATGAAT CCAAGTATGC TGAATCATTT TGCAATGTTG ACCCTATCAC AGTGTTGGAG AAGGAAATGA AAAACTACAA CCAGGCACAA TTGCCCGGTT TACCTAAATT CAGTGGTGGT GCTACTGGGT ACATCTCGTA TGATTGTATC AAGTACTTTG AACCTAAGAC CAGAAGACCA TTGAAAGACG TATTGGGTTT TCCTGAAGCA GTTCTCATGC TTTGTGACTT AGTCGTGGCC TTTGACCACG TTTTCCAAAG ATTCCAGATC ATCAACAATG TCAGAGTAGG CGAAGGTGAA GATCTTGCTA CCAACTTCGC CAAAGCCGAA AAGGAAATAC AAGACGTAGA AAGGTTGTTG TCATCAGAGT TTTCAAGCGA CCTCAATCCA CAACAGCCTC CCATCAAGCT TGGCCAGACA TTCACGTCGA ATATCGGGAA GGAAGGCTAC GAAGCCCATG TGTCTAATTT GAAGAAGCAC ATCTTGTTGG GAGACATTAT CCAGGCCGTG CCCTCGCAGA GAGTCGCCAG ACCTACATCC TTGCATCCGT TCAACATATA CCGTCAGTTG AGATCAGTTA ATCCATCTCC ATACTTGTTC TACGTTGATT TGGTGGATTT CCAGATCATT GGAGCTTCTC CAGAATTGTT AGTACAAGCT GATGCTAAGG GAAGAGTCGT TACACACCCC ATTGCCGGGA CTATTGCCAG AGGTAAAACC AACGAAGAAG ATGAAGCTAA TGCTGAAATC TTGAGATCGA GTTTGAAGGA TAGAGCCGAA CACATCATGT TGGTGGACTT GGCCCGTAAC GACATTAATA GAGTTTGCCA GCCTACCACC AACAAAGTGG ACCGTTTGCT CACCATCGAG AGATTCTCAC ATGTGATGCA TTTGGTATCT GAAGTCAGCG GAACTTTGAG AGAAGACAAG ACCCGTTTTG ATGCGTTCAG ATCCATCTTC CCAGCTGGTA CTGTTAGTGG CGCTCCTAAG GTGAGAGCTA TGGAGCTCAT TGCCGAGTTG GAAAAGGAAA AAAGAGGTGT ATACGCTGGT GCTGTAGGCC ACTGGGGCTA CGACGGTAAG ACCATGGACA CCTGTATTGC CTTGAGAACC ATGGTGTACA AAGACGGCGT GGCGTACTTG CAAGCCGGTG GAGGTATTGT ATTCGACAGT GACGAATATG ATGAGTATAT CGAAACAATG AACAAGATGA AGGCCAACAA CAACACCATC GTGGTGGCTG AAGAGTACTG GGCTGAAAAG GTCGGTATCC AAAAATAGAC ATCGATGTGT ACCAATATTC ATACATAGTA AATACATTTA AAAGTTTATT AAGCTTTGGG TGC
Protein sequence | MVAQIQPSFE ALKAIVDSHQ ELETQHNMYP IYQYVNNNDL SVHQAYLKLA KLNDKGRSPS FLFESAVKGD TVDRYSFIGV NPKKVIKTGD DESKYAESFC NVDPITVLEK EMKNYNQAQL PGLPKFSGGA TGYISYDCIK YFEPKTRRPL KDVLGFPEAV LMLCDLVVAF DHVFQRFQII NNVRVGEGED LATNFAKAEK EIQDVERLLS SEFSSDLNPQ QPPIKLGQTF TSNIGKEGYE AHVSNLKKHI LLGDIIQAVP SQRVARPTSL HPFNIYRQLR SVNPSPYLFY VDLVDFQIIG ASPELLVQAD AKGRVVTHPI AGTIARGKTN EEDEANAEIL RSSLKDRAEH IMLVDLARND INRVCQPTTN KVDRLLTIER FSHVMHLVSE VSGTLREDKT RFDAFRSIFP AGTVSGAPKV RAMELIAELE KEKRGVYAGA VGHWGYDGKT MDTCIALRTM VYKDGVAYLQ AGGGIVFDSD EYDEYIETMN KMKANNNTIV VAEEYWAEKV GIQK