Gene PICST_85710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85710 
SymbolTRP2 
ID4841010 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp788613 
End bp790325 
Gene Length1713 bp 
Protein Length514 aa 
Translation table12 
GC content45% 
IMG OID640392325 
Productanthranilate synthase component 
Protein accessionXP_001386741 
Protein GI126140438 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACGAAGTCT TTGTAAATAT CAACATGGTG GTAAGTCTAT TTGAGGAAGT GGGAGAATGA 
CCAGCTGTAC CATTGCGTAT AACAGAAGAA AAAGTACTAA CTTTTCCAGG CGCAAATCCA
GCCTTCGTTC GAGGCTTTAA AGGCCATCGT AGACTCTCAT CAGGAGTTGG AGACACAGCA
CAATATGTAT CCCATCTACC AGTATGTAAA CAACAACGAC TTGTCTGTTC ACCAGGCTTA
CTTGAAGTTG GCCAAGTTGA ACGACAAGGG TCGCTCACCT TCTTTCCTCT TTGAGTCGGC
TGTTAAGGGT GACACTGTAG ACAGATATAG TTTCATCGGA GTCAACCCCA AGAAGGTCAT
CAAGACAGGA GACGATGAAT CCAAGTATGC TGAATCATTT TGCAATGTTG ACCCTATCAC
AGTGTTGGAG AAGGAAATGA AAAACTACAA CCAGGCACAA TTGCCCGGTT TACCTAAATT
CAGTGGTGGT GCTACTGGGT ACATCTCGTA TGATTGTATC AAGTACTTTG AACCTAAGAC
CAGAAGACCA TTGAAAGACG TATTGGGTTT TCCTGAAGCA GTTCTCATGC TTTGTGACTT
AGTCGTGGCC TTTGACCACG TTTTCCAAAG ATTCCAGATC ATCAACAATG TCAGAGTAGG
CGAAGGTGAA GATCTTGCTA CCAACTTCGC CAAAGCCGAA AAGGAAATAC AAGACGTAGA
AAGGTTGTTG TCATCAGAGT TTTCAAGCGA CCTCAATCCA CAACAGCCTC CCATCAAGCT
TGGCCAGACA TTCACGTCGA ATATCGGGAA GGAAGGCTAC GAAGCCCATG TGTCTAATTT
GAAGAAGCAC ATCTTGTTGG GAGACATTAT CCAGGCCGTG CCCTCGCAGA GAGTCGCCAG
ACCTACATCC TTGCATCCGT TCAACATATA CCGTCAGTTG AGATCAGTTA ATCCATCTCC
ATACTTGTTC TACGTTGATT TGGTGGATTT CCAGATCATT GGAGCTTCTC CAGAATTGTT
AGTACAAGCT GATGCTAAGG GAAGAGTCGT TACACACCCC ATTGCCGGGA CTATTGCCAG
AGGTAAAACC AACGAAGAAG ATGAAGCTAA TGCTGAAATC TTGAGATCGA GTTTGAAGGA
TAGAGCCGAA CACATCATGT TGGTGGACTT GGCCCGTAAC GACATTAATA GAGTTTGCCA
GCCTACCACC AACAAAGTGG ACCGTTTGCT CACCATCGAG AGATTCTCAC ATGTGATGCA
TTTGGTATCT GAAGTCAGCG GAACTTTGAG AGAAGACAAG ACCCGTTTTG ATGCGTTCAG
ATCCATCTTC CCAGCTGGTA CTGTTAGTGG CGCTCCTAAG GTGAGAGCTA TGGAGCTCAT
TGCCGAGTTG GAAAAGGAAA AAAGAGGTGT ATACGCTGGT GCTGTAGGCC ACTGGGGCTA
CGACGGTAAG ACCATGGACA CCTGTATTGC CTTGAGAACC ATGGTGTACA AAGACGGCGT
GGCGTACTTG CAAGCCGGTG GAGGTATTGT ATTCGACAGT GACGAATATG ATGAGTATAT
CGAAACAATG AACAAGATGA AGGCCAACAA CAACACCATC GTGGTGGCTG AAGAGTACTG
GGCTGAAAAG GTCGGTATCC AAAAATAGAC ATCGATGTGT ACCAATATTC ATACATAGTA
AATACATTTA AAAGTTTATT AAGCTTTGGG TGC
 
Protein sequence
MVAQIQPSFE ALKAIVDSHQ ELETQHNMYP IYQYVNNNDL SVHQAYLKLA KLNDKGRSPS 
FLFESAVKGD TVDRYSFIGV NPKKVIKTGD DESKYAESFC NVDPITVLEK EMKNYNQAQL
PGLPKFSGGA TGYISYDCIK YFEPKTRRPL KDVLGFPEAV LMLCDLVVAF DHVFQRFQII
NNVRVGEGED LATNFAKAEK EIQDVERLLS SEFSSDLNPQ QPPIKLGQTF TSNIGKEGYE
AHVSNLKKHI LLGDIIQAVP SQRVARPTSL HPFNIYRQLR SVNPSPYLFY VDLVDFQIIG
ASPELLVQAD AKGRVVTHPI AGTIARGKTN EEDEANAEIL RSSLKDRAEH IMLVDLARND
INRVCQPTTN KVDRLLTIER FSHVMHLVSE VSGTLREDKT RFDAFRSIFP AGTVSGAPKV
RAMELIAELE KEKRGVYAGA VGHWGYDGKT MDTCIALRTM VYKDGVAYLQ AGGGIVFDSD
EYDEYIETMN KMKANNNTIV VAEEYWAEKV GIQK