Gene PICST_91065 details


Gene Information

Locus tag: PICST_91065
Symbol: TRP6
ID: 4840460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 222916
End bp: 224485
Gene Length: 1570 bp
Protein Length: 512 aa
Translation table: 12
GC content: 43%
IMG OID: 640391775
Product: Anthranilate synthase component II. Includes: Glutamine amidotransferase; Indole-3-glycerol phosphate synthase (PRAI)
Protein accession: XP_001386254
Protein GI: 126139463
COG category:
    [E] Amino acid transport and metabolism
    [H] Coenzyme transport and metabolism
COG ID:
    [COG0134] Indole-3-glycerol phosphate synthase
    [COG0512] Anthranilate/para-aminobenzoate synthases component II
TIGRFAM ID: [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
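
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state the convention), and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 222916
end_bp = 224485
gene_length_bp = 1570
protein_length_aa = 512

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A 512-residue protein needs 512 codons plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the listed gene span that are not part of the coding frame.
extra_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, extra_bp)
```

Under that assumption the span equals the stated gene length, and the 31 bases left over correspond to the bases that precede the first in-frame ATG in the gene sequence listed in this record.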


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.248052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.833789
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAGAACCACC CCAGCTTGTA GAATCTTCAG CATGACCAAA GAATTGCCCA AGAGACATGT 
CTTAATTATA GACAACTACG ACTCGTTCAC TTGGAACCTC TACCAGTATT TATACCAGTC
ACCATTGTGT GGGAAGGTAG ACGTTTTCAG AAACGACAAG ATCGACATTT CTACCATTGA
AAATCAAGTC AAGCCCGACA TTATCCTCAT TTCTCCAGGA CCGGGCCATC CTACTAGCGA
TTCAGGAATC TCTCGAGACG TCATCAAACA CTTCATGGGA AAACTTCCTA TCTTTGGCAT
CTGTATGGGT CAGCAATGTA TGATTGATGC GTTTGGTGGA GAAGTGACTT ATGCCGGAGA
AATTGTCCAC GGCAAGACAA CTACCATTAA ACATGACGGT AGAGGAATGT TTGACACCGT
TCCACAATCA GTTGCTGTCA CGAGGTACCA TTCATTGGCT GGAACCCAAC AGACATTGCC
AGACTGTTTG GAAGTCACAG CACTAACCGA GACTACTCCA GAGATCATTA TGGGAGTTAG
ACACAAGATT TACACAATTG AAGGAGTTCA GTTCCACCCA GAATCCATTT TGACCGAATC
GGGCCAAATA ATGATCAACA ACTTGTTGTC TGTCACAGGA GGAACCTGGG ACGAGAACAA
AGCCAACGGA TCCGGCTTTT CAAAGAAGGA AAATATTTTG TCCAAGATCT ACAAGCAACG
TCAAATTGAC TACAAGAGAA TTGAGAGCTT GCCAGGAAAA TCACTTGAGC AATTGGAGAT
CTCATTGGCA TTGAACATTG CTCCACCCAT AACAGACTTC TACCAGAGAT TGAAGTACAC
CCAGGACGTG CTCAAGCAAA CCATCATTTT GTCAGAGTTC AAGCGTGCAT CTCCGTCTAA
GGGAGACATC AATATTGATG CGCATCCAGG TAAACAGGCA TTGACATATG CTACCAATGG
ATGTTCCACC ATATCGGTGT TGACCGAACC CAAGTGGTTT AAGGGTTCCT TGGATGACTT
GTCATTGATC CGTAAGGTTA TCGATATTCC AACTACTGAA GGATACAAGA GACCAGCCGT
ATTGAGAAAA GAGTTCATTT TCAGCAAGTA CCAGATCTTG GAAGCTAGAT TGGCAGGCGC
AGATACTGTG CTATTAATAG TGAAGATGTT GAATGATATC AAGTTATTAC AACAACTATA
CGAATACTCT TTGTCGTTGG GTATGATTCC GTTGGTTGAA GTTCAAAACA AGCAAGAATT
GGACCAGGCA GTGAAGTTAA CGTACAACGA TGACACCAAG GAGCCATTGG TGATCGGTGT
CAACAACAGA AACTTAGCTA CATTCGACGT TGATTTGAAT ACCACCAGCT CTTTGGTTGA
ATCTTCAAAG AAAAGCCAAA GAAGGGGTGA TGTTCTTGTG TTGGCATTAT CGGGAATTAC
TTCCGTTGAA GACGTTAAGA ACTACAAGTA CAACGACGGA GTTGACGGCT TTTTGATTGG
CGAAAGTTTG ATGAGAGCAG AAGAAAGGGG AGAGGCAGGC AAGTTCTTGA ACGACTTGTG
CAATTGCTAA
 
Protein sequence
MTKELPKRHV LIIDNYDSFT WNLYQYLYQS PLCGKVDVFR NDKIDISTIE NQVKPDIILI 
SPGPGHPTSD SGISRDVIKH FMGKLPIFGI CMGQQCMIDA FGGEVTYAGE IVHGKTTTIK
HDGRGMFDTV PQSVAVTRYH SLAGTQQTLP DCLEVTALTE TTPEIIMGVR HKIYTIEGVQ
FHPESILTES GQIMINNLLS VTGGTWDENK ANGSGFSKKE NILSKIYKQR QIDYKRIESL
PGKSLEQLEI SLALNIAPPI TDFYQRLKYT QDVLKQTIIL SEFKRASPSK GDINIDAHPG
KQALTYATNG CSTISVLTEP KWFKGSLDDL SLIRKVIDIP TTEGYKRPAV LRKEFIFSKY
QILEARLAGA DTVLLIVKML NDIKLLQQLY EYSLSLGMIP LVEVQNKQEL DQAVKLTYND
DTKEPLVIGV NNRNLATFDV DLNTTSSLVE SSKKSQRRGD VLVLALSGIT SVEDVKNYKY
NDGVDGFLIG ESLMRAEERG EAGKFLNDLC NC
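
The record lists translation table 12, the NCBI alternative yeast nuclear code, which differs from the standard code in that CTG encodes serine rather than leucine. The following is a minimal sketch, not the database's own method, of how the start of the gene sequence above maps onto the start of the protein sequence. The helper `translate`, the constant names, and the 31-base offset to the first in-frame ATG are introduced here for illustration; only the first two lines of the gene sequence are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12 in TCAG codon order: identical to the
# standard code except that CTG encodes Ser (S) instead of Leu (L).
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_TABLE_12)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0 with table 12, ignoring whitespace
    and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First two lines of the gene sequence from this record.
GENE_PREFIX = (
    "CAGAACCACC CCAGCTTGTA GAATCTTCAG CATGACCAAA GAATTGCCCA AGAGACATGT "
    "CTTAATTATA GACAACTACG ACTCGTTCAC TTGGAACCTC TACCAGTATT TATACCAGTC"
)
# Number of bases before the ATG that starts the listed protein.
UPSTREAM_BP = 31

cleaned = "".join(GENE_PREFIX.split())
peptide = translate(cleaned[UPSTREAM_BP:])
print(peptide)  # MTKELPKRHVLIIDNYDSFTWNLYQYLYQ
```

The printed peptide matches the first 29 residues of the protein sequence above. This window happens to contain no CTG codon, so the table 12 difference does not show up here; `translate("CTG")` returns `S` under this table.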