Gene PICST_80440 details

Gene Information

Locus tag: PICST_80440
Symbol: AAT1
ID: 4851459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1861212
End bp: 1862642
Gene Length: 1431 bp
Protein Length: 439 aa
Translation table:
GC content: 46%
IMG OID: 640393167
Product: aspartate aminotransferase
Protein accession: XP_001387990
Protein GI: 126274587
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1448] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
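
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1861212
end_bp = 1862642
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon, so the open reading
# frame is (residues + 1 stop codon) * 3 bases.
orf_length_bp = (protein_length_aa + 1) * 3

# Bases in the reported gene span that lie outside the open reading frame.
flanking_bp = gene_length_bp - orf_length_bp

print(gene_length_bp, orf_length_bp, flanking_bp)
```

Under these assumptions the computed span matches the reported gene length of 1431 bp, and the span is longer than the open reading frame alone.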


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTACCCGTAG TACCCCTTCT TGTATCCCAA GCCCTTATAG AAGATGTATA GAACCTCGTT 
GCTCAAGCAG ACTGCACGTC CTTCCGTCCG AGTCTCCACC AGACAATTCT CAGTGCTCAA
CAACCAGGTC AGAAAGTGGA GCGAAATCCC ATTGGCTCCT CCAGACAAGA TCTTGGGTAT
CTCCGAAGCC TACAACAAGG ACGCCAACAC CTCCAAGATC AACTTGGGTG TCGGAGCCTA
CAGAGACAAC TCCGGTAAGC CTATCATCTT CCCAAGTGTC AAGGAAGCTG AAAAGATCTT
GCTTGCCAGC GAAGTTGAAA AGGAATACAC CGGTATCACT GGTTCCAAGA AGTTCCAGAA
CGCCGTCAAG GGCTTTGTTT TCAACAACTC CGGCAAGGAT GTCAACGGTC AACAATTGAT
TGAACAAAAC AGAATTGTCA CTGCCCAGAC CATCTCTGGT ACTGGTTCCT TGAGAGTCAT
TGGTGACTTC TTGAACAGAT TCTACACCAA CAAGAAGCTC TTGGTTCCAA AGCCTACCTG
GGCCAACCAC GTTGCCGTTT TCAAGGACGC TGGCTTAGAA CCAGAATTCT ACGCTTACTA
CGAGACTTCC AAGAACGACT TGGATTTCGC CAACTTGAAA AAGTCCTTGT CTTCCCAGCC
AGACGGCTCT ATTGTCTTGT TGCATGCCTG TTGCCACAAC CCAACTGGTA TGGACTTGAC
TCCTGAACAG TGGGAAGAAG TTTTGGCTAT TGTCCAAGAG AAGAACTTCT ACCCACTTGT
TGACATGGCC TACCAAGGTT TCGCTTCCGG TAACCCATAC AAGGACATTG GCTTGATCAG
AAGATTAAAC GAGTTGGTTG TCCAGAACAA GCTCAAGTCC TACGCCTTGT GTCAATCGTT
TGCTAAGAAC ATGGGTCTCT ATGGTGAAAG AACTGGTTCT ATCTCCATCA TCACTGAGTC
TGCCGAAGCT TCTCAAGCCA TTGAGTCTCA ATTGAAGAAG TTGATCAGAC CAATCTACTC
CTCTCCACCA ATCCACGGTT CCAAGATTGT CGAAATCATC TTTGATGAGC AACACAACTT
ATTGAACTCG TGGTTGCAAG ACTTGGACAA GGTTGTTGGT AGATTGAACA CTGTCAGATC
CAAGTTGTAC GAAAACTTGG ACAAGTCCTC TTACAACTGG GACCACTTGT TGAAGCAAAG
AGGTATGTTC GTGTACACTG GTTTGTCTGC TGAGCAAGTT ATCAAGTTGA GAAACGACTA
CTCGGTCTAC GCTACTGAAG ACGGAAGATT CTCCATCTCT GGAATCAACG ACAACAATGT
CGAGTACTTG GCTAACGCCA TCAACGAAGT CGTCAAGCAG TAGACGTATA GATGGTCTGC
TATTTTTTCT ACGAATTCAT AAATTATATA TCAATAAAAT GACTTATGGT T
 
Protein sequence
MYRTSLLKQT ARPSVRVSTR QFSVLNNQVR KWSEIPLAPP DKILGISEAY NKDANTSKIN 
LGVGAYRDNS GKPIIFPSVK EAEKILLASE VEKEYTGITG SKKFQNAVKG FVFNNSGKDV
NGQQLIEQNR IVTAQTISGT GSLRVIGDFL NRFYTNKKLL VPKPTWANHV AVFKDAGLEP
EFYAYYETSK NDLDFANLKK SLSSQPDGSI VLLHACCHNP TGMDLTPEQW EEVLAIVQEK
NFYPLVDMAY QGFASGNPYK DIGLIRRLNE LVVQNKLKSY ALCQSFAKNM GLYGERTGSI
SIITESAEAS QAIESQLKKL IRPIYSSPPI HGSKIVEIIF DEQHNLLNSW LQDLDKVVGR
LNTVRSKLYE NLDKSSYNWD HLLKQRGMFV YTGLSAEQVI KLRNDYSVYA TEDGRFSISG
INDNNVEYLA NAINEVVKQ
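
The two sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below is one hedged way to do that in plain Python, then compute GC content and translate from the first ATG. The record leaves the translation table blank, so the standard genetic code is an assumption here, and the helper names (`clean`, `gc_percent`, `translate_from_first_atg`) are hypothetical.

```python
from itertools import product

# Standard genetic code (an assumption: the record does not state a translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_from_first_atg(seq: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = clean(seq)
    start = seq.find("ATG")
    if start < 0:
        return ""
    residues = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 80 bases of the gene sequence listed above.
excerpt = """
CTACCCGTAG TACCCCTTCT TGTATCCCAA GCCCTTATAG AAGATGTATA GAACCTCGTT
GCTCAAGCAG ACTGCACGTC
"""

print(translate_from_first_atg(excerpt))  # MYRTSLLKQTAR
```

The translated excerpt agrees with the first twelve residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field in the record.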