Gene PICST_50446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50446 
SymbolAAT3 
ID4840843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp45749 
End bp46963 
Gene Length1215 bp 
Protein Length404 aa 
Translation table12 
GC content36% 
IMG OID640392158 
ProductAspartate aminotransferase (Transaminase A) (AspAT) 
Protein accessionXP_001386410 
Protein GI126139776 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.566303 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAGC AAGAAGATTT CGATGTGGAG CAATTCATGG ACAAGTATGA AACCAACATT 
GTCAACAATA TGGGAGAAAC ATGTTGTGAC TCTTTAAGCG TAAACCAATT GCTTGAGTTG
ATATCCAAGG AAAAACCCGA AGTCGATATC AGTTTGAAAA GAAAGCAAAT ATGTGATTTG
ATCTTGGATA CTAAGGCCAC CTACGGCCAT ATTAGAGGTT CTCCAGACTT GAAGTCAGCA
ATAGCAGCAA TTTACAACGA AGACTTAAAA ACTGAAGGAA TTTCTAACGA TAATATTGTT
GTCACAAATG GTGCTATTGG TGCAAACTTC TTAACATTGT ATTCTCTTGT CGATGCTAAT
GATAAAGTAA TAGTTGTCAG CCCCACTTAT CAACAATTGG GGAGTGTAAG CGCAGTTTTC
TCTCAATCAA AGGTCAATGT GATCCCTTTT GAATTGAAAT ATGAAAATGA ATACTTGCCT
GACTTGGTCG AATTGAAGCA ATTAATTGAA ACTCATTTTC CTAAGCTAGT AATTATCAAC
AACCCAAATA ATCCTACTGG GGTCGTCTGG GACAACGAAA CTATGGAAAA AATGGTAAAC
CTCTGTAAAT CGAAGGACGT TTGGTTGATG TGTGATGAAG TATATCGTCC ATTATATCAT
TCTGTAAAAA GAGAAGATCA GCCAAAATCA GTGGTTAATT ATGGATATGC AAAAACAATT
TCTACCGGTT CAACTTCAAA GGCATTTGCA TTTGCAGGGT TGAGATTGGG TTGGGTTGTG
ACTCGTGATA AGGCTTTGCT TGACAATTTG TTTTCGAAGC GTGATTATAA CACGATCTCG
GTCTCGATGG TGGATGACAG TCTTGCCACG TTGGTTTTAC AAAACCACAA AGTTATTTTG
AAAAGAAACT ATGATATTTG CTTGAAGAAC ATTGAAATTG TTCAGAAGGA AATAGACCAT
TCTAATGGAT TGCTTAGCTG GATAAGACCA AAAAGTGGAA CAACTTGTTT TATCAAAATT
AATATTCCAA ATCTCGACAC CTACAAATTG TGTAGTGAAT TAGCCGAACA ACACAATACT
TTAGTGGTAC CTGGTGAAGT ATTTGACAAC AGAGCTGGTT TCCTCAGGGT TGGGTTTGGA
AATTCTCCTG AAAGCATTGT TGGAGGGTTT TATGAATTAA AAAAATGGTT TATCAAAAAT
GGTTACCAAA AATAA
 
Protein sequence
MVKQEDFDVE QFMDKYETNI VNNMGETCCD SLSVNQLLEL ISKEKPEVDI SLKRKQICDL 
ILDTKATYGH IRGSPDLKSA IAAIYNEDLK TEGISNDNIV VTNGAIGANF LTLYSLVDAN
DKVIVVSPTY QQLGSVSAVF SQSKVNVIPF ELKYENEYLP DLVELKQLIE THFPKLVIIN
NPNNPTGVVW DNETMEKMVN LCKSKDVWLM CDEVYRPLYH SVKREDQPKS VVNYGYAKTI
STGSTSKAFA FAGLRLGWVV TRDKALLDNL FSKRDYNTIS VSMVDDSLAT LVLQNHKVIL
KRNYDICLKN IEIVQKEIDH SNGLLSWIRP KSGTTCFIKI NIPNLDTYKL CSELAEQHNT
LVVPGEVFDN RAGFLRVGFG NSPESIVGGF YELKKWFIKN GYQK