Gene PICST_90828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_90828 
SymbolFAO1 
ID4840653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp370988 
End bp373299 
Gene Length2312 bp 
Protein Length699 aa 
Translation table12 
GC content44% 
IMG OID640391968 
Productlong chain fatty acid oxidase 
Protein accessionXP_001386087 
Protein GI126139129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.654464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAATGAGTTC ATTTCTGCAT AACGTATTCT GCATGGAAAT TAGTATATAT AATAGTTGAT 
AATCTGAACT CACCTTTGCT ACTTAACGGA ATAATAGCTA CAACTCGTTA TATTTACTAA
TAATATGAGT TCGGTTAGTG AGAGTCATAT AGAGATTTTG GTGGCATTGG CCGATGGGAT
CATCCATGAC ACAACGGCCG ATGTCGTCAG ACCCTACTTG GACCCCGGAT TTCCAATTGA
CAGGCTTGAA GAATACTTGA AGAACTACAC CAGACCCTCA GAAACAGAAG GGTTTAAGCA
ATTCGTCACT AACGTGATCA ATTCGAATCC GACCCATTCC AGGAGAATGT TCACTGTGCT
TATGACGGTG TTAGACTCCC GGATCTTAGC ACCAACTCTC ACGGGCTCGC TTACATTGAT
CAGAGATATG ACGGTAGAAC AGCGTGAGCA GCTATTACAA TCGTGGAGAG ATTCTCCTAT
AGTGACCAAA AGACGTGCAT TCAGAATGGT GCACGCTGCA GCCGTGTCCA CATTCGCGAG
AATGGCTACA GACTTGCACT TGAAGGCTAT TGGCTATCCT GGAGTCGAGA CGAGAGAAAA
GCTATATGCT GACCAAGTGC CCGACACTTT CAAATATTCG ATGTTGAAAA AACCACAAGT
TGACGGCTAC GAGTTGCACA TTCCTGACGT CGATGTGTTA ATCATTGGGT CTGGTTCTGG
TGCCGGTGTC GTTGCTCATA CTATAGCTGA AGCAGGATAT AAGGCTTTGG TTCTCGAGAA
AGGAAAGTAC TTTTCGAGCG AAGAATTCAC CTTCAATGAC TTGACTGGAT ACCAGAACTT
GTATGAACAA CAAGGTGCCT TAGTCTCTTC CAACCAACAG TTGTTCGTAT TGGCTGGGTC
TACATTTGGT GGAGGTTCTG CCATCAACTG GTCTGCTTGT TTGAAGACTC CATTCAAGGT
CAGAAAAGAA TGGTACGACG ACTTTGGTCT TGAATGGGCT GCCAGCGAGT CATTTGACAA
GTGCACTGAC TATGTATGGA CTCAGATGGG TGCGAACAAG AACAACATCA ATCATTCTTT
AGCCAATAAA GTTATTTTAG AAGGTGGTGC CAAGTTGGGC TACAAGGTCA AGGAAGTGGA
ACAGAACAAT GGAGCCCACG CTGATCACAG CTGTGGTTTT TGTCACCTCG GTTGTAAATA
CGGAATCAAA CAGAGTTCCC CAGCTTGCTG GTTCAGAGAA CCAGCAGACA AGGGCTCATT
GTTTATGGAC CAAGTTAAAG TGATCAAGGT TCTCCACAAT CGTGGAGTAG CCATCGGAGT
CTTGTGTGAA GATATCCTCA CTGGAAAGCA GTTCAAAATT ACAGGTCCTA AGAAGTACGT
AGTGAGTGGT GGTTCATTGT GTACGCCAGT TGTTTTACAG AACTCCGGTT TCAGAAACAA
GCATATAGGT GCCAACTTGA AGTTGCACCC TATCTCGATT GTTTTCGGAA ACTTTGGCAG
AGAAGCCAGA GCTGATCCTC ATGAACATCC TATATTGACT TCTGTGTGCA CTGAGGTGGA
CGATTTGGAT GGAAAGGCTC ACGGTGCTAA GATCGAAACC GTTTTGAATG CTCCATTCTT
GGAGTCGGTA TTCCTTCCAT GGCAGAACAG TGACAAGCTC AGAGAAGACT TGTTGAAGTA
CCAGAATCTT GCAACCATGT TGCTAATCAC AAGAGACAAA TCCAGCGGAT ATGTGAGGGC
AGATTCCAAC GCTCCTAATT CGTTGATTGT TGACTACACA GTTAACCAAT ACGACCGTAA
TGCTTTGCTC CAGGCTTTTG TCACCACTGC AGACATGCTC TATATCCAAG GTGCAAAAGA
GATTTTTGGT TCCCAAGCAT GGCTTCCTGT GTTCAAGTCA GAAAAACCCA AACACGAAAG
AGCCATCACC GACCAAGACT TTGTGGATTG GAGAAATGCC GTGTTGAAGA TAGGCTTAGA
TTCATACGGA AACGTGTATG GTTCTGCTCA TCAGATGAGT TCCTGCCGTA TGTCTGGAAA
GGGCCCTAGA TACGGGGCTT GTGACGAGAA CGGCCATTTG TTTGAATGTA AAAATGTCTA
CGTTGCAGAT GCCAGCGCTA TGCCTACCGC CAGTGGTGCC AATCCTATGA TCACAACCAT
GGCTATAGCA AGACATGTAG CTCTTGGACT TGTCAAGGAC TTGCAACCAG CAGCAAAGCT
TTGATTTATT TATGTAATGA TTGAATACTA TTGTACACTT GCAATTAGAT ACACTGTGTA
CTACTGTATA TGATATTAAA GATTTTGTAA TT
 
Protein sequence
MSSVSESHIE ILVALADGII HDTTADVVRP YLDPGFPIDR LEEYLKNYTR PSETEGFKQF 
VTNVINSNPT HSRRMFTVLM TVLDSRILAP TLTGSLTLIR DMTVEQREQL LQSWRDSPIV
TKRRAFRMVH AAAVSTFARM ATDLHLKAIG YPGVETREKL YADQVPDTFK YSMLKKPQVD
GYELHIPDVD VLIIGSGSGA GVVAHTIAEA GYKALVLEKG KYFSSEEFTF NDLTGYQNLY
EQQGALVSSN QQLFVLAGST FGGGSAINWS ACLKTPFKVR KEWYDDFGLE WAASESFDKC
TDYVWTQMGA NKNNINHSLA NKVILEGGAK LGYKVKEVEQ NNGAHADHSC GFCHLGCKYG
IKQSSPACWF REPADKGSLF MDQVKVIKVL HNRGVAIGVL CEDILTGKQF KITGPKKYVV
SGGSLCTPVV LQNSGFRNKH IGANLKLHPI SIVFGNFGRE ARADPHEHPI LTSVCTEVDD
LDGKAHGAKI ETVLNAPFLE SVFLPWQNSD KLREDLLKYQ NLATMLLITR DKSSGYVRAD
SNAPNSLIVD YTVNQYDRNA LLQAFVTTAD MLYIQGAKEI FGSQAWLPVF KSEKPKHERA
ITDQDFVDWR NAVLKIGLDS YGNVYGSAHQ MSSCRMSGKG PRYGACDENG HLFECKNVYV
ADASAMPTAS GANPMITTMA IARHVALGLV KDLQPAAKL