Gene PICST_43257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_43257 
Symbol 
ID4837980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp795603 
End bp798449 
Gene Length2847 bp 
Protein Length948 aa 
Translation table12 
GC content38% 
IMG OID640389295 
Productpredicted protein 
Protein accessionXP_001383787 
Protein GI150864810 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.393184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.929698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAAC AATTACATAC GTTACAATCA ATAGCTATAG AGGACGAGTC CGCTCCATCT 
TTGGCGGTTT CCAACCGTTT TGTTCCGGCA TTATACGAAC TCAAATTGGA CATTAACCAT
ACAAAACCAA ACTTCCAAGG TCAATTAGAC ATTCTTTTGA AAGAGAACGA CGTTTATAAT
GTTGAAAGAC TGCAAACATC CACTAAATTC TCATTATCTT TACATGCATC CAAATTAGTC
ATTACCAAGG CAGTTTTGAA TACCACTTCT GAAGTCAAAC TTACTGTTAA ATACGACAGA
ATCAACAGTC AGGTGACTCT TTCTTCGTCA GAAGATGTCG AAATTGTAGA TGTCGCTAAT
CTGAAAGTCT CCATTACGTA CATGGGGCAG ATCAATAGCA TCAAAACGTA CCAAGATAAA
ACCCATGGTT TGTTCAAGAC TAATTATTTG GACAGTGTCT CAGGAAAGTC TAACAATTAT
ATTCTTTCTA CTCACTTCCA GCCTCATTCT GCCAAGTTGG TCTTTCCCTT GATCGAAGAA
TTGCACGTCA AGACTCCGAT CAAGTTGACC ATCACTACTC TATCCAAATT TAAAGTAATT
TCAGTTGGTA AACTTCTACT GAATCAGCCA TTGGAGATGA GTGAAAACTC TACTTTCAGC
TTTGAAACTT CTCCTCCAAT TGCGCCTTCT GTATTTGGAT TTGTAATCGG TGATTTTGAA
TACTTGGAAG ACAAATATGG CGATCTTCCT CTTCGTGTAT ACACAAGTAT TGGTGAAAGC
AGATATGCTA TTCGTGCTCT TAAACTGATG AAGAAATTGC TTCCTATTCT TGAAAGTTTG
CTTGATGTCA AGTATCCTTT GGAAAAGCTT GATTTCGTTT CTATTCCATT CTTAAACGAT
GGAGCCATGG AAAATTGGGG CTTGGTAACT GTTCTTTCTA ACCAATTGCT TGTGGATGAA
AGCACAGCCA GCCCATCCAC TTTAAGACAA ATTGACCAAA TAGTTGCACA TGAACTTGTC
CACCAATGGA TAGGAAACTT GGTTACATTT GATGACTGGA AATACTTGTG GTTGAACGAA
TCCTTCGCTA CATGGCTCGG TAACTATATT CTTGACGTTG CTGATAACTC TCACCCAAAA
GACAAAGAAT TTGAAAAATT CCTTGACCAA GACTGCTTCT ATGCTGAAGA CAAGTTCTCA
ATCCCAAGCA TCAATACCTA TATGTCCAAA ATAGATACCG GATTAAACTC TTTGACATCG
ACAATTTTTG ATACACATGC TTACGAGAAG GGTATAATTT TATTAAGAAT GATTGGAAAT
ATCATACATG TAGATGGGGA TCCAACATCA GCTAGTGAGG GTGATGACTA CACGAGAATG
TTGCGTGGAA TCGGCGCACT TATCAAAAAG TATCAGTACA AGTCAATCAA GGCATTTGAA
ATTTGGAATA CATTGAACGA GTTGACATCC ATTGACTTGC AGAGTTTCGT CCATTCTTGG
TTAAGATACC CTGGATTTCC ATTGGTTCAA GTTACTACCA ATGACGACAA CTCCAAGTTG
TTATTTGAAC AACATCAATG TTTGTACAAT TTGAGGGCAG ACCAAGTTAA TTTAGAAGAC
CATCCTTTTC ATGTCCCATT GTTTATCAAG GTGATAGATG ACAAGGGCTC ATCGAAAGTA
TTGAATATCA TAATGACAGA CCGTACTTTG GAATTGGATA TTTCCTTGGC TCAATTAGTC
AATATTAACC ACAACCATAG TGGTTACTAC CGTGTCAAGT ATTCCCCAAA ACTAATTGCC
AATATCATTG AGAACATTGA TAAGGTGTCC CTGACTGATT TGATTACCAT CATCAATGAC
TATGGAAAGC TTTTAGGAAG TGTTGGCACT ACAAAAGAAG ATCTCATTTC TTTAGTACAG
ATCATTGAAG CTGTTTGCAA GAGATCACAC ATTGACTACG ATTTATTACA AGTTGCGATG
ACCTATTTAG AAACAATCAA CTCAAACTTG ATGCATTTCA GTAAATATAC TGAGTTCCAG
GTATGGGTTG ACAACTTAGT TAGCACTTTG TTCCAAAGAA TTGGTGGATG GGACAAATTG
CAGTCTTTCA AAGATAGTCA TTGCTACGAT CCTGTAGAAA TGGAAGTTAG AAACGCCATT
CTTCAAATTG GAGTATACCG TTCAGACTTC CAAGAAGTTG GCAAAAAGTT ATTCAAGAAT
TTCGTTAATT CGGGAATAAA CAAATCGTTT ACTCCCAAAC AATTGGCAAC ATCCATGTTC
AATACAATGA TCTACAATGC CCCACAGAAG GACTACAAGC GAGTCTTAGA GTTTGTCAAG
AATTCCAACA ATTCTCTTTT GGAGCATACC GACCTTACGA ACCTGGATTT GCAAACCACA
GCAGTATCGT CGTTGTCGTT TGTACACAAG GACGATTTGC TTCATAAGAC CTTGAACTTT
GTCATGACCA ACATCGATGC TAAAATGATA GAGTTAGGCT TGATTGGATT CCAGTACAAG
AGCTCCAAAC AAGACAAGCT TAAATTGTTC CAATGGTATA AGCTCCATTA CGACCAGTGG
GTATTACGTT CGTTAAGAAA GGGTTCCGAT TGGTCAAAGC AAATCGGTAT TACGGTTTCA
AACATTTCGA AGATGGTCTT AGGAACCATA ATGCAATTCG ATCCCGAACT TGTCGAATTG
AGAGAAAAAT TTGTTAAGGA CAAACTTGCG ACGTTACCTC CTCACGGTTT ACAAGAATTG
CTTGAAGCGG TACAGGATGA AAACGAAGAG AAGGTGTTGA TCGGCGGTTA TTATGACGAC
TTGGTCCTGC AAGTGCTTAG AGCTTGA
 
Protein sequence
MVEQLHTLQS IAIEDESAPS LAVSNRFVPA LYELKLDINH TKPNFQGQLD ILLKENDVYN 
VERSQTSTKF SLSLHASKLV ITKAVLNTTS EVKLTVKYDR INSQVTLSSS EDVEIVDVAN
SKVSITYMGQ INSIKTYQDK THGLFKTNYL DSVSGKSNNY ILSTHFQPHS AKLVFPLIEE
LHVKTPIKLT ITTLSKFKVI SVGKLLSNQP LEMSENSTFS FETSPPIAPS VFGFVIGDFE
YLEDKYGDLP LRVYTSIGES RYAIRALKSM KKLLPILESL LDVKYPLEKL DFVSIPFLND
GAMENWGLVT VLSNQLLVDE STASPSTLRQ IDQIVAHELV HQWIGNLVTF DDWKYLWLNE
SFATWLGNYI LDVADNSHPK DKEFEKFLDQ DCFYAEDKFS IPSINTYMSK IDTGLNSLTS
TIFDTHAYEK GIILLRMIGN IIHVDGDPTS ASEGDDYTRM LRGIGALIKK YQYKSIKAFE
IWNTLNELTS IDLQSFVHSW LRYPGFPLVQ VTTNDDNSKL LFEQHQCLYN LRADQVNLED
HPFHVPLFIK VIDDKGSSKV LNIIMTDRTL ELDISLAQLV NINHNHSGYY RVKYSPKLIA
NIIENIDKVS STDLITIIND YGKLLGSVGT TKEDLISLVQ IIEAVCKRSH IDYDLLQVAM
TYLETINSNL MHFSKYTEFQ VWVDNLVSTL FQRIGGWDKL QSFKDSHCYD PVEMEVRNAI
LQIGVYRSDF QEVGKKLFKN FVNSGINKSF TPKQLATSMF NTMIYNAPQK DYKRVLEFVK
NSNNSLLEHT DLTNSDLQTT AVSSLSFVHK DDLLHKTLNF VMTNIDAKMI ELGLIGFQYK
SSKQDKLKLF QWYKLHYDQW VLRSLRKGSD WSKQIGITVS NISKMVLGTI MQFDPELVEL
REKFVKDKLA TLPPHGLQEL LEAVQDENEE KVLIGGYYDD LVSQVLRA