Gene PICST_88119 details


Gene Information

Locus tag: PICST_88119
Symbol: ADE3
ID: 4837008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1374755
End bp: 1377799
Gene Length: 3045 bp
Protein Length: 946 aa
Translation table: 12
GC content: 48%
IMG OID: 640388323
Product: tetrahydrofolate synthase
Protein accession: XP_001382486
Protein GI: 150863865
COG category: [F] Nucleotide transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0190] 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase
        [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
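
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, not part of the record itself: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record's start, end, and protein length fields.
gene_len = span_length(1374755, 1377799)
cds_len = coding_length(946)

print(gene_len)            # 3045
print(cds_len)             # 2841
print(gene_len - cds_len)  # 204
```

Under these assumptions the reported span is 204 bp longer than an unspliced coding region for a 946 aa protein would need. One reading is that the span includes flanking sequence on either side of the open reading frame; the record does not state this explicitly.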


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.226712
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AATTGAAGTC AATATGGTCG CCCAATTGAT CGACGGAAAG GCAATCGCTC TCCAGTTGCG 
TACTGGCATC CACGATGAAA TCGCCCAGAT CCAGCAGAAA CACTCCGACT TCAAGCCCAA
CTTGACTATT ATCCAGGTGG GTGACAGACA AGACTCCTCG ACTTATGTCA GAATGAAGTT
GAAGGCCGCT GAAGAGGCCA GCATCGACTG CCATATCATC AAGTTGGCAG CCGACATCTC
CGAGTTTGAG CTTCTCAACG AAATCACCAG ACTCAACAAC AGTCTCGATG TTGATGGGAT
CTTGGTCCAG TTACCCTTGC CAGCCCATAT CGACGAAACA AAGGTCACCA ACGCCGTTTT
GGCCGACAAA GATGTCGACG GCTTTGGTCC CTTCAATGTA GGCGAATTGT CCAAGAAGGG
AGGTCAGCCA TTATTCTTGC CATGTACTCC CAAGGGTATC ATGGAATTGT TGGACGTCTC
TGGTGTCACT GTAGAAGGAG CCAACGCTGT AGTCCTCGGC AGATCCGATA TCGTCGGCAA
GCCTATTGCC AGATTGTTGA CTAAGGCCAA CGCTACCGTT ACCACGTTGC ACTCCAAGAC
TCCTCAAGCC CAGATCGAGT TGTTCTTGTC GCAAGCCGAT ATCGTAATCG CTGCCATCGG
CCAGCCCCAG TTCGTACAGG GCCTGTGGTT GAAGGAAGGT GCCGTTGTTA TTGACGTCGG
CACTAACTTT ATTCCTGATG CCACTAAAAA GTCTGGCTCC AGAATGGTCG GTGACGTAGA
CTTCGAGTCC GCTTCCCAGA AAGCATCCTT GATCACTCCT GTTCCAGGCG GTGTTGGCCC
AATGACTGTT GCCACATTGT TGGAAAACGT CATTCTTGCT GCCAGACGTC ACTATGCCAA
AAACAACGAA ACTCCCAAGT TCACTGAACC TTTGACGTTG CACTTGCAAA AGCCAGTTCC
TTCGGATTTT GAGATCTCCA GAGCTCAACA GCCCAAGAAG ATCACGCAGG TCGCTGACGA
AGCCGGAATC TTAGAAAGCG AAGTTGAACC ATTTGGTGCC TACAAAGCTA AGGTGTCTTT
AGATATCTTG AAACGTTTGC ACAACAAGGT CAACGGTAAA TATGTCTTGG TCACTGGTAT
CACTCCTACA CCTTTGGGTG AAGGTAAGTC TACCACCACT GTAGGTTTGG CTCAAGCTTT
AGGTGCACAT TTGAAGAAGA ACGTCTTTGC TAACGTCAGA CAGCCATCTA TGGGACCTAC
TTTTGGTATC AAGGGTGGAG CTGCTGGTGG AGGGTACTCC CAGGTGATTC CTATGGACGA
ATTCAACATG CACGTGACTG GAGATATTCA TGCTATCACC ATGGCTAACA ACTTGTTGGC
AGCTGCTATT GACACGAGAA TGTTCCATGA ATCCACTCAG AAGGACGGAC CCTTGTACAA
GAGACTTGTT CCTGCTAAGA AGGGTGTGAG AAAGTTCACC AATTCCATGT TGAGAAGATT
GAACAAATTG GGCATCGACA AGACCGATCC AGACTCGCTT ACACCTGAAG AGGTTACCCG
TTTCGCCAGA TTGGATATTG ATCCAGAAAC TATCACCTGG AGAAGAGTTG TGGACTGTAA
CGACAGATTC TTAAGAGGAA TTACTATTGG TCAGGCTCCT ACAGAGAAAG GTTTCACTAG
AGAAACTGGT TTCGACATCA CTGTTGCATC TGAGTGTATG GCTATTTTGG CCTTGTCTAA
CTCCTTGGAA GACATGCGTG AAAGATTGGG GAAGATGGTA ATTGCTTCCA ACAGAGCTGG
TGAGCCAGTT ACAGCTGAAG ATATTGGATG TGCTGGTGCC CTTACTGCTT TGTTGAAGGA
TGCTATTAAG CCCAACATCA TGCAAACCTT GGAAGGAACA CCAGTGTTTG TTCATGCTGG
TCCTTTTGCC AACATCTCCA TTGGTGCCTC TTCTATTCTT GCTGACAAGA TGGCGCTCAA
GTTGGCTGGA ACGTCCCCTG ATTTGTCTGC TGAAGAAAGA CAGCAGCAGG AAGGGTATGT
TGTAACAGAA GCTGGTTTTG ACTTCACTAT GGGTGGAGAA AGATTCATCA ACATCAAGTG
TCGTTCTTCC GGCTTGGTTC CTGACGTCAT CGTCATTGTA GCCACTGTTC GTGCCTTGAA
GGTTCACGGT GGTGGCCCAG AAGTCAAGGC TGGAGCACCT TTGGCTGCTG AGTACACCCA
GGAAAACACT GAGTTGTTGA GAGCCGGTTG TTCCAATTTA GGCAAGCACA TCAGCAACGC
TCGTTCGTAT GGTCTTCCTG TCGTTGTAGC AATTAACAAG ATGTCTTCAG ACTCAGAAGC
TGAACATGCC ATCATCAGAG AAGAAGCCCT CAAGGCAGGC GCTGTTGATG CCATTGTTTC
TAACCACTGG GAAGAAGGTG GTCAGGGAGC TGTAGATCTC GCCAACGGTG TTATTTCTGC
TGCCAACTTG CCTGAGAGAA ACTTCAAGTT CCTCTACGAC ACCGAGCCTT CTGTGGAAGA
AAAAATCGCC ACCATTGCCA GGGAAATGTA CGGTGCTGGA GAAGTCGAGT TCCTGCCCGA
AGCCCAAAAG AAGATTGACT TATACACCAA GCAGGGCTTC GGCAACTTAC CAATCTGTAT
TGCCAAGACC CAGTACTCGT TATCGCACGA TGCTGCACTC AAGGGTGTTC CTACTGGATT
CACGTTCCCT ATTCGTGACG TGAGAGCCTC TATCGGTGCC GGCTACTTGT ATGCCTTGGC
TGCTGAAATC CAGACCATTC CAGGTTTGCC AACCCACTGT GGGTTCATGA ATGTGGAAGT
CAACGAAGAC GGCGAGATTG ACGGTTTGTT CTAAACAAAC TAAACTAAAG TTCAACACTT
ACGACTCACG ATAAAAATTC ATACACAATA ATACATTTAT TTCCCACTGC CCCATGGGGT
TGTTTGCCAT CATCGAATAG CTAGTGATAG CGATTAGTTG TCCTGGCTTG TTTATACTAT
ATTTTGTCTG TACAGTAAAT AAAGTTTCAA CACGTTAGAA ATTGT
 
Protein sequence
MVAQLIDGKA IALQLRTGIH DEIAQIQQKH SDFKPNLTII QVGDRQDSST YVRMKLKAAE 
EASIDCHIIK LAADISEFEL LNEITRLNNS LDVDGILVQL PLPAHIDETK VTNAVLADKD
VDGFGPFNVG ELSKKGGQPL FLPCTPKGIM ELLDVSGVTV EGANAVVLGR SDIVGKPIAR
LLTKANATVT TLHSKTPQAQ IELFLSQADI VIAAIGQPQF VQGSWLKEGA VVIDVGTNFI
PDATKKSGSR MVGDVDFESA SQKASLITPV PGGVGPMTVA TLLENVILAA RRHYAKNNET
PKFTEPLTLH LQKPVPSDFE ISRAQQPKKI TQVADEAGIL ESEVEPFGAY KAKVSLDILK
RLHNKVNGKY VLVTGITPTP LGEGKSTTTV GLAQALGAHL KKNVFANVRQ PSMGPTFGIK
GGAAGGGYSQ VIPMDEFNMH VTGDIHAITM ANNLLAAAID TRMFHESTQK DGPLYKRLVP
AKKGVRKFTN SMLRRLNKLG IDKTDPDSLT PEEVTRFARL DIDPETITWR RVVDCNDRFL
RGITIGQAPT EKGFTRETGF DITVASECMA ILALSNSLED MRERLGKMVI ASNRAGEPVT
AEDIGCAGAL TALLKDAIKP NIMQTLEGTP VFVHAGPFAN ISIGASSILA DKMALKLAGT
SPDLSAEERQ QQEGYVVTEA GFDFTMGGER FINIKCRSSG LVPDVIVIVA TVRALKVHGG
GPEVKAGAPL AAEYTQENTE LLRAGCSNLG KHISNARSYG LPVVVAINKM SSDSEAEHAI
IREEALKAGA VDAIVSNHWE EGGQGAVDLA NGVISAANLP ERNFKFLYDT EPSVEEKIAT
IAREMYGAGE VEFSPEAQKK IDLYTKQGFG NLPICIAKTQ YSLSHDAALK GVPTGFTFPI
RDVRASIGAG YLYALAAEIQ TIPGLPTHCG FMNVEVNEDG EIDGLF
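
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a short in-frame fragment taken from the gene sequence above (GTACAGGGCCTGTGGTTG), which corresponds to the residues VQGSWL in the protein sequence. The helper names are hypothetical, and the code models only the CTG reassignment; it does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"

# Standard genetic code (NCBI table 1), one letter per codon,
# with codons enumerated in TCAG order at each position.
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Build a codon-to-amino-acid map, optionally reassigning CTG to Ser."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD))
    if ctg_as_serine:
        table["CTG"] = "S"  # reassignment used by translation table 12
    return table


def translate(dna: str, ctg_as_serine: bool = True) -> str:
    """Translate DNA in frame 1; spaces are ignored, trailing bases dropped."""
    table = codon_table(ctg_as_serine)
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


fragment = "GTACAGGGCCTGTGGTTG"
print(translate(fragment))                       # VQGSWL
print(translate(fragment, ctg_as_serine=False))  # VQGLWL
```

Translating the same fragment with the standard code would give leucine at the CTG position, which does not match the protein sequence in this record.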