Gene PICST_75631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_75631 
SymbolGCV2 
ID4836729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2411605 
End bp2415026 
Gene Length3422 bp 
Protein Length1033 aa 
Translation table12 
GC content46% 
IMG OID640388044 
ProductGlycine cleavage system protein 
Protein accessionXP_001383214 
Protein GI126133378 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTTCAGGTTC AGCTACTGCT TCTCTTACGG CAGTTCTTGT CTATCAGTGT CTACGTGCAC 
GGCATACTCA CCTCTATCGT CTTATCACAG ACTCGAACTG GTTTCAGATT TCACCATAGA
TCGTTCTGTA AATCCAGTCA ATTTGTTTCA AAACACCCAA ATCATATTTG CCTATAGTTC
CATAGAGTCA CTCAACAGTG CTATTATCTA TTACCATTAC TTTTCATAAC TCCCATACAG
CATGTTCCAG GCTAGAGCCA TTCTTAGAGC TGCCCGTTCC TCTCGTCCTG CATTGAGAAC
CCCAGCCTCA TTTGCACCCA GAGCTATCTC ATTTGCACCC AAAACGGCTA GCTTGAGAGC
TTTTGCTACC AAAGCGGACA CTTCATCTGT CAATTATGCC AAGGTGTACA ACCCCAACTC
CGAAAAGGTG TCCATCGGCA ACTTGGACAC TTTTGCCAGA AGACACATAG GCCCCACTCC
AGACAACGTG ACCAAAATGT TGTCCTCCTT GGGCTACAGT GACTTGGACG AATTCTTAAG
CAAGGCCATT CCTGAACACA TTCTCTACAA GAGAAAGTTG AAGATCGAGC CTGCGCAAGG
TTTCACCGAA TCAGAAATGC TTGAGCACTT GCACGAAATT GCTGGCAAGA ACAAGATCGT
CAAGTCATTC ATTGGTAAAG GTTATGCTGG TACCAGATTG CCACCAGTGA TACAGAGAAA
TTTGTTGGAA AGCCCAGAAT GGTACACTTC GTACACACCT TATCAGCCAG AAATCTCTCA
GGGTAGATTG CAATCGTTGC TTAACTACCA AACCATGGTT ACCTCTTTGA CTGGCTTACC
CATGGCCAAC GCCTCGCTTT TAGATGAAGG TACTGCTGCT GGTGAAGCCA TGTCTTTGTC
TTTCCACAAC TCGAAAAACA AAAAGTCTAC TTACGTGGTT GACTCTAACA TCCATCCTCA
GACCTTGCAG GTGATCCAAT CTAGAGCTGA AAAGATTGGT GTTAAGATCG TAGAATTGCC
ATTGTCAACT GAAGAAGGTG TAGAACAATT GCAGAAGCTT TCTAGTGATG TTTGCGGTGC
CTTGGTTCAA TACCCAGGTA CTGATGGTTC TCTTTACAAC TACTCTAAGA TCGGTGAAAT
CGTCCATGCT GGTAAGGGAT TGCTTGCTAT GGCTACCGAC TTGTTGGCTC TCACTATGAT
CAAGCCACCA TCTGAGTATG GTGCCGACAT TGCCTTGGGT ACTTCGCAGA GATTTGGTGT
TCCATTCGGC TACGGTGGTC CACATGCCGC TTTCTTTTCT ACCAGCATGA AGTATTCCAG
AAAGATTCCC GGAAGAATCG TTGGTGTCTC TAAGGATAGA TTAGGCAAGC CAGCTCTCAG
ATTAGCCTTG CAAACCAGAG AGCAACATAT CAAGAGAGAA AAGGCCACCT CCAACATCTG
TACAGCTCAA GCTTTGTTGG CTAACATATC TGCCATGTAC GCTGTATACC ACGGCCCAGA
AGGATTGAAG AACATTGCTA AGAGAGTGTA TGGTTTCACA ACTTTGTTGG CTAACGAAAT
CGCCTCTTCT CACGAAATTA CCAACAAGAA CTGGTTCGAC ACTTTAACTG TTAGATTGAG
CGGCACTACT GCGGATGAAA TTTTGGCCAA GGCTTTGAAC GAACACAACA TCAACTTGTT
CAAGGTCAAT GATTCCACTG TCTCTGTCAC CTTGGATGAA ACAGTAGAAG CCCAAGATTT
GGCCAGTTTG GTCGAAGTTT TCACCGGTAA GGAGTACGAC GTCGCTTCCA TTGGTGAATT
GCCTCAATTC CCAGCAGAAA TCTTGAGAAC CGACGACATC TTGACCCATG AAGTATTTAA
CACCCACCAC TCAGAAACTG CTATGTTGCG TTACTTGCAT TTGTTACAGA GCAAGGACTT
GTCGCTTGCA AACTCCATGA TTCCTTTGGG TTCTTGTACC ATGAAGTTGA ATGCTACTGT
TGAAATGCAA ACGTTGTCGA TTCCTGGCTT CAACTCCATC CATCCATTTG CTCCTATTGA
CCAAGCCCAG GGCTACAAGG AGTTGGTGGA CGAGTTTGAA AAGGACTTGA ACGATATCAC
TGGATTTGAC GCCACTACTT CGATGCCTAA CTCCGGTGCT CAGGGTGAAT ACACTGGTTT
GTCTTTGATC AGACAATACC ACAAGTCCAG AGGTGACTAC GAGAAGAGAA ACATCTGTCT
TATTCCTGTC TCAGCCCACG GTACCAACCC AGCCTCTGCT GCCATGTGTG GCTTGAAGGT
GGTACCAATC AAGTGTTTGG ACAACGGTTC CATTGATTTG AAGGACTTAC AAGAAAAGGC
TGAAAAACAT GCTGAAAACC TTTGTTCCAT CATGATCACC TACCCTTCTA CCTACGGTTT
GTTTGAGCCA GGTGTCAAGA CAGCTATTGA CACTGTTCAC AAGTACGGTG GGTTGGTTTA
CTTGGATGGT GCCAACATGA ATGCTCAGGT TGGTTTGACA TCGCCAGGTG ACTTGGGTGC
TGATGTTTGT CACTTGAACA TCCACAAAAC CTTTGCATTG AGTCACGGTG GTGGTGGTCC
GGGCCAGGCT CCTGTCTGTG TTAAGGAACA CTTGAAGCCA TTCTTGCCTT CTCATCACTT
CCTCCAGACT CCTCACTCGA CTTCTGAGTC CATCAAGGCC GTCAACTCGG CTCCTTACGG
TTCAGCTTCT GTCATCCCAG TGTCTTACTC ATACATCAAG ATGTTGGGAG CCGAAGCCTT
GCCATATGTT TCTACAATTG CCATGTTGAA CGCCAACTAC ATCTTGAACA AGTTGAAGGA
CCACTACAAG ATCTTGTTCA TTGACCCCAA TGCCTCTGCC GACGAAGGCT TGAAGCACTG
TGCTCACGAA TTCATCTTGG ACTTGAGAGA ATTCAAGGCT GTCGGCATTG AGGCCATTGA
TGTCGCCAAG AGATTACAGG ATTACGGTTT CCATGCCCCA ACCATGTCTT TCCCTGTTGC
CGGCACCTTG ATGATCGAGC CTACTGAATC AGAAAACTTG GAAGAGTTGG ACAGATTCAT
CGACTCGTTG CTTGCTATCA GAAAGGAAAT CGAAGCCTAT GCCAACAAAG AGCCATTGGG
TTTGGTGTTG AAGAACGCTC CCCATTCATT GGAAGACGTT GTTTCTACTC CACAGGCAGA
CTGGGACGCC AGAGGCTACA CAAGAGAAGA GGCTGCCTAC CCATTGCCAT TCTTGAAGAC
GTCCAAGTGC TGGCCTACAG TCGCTAGATT GGACGACACC TACGGTGACA TGCATTTGAT
GTGTACTTGT CCATCAGTCG AGGAAGTGGC ATCTGAACAA TAGGTTCAGA TTCAGTTCGT
GTATGTATTT TAGTATTTAT AGATAGAAAA AAAAATCGAA TAAGAGATGT TTGACTTGGT
GT
 
Protein sequence
MFQARAILRA ARSSRPALRT PASFAPRAIS FAPKTASLRA FATKADTSSV NYAKVYNPNS 
EKVSIGNLDT FARRHIGPTP DNVTKMLSSL GYSDLDEFLS KAIPEHILYK RKLKIEPAQG
FTESEMLEHL HEIAGKNKIV KSFIGKGYAG TRLPPVIQRN LLESPEWYTS YTPYQPEISQ
GRLQSLLNYQ TMVTSLTGLP MANASLLDEG TAAGEAMSLS FHNSKNKKST YVVDSNIHPQ
TLQVIQSRAE KIGVKIVELP LSTEEGVEQL QKLSSDVCGA LVQYPGTDGS LYNYSKIGEI
VHAGKGLLAM ATDLLALTMI KPPSEYGADI ALGTSQRFGV PFGYGGPHAA FFSTSMKYSR
KIPGRIVGVS KDRLGKPALR LALQTREQHI KREKATSNIC TAQALLANIS AMYAVYHGPE
GLKNIAKRVY GFTTLLANEI ASSHEITNKN WFDTLTVRLS GTTADEILAK ALNEHNINLF
KVNDSTVSVT LDETVEAQDL ASLVEVFTGK EYDVASIGEL PQFPAEILRT DDILTHEVFN
THHSETAMLR YLHLLQSKDL SLANSMIPLG SCTMKLNATV EMQTLSIPGF NSIHPFAPID
QAQGYKELVD EFEKDLNDIT GFDATTSMPN SGAQGEYTGL SLIRQYHKSR GDYEKRNICL
IPVSAHGTNP ASAAMCGLKV VPIKCLDNGS IDLKDLQEKA EKHAENLCSI MITYPSTYGL
FEPGVKTAID TVHKYGGLVY LDGANMNAQV GLTSPGDLGA DVCHLNIHKT FALSHGGGGP
GQAPVCVKEH LKPFLPSHHF LQTPHSTSES IKAVNSAPYG SASVIPVSYS YIKMLGAEAL
PYVSTIAMLN ANYILNKLKD HYKILFIDPN ASADEGLKHC AHEFILDLRE FKAVGIEAID
VAKRLQDYGF HAPTMSFPVA GTLMIEPTES ENLEELDRFI DSLLAIRKEI EAYANKEPLG
LVLKNAPHSL EDVVSTPQAD WDARGYTREE AAYPLPFLKT SKCWPTVARL DDTYGDMHLM
CTCPSVEEVA SEQ