Gene PICST_66642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66642 
SymbolNGR1 
ID4851786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2822988 
End bp2826152 
Gene Length3165 bp 
Protein Length690 aa 
Translation table 
GC content45% 
IMG OID640393494 
Productnegative growth regulatory protein 
Protein accessionXP_001387102 
Protein GI126275598 
COG category[R] General function prediction only 
COG ID[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATCTTCTCCA GTAGTTCTAG GAGACAGCAT CTAGCGGTCT TGTTGATATT GTTTGTGTGT 
TATTGTTTAT TATAATTGTT GTTCGTTCTT TGCTGTTCGT TTTTGTGTTT GTTCTGTAGT
TTTCTTGCAA TCATTCCCCC TTTTCCATTC GGAAGAACGA ATCATTACCA TCAACCTGGC
TACGTTATCC TAATTACTTC GGTATTGCAA CAGTGGATCC AAGCTGTTTG TGATTGTGTG
ATTTATCAAG TAGCCACACA AAACTGCTGC CTCATTCCAG CTGGTGGCAT CGTGCAAGTC
TGATTACCCC ATACTTTTTT CAGTCCAGAA AGGGGATCTG GATTTTCCAG TATTGAAGCA
TCGTTTTTTT GCATTGAGAT ATACTGGTAG AAATTCTTGA ATTGGTTCAA TTGCTGGATT
AGTAGAAATT GAATTCTGTG AACTCTGTTT GTGGTTCCTG ACAATACTTG AAGAGTAGAA
GCGAACAAGT CGTATACATA TACTATATTG ATACGACTCT TTTCGGCTTT TTGAAAACTC
AATTGATCCT CAGAATTGAC TTGGACTGAA TATTTGATAA TTGAAATTCA ATAGTGTTCT
CAGTTGAATC ATTGAATCGT TTGAATAGTT AAATCCTATC AAAATGTCGT ACTTGCAGGA
CCAGGAGTAC CACCCTGGCC ATCCGGCCAT TGAAAACGAC GAGAACCACT CCAGCACAGC
CGCATCCGGC CCCAAGCCCC CGAAAGCAAC CATAACTGGT CCGACAAAGA TACCAGTGAA
TGCCAATACG ACTACCTCTA GCACTGCTAG CCACCAACAC CAACAGGAAC AGGAAATTCT
CTCCAGTTTT GCCATTCCCA ACCCTCCGAT TTCGACCCAG CAGTCGAGCG TAGCTGCAGA
AAAGGATCAG ACCGGCTCCG ATAACTCCGG AGAGGTACAG TCGCCGCGGA CTTTGTGGAT
GGGAGACTTG GATCCATGGC TCGACGAAAA CGGCATTGCC GACTTGTGGT GGAAAATCCT
CCAGAAACGG GTTACCGTCA AGATCATCAA GCCAAAGACG TCCAAACCCG ATATCACGTA
CCAGGGTCTT TCCCATTCTG GCTATTGCTT TGTAGAATTC GAGTCTTTTG AAGATGCCCA
GCTAGCCCTT GGACTAAATG GCCAATTGCT TCCAGACATC GCCATGCCGT CACAACAACA
TTTCCCCAAT AACCCCGATA ACCAGAAGAA GTACTTTCGG TTGAACTGGG CAAGCGGAGC
TACGTTGAGT GCACCCATAA TTCAGAGTCC GGAGTACTCG TTGTTTGTAG GAGACCTTTC
TGCCTCTACG ACAGAAGCAC ATTTACTAGC ATTCTTCCAG AAGAACTTCC CAGCCTCTAT
CAAGACGGTG AGAGTGATGA CAGATCCGGT TTCAGGCAAA TCGCGCTGTT TTGGTTTTGT
CAGGTTTACA GACGAGTCTG AAAGGCAGAG AGCATTGGTG GAAATGAATG GTGTCTGGTT
TGGAGGTAGA CCTCTCCGTG TAGCTTTGGC CACACCTAGA AATGTCAACA GAAACAAGTT
CCAGAACCAG AACCACCAAG GAAACCCAGT CAACTTTTAT GGTGGTGGCG GTGACTCTCA
GCAGGAGATG GTATATATGC AGCCACCCCC TCCTCAGATG AGAGTCGAGC TGCCCTATGC
TTACTACGGC AATCCGCAAG TTCCTCCTTC TGGCTCTGGA GCTCCTTATG ACATACCAGG
AGATGTAGAT CGAGGAGGAC TTGATCCTGG AACTGGTATG GGAACTATAA AGTCTCCTAT
GCAATCGCCT GGTGTTCAGC CTCAGCCATA TACCGATCCA AATAACACAA CAGTGTTTGT
GGGAGGGCTT TCTTCAGAGG TAACCGAGTC AACTTTGTTT ACTCTTTTCA AGCCATTTGG
AATCATCCAG CAAGTGAAGA TTCCTCCTGG CAAGAACTGC GGTTTTATCA AGTACTCCAC
GCGTGAAGAG GCTGAAGAGG CCATAGCCGC GATGCAGGGC TTCATCATCG GTGGAAACAG
GGTCAGACTC AGCTGGGGCA GAGTATCTAT GAATAATAAG AAGTTCCAGC AGCAACAGCA
GCAAGTAGCG CATGCTGCGC AGATGCAAGC TGCGGCGGCA TTGTCAATGG GAATGGATCC
ATCCAGTGCT ATTGCGGCAG CAGCAGCCGC CGCTGCTGCT GGGGGCTATC CTCCTCCAAT
GGGAGCTCCT CCACAGATGG GTGGCATGCC TCCCTTGGGA ATACACCCTG CCATGCCCCC
TGGAGCTGGG GCTTTCCAAT CTCCGATGTC GCAGGGACAG GGAGGTTCTG AAGGCTACGA
TAAGGGACAG AGTGATCGTT CTACTTCATT TGACGAATCT ACCAGTAATC CCAGTATTTC
TCCGTACTAC ATTCCCATGC CCCCACCACC TGGAGCTGAG TACAGCTTGC CTCCACATCC
TCATCCACCT TCTCAGGAAG CATTGATCAA GGCTATGGGC AACATCGATC TCGGTGGAGG
GCCCGAGAGA GCAGACGGTG CCGACCAGAT GTACTTGAAT GCACCTTATA TGGGTCCTCC
TCGGTACGCT GGTGGATATC CACTCCCAGG AGAAATGCAA TATCAGCAGT TTATTCCTCA
TCCTGGACCA GAAGAGCCTC AAGAAAAACA GCCATTACCA GAAGAGGATG ACGATGGCGA
TAACGAAGAG AAATAAAAAA TTTAGGTGGT CATCGCTTTC ACAGGTATTC TACATCTATT
CTTTACTTCC ATTCCATTAC TGCTACTATT TTATTCCATT ACTACTATCA TTATTCATTA
TTACTATCAT TTGATTCCAT TACCATTATT TCTATTAGTT CTCACCATTC TTACCATTTT
TTTGTTGAAC CGTAGGTTCA TGATGTTGTT TTATGTCTGT AACAATTTGG ATTTATGTTA
CTTATGAGTT TTGCTGTTAA TGTTATTGTT TTTGTTGGTT CGTTTGTTGG CTATGCCAAA
GACACGAGGG TACGACAAAA GGAAACGGAA TTACTATTAT TATTATTACT ACTGTAGTTG
TTCTTACTGG ACAGCCTTCT ACTACTATTG ATTCCCTCTG ACCAAAATCT ATTTTGACCT
TTATATTATG TTTGTCCATT TGTGTTAATA CGATGATATT GAATG
 
Protein sequence
MSYLQDQEYH PGHPAIENDE NHSSTAASGP KPPKATITGP TKIPVNANTT TSSTASHQHQ 
QEQEILSSFA IPNPPISTQQ SSVAAEKDQT GSDNSGEVQS PRTLWMGDLD PWLDENGIAD
LWWKILQKRV TVKIIKPKTS KPDITYQGLS HSGYCFVEFE SFEDAQLALG LNGQLLPDIA
MPSQQHFPNN PDNQKKYFRL NWASGATLSA PIIQSPEYSL FVGDLSASTT EAHLLAFFQK
NFPASIKTVR VMTDPVSGKS RCFGFVRFTD ESERQRALVE MNGVWFGGRP LRVALATPRN
VNRNKFQNQN HQGNPVNFYG GGGDSQQEMV YMQPPPPQMR VELPYAYYGN PQVPPSGSGA
PYDIPGDVDR GGLDPGTGMG TIKSPMQSPG VQPQPYTDPN NTTVFVGGLS SEVTESTLFT
LFKPFGIIQQ VKIPPGKNCG FIKYSTREEA EEAIAAMQGF IIGGNRVRLS WGRVSMNNKK
FQQQQQQVAH AAQMQAAAAL SMGMDPSSAI AAAAAAAAAG GYPPPMGAPP QMGGMPPLGI
HPAMPPGAGA FQSPMSQGQG GSEGYDKGQS DRSTSFDEST SNPSISPYYI PMPPPPGAEY
SLPPHPHPPS QEALIKAMGN IDLGGGPERA DGADQMYLNA PYMGPPRYAG GYPLPGEMQY
QQFIPHPGPE EPQEKQPLPE EDDDGDNEEK