Gene PICST_29898 details


Gene Information

Locus tag: PICST_29898
Symbol: RVB2
ID: 4837476
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1454719
End bp: 1456269
Gene Length: 1551 bp
Protein Length: 484 aa
Translation table: 12
GC content: 44%
IMG OID: 640388791
Product: transcriptional regulator
Protein accession: XP_001383047
Protein GI: 126133044
COG category: [K] Transcription
COG ID: [COG1224] DNA helicase TIP49, TBP-interacting protein
TIGRFAM ID:
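
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which this record does not state; the function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
length = gene_length(1454719, 1456269)
print(length)  # 1551, matching the reported gene length

# A 484 aa protein plus one stop codon needs (484 + 1) * 3 coding bases.
coding_bp = (484 + 1) * 3
print(coding_bp)           # 1455
print(length - coding_bp)  # 96 bases of the gene span are not coding
```

Under that assumption the span reproduces the reported 1551 bp, and the 96 bp difference between the gene span and the coding length is consistent with the gene being flagged as spliced.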


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.287738
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.530711
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTACGTT TTTGTATCTA TGCTTGTGAA ACCACGAGTA TATGATGAAA TGCTCTGCGG 
ACTTTGAGTT CTGAATGGCT ACTAACTTTT TATTTATAGG CATCTACACC TACGATCACA
ACCAAAGTCC AGACAAAGGA CTTGTCTGGG TTATCTCTTA TAGCTGCCCA CTCCCACATT
TCGGGTCTAG GCTTGGACGA GAACTTGAAG CCAAAAGAAT CAGCTGAAGG GATGGTTGGA
CAATTGAAAG CCAGAAAGGC GGCTGGCGTG ATTTTAAAGA TGATTCAGGC TGGTAAGATT
GCTGGCCGTG CTGTGCTTAT TGCCGGGCCT CCATCTACTG GTAAGACTGC CATTGCTATG
GGTTTGTCGC AGAGCTTAGG TACAGATGTT CCATTTACAG CAATAGCCGG TTCTGAAGTC
TTTTCTTTAG AATTATCCAA GACTGAATCA TTGATACAAG CTTTCCGTAA ATCTATTGGT
ATCAAGATCA AAGAAGAAAC AGAAATAATC GAAGGTGAAG TCGTCGAGAT CCAAATCGAC
AGATCAATTA CCGGCGGTCA CAAGCAGGGA AAGTTGACCA TTAAGACGGC TGATATGGAG
ACAATTTATG AGTTGGGTAA CAAGATGATT GAAGGCTTAA CTAAGGAAAA GGTATTGGCT
GGAGATGTTA TTTCCATCGA CAAAGCTAGT GGTAAAATCA CCAAGTTAGG TAAATCATTC
ACCAGGGCCA GAGACTACGA TGCTATGGGT CCAGAAACCA AGTTTGTCCA ATGTCCCGAA
GGTGAGTTGC AGAAGAGAAA AGAAGTTGTC CATACTGTTT CGTTGCACGA GATAGACGTT
ATAAATTCTA GACAACAGGG GTTCCTTGCC TTGTTCTCGG GTGACACTGG TGAGATCCGC
TCTGAGGTTC GTGACCAAAT CAACACCAAA GTCGCCGAAT GGAAGGAAGA AGGTAAGGCC
GAGATCGTGC CTGGTGTCTT ATTCATTGAT GAGGTTCACA TGTTGGATAT TGAGTGCTTT
TCATTCATCA ATAGAGCATT GGAGGACGAC TTTGCACCCA TTGTTATCAT GGCCACTAAC
CGAGGAATCA CCAGGACTCG TGGTACTAAC TACAAGTCCC CTCATGGCTT ACCTGTAGAT
TTGTTGGACA GATCTATCAT CATCCACACT TCATCATACA GTGCCGACGA GATCAGAACC
ATTCTTTCCA TAAGAGCCAA CGAAGAAGAA GTAGAATTGA CCCCTGATGC TTTGGCATTG
TTGACCAAGA TTGGTCAAGA AACAAGCTTG AGATACGCCT CTAACTTGAT TTCAGTTTCC
CAACAGATTG CATTGAAGAG AAGAAGCACT TCTGTTGAGC TTCCAGATAT CAAGAGAGCA
TACATGTTGT TTTTGGATGC TGACAGATCG GTACAATACT TGGAAGAGTT CCCAAACCAA
TTCATCGACA ATTCTGGTAA TGTTACAATT GGCCAGAAGG ATGAGTCTTC GGCCAATGGC
AACGGCGCTA CTCCTATTGT TGTAGATGAA GACAAGATGG AGACCGATTA G
 
Protein sequence
MASTPTITTK VQTKDLSGLS LIAAHSHISG LGLDENLKPK ESAEGMVGQL KARKAAGVIL 
KMIQAGKIAG RAVLIAGPPS TGKTAIAMGL SQSLGTDVPF TAIAGSEVFS LELSKTESLI
QAFRKSIGIK IKEETEIIEG EVVEIQIDRS ITGGHKQGKL TIKTADMETI YELGNKMIEG
LTKEKVLAGD VISIDKASGK ITKLGKSFTR ARDYDAMGPE TKFVQCPEGE LQKRKEVVHT
VSLHEIDVIN SRQQGFLALF SGDTGEIRSE VRDQINTKVA EWKEEGKAEI VPGVLFIDEV
HMLDIECFSF INRALEDDFA PIVIMATNRG ITRTRGTNYK SPHGLPVDLL DRSIIIHTSS
YSADEIRTIL SIRANEEEVE LTPDALALLT KIGQETSLRY ASNLISVSQQ IALKRRSTSV
ELPDIKRAYM LFLDADRSVQ YLEEFPNQFI DNSGNVTIGQ KDESSANGNG ATPIVVDEDK
METD
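
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before their lengths or GC content can be compared with the reported values. The following is a minimal sketch of that cleanup; `normalize_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def normalize_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed on the page.
first_line = "ATGGTACGTT TTTGTATCTA TGCTTGTGAA ACCACGAGTA TATGATGAAA TGCTCTGCGG "
seq = normalize_sequence(first_line)
print(len(seq))  # 60 bases: six blocks of ten
print(round(gc_fraction(seq) * 100))  # GC percentage of this line only
```

Passing the full gene sequence through the same helpers gives a length and GC percentage that can be compared with the 1551 bp and 44% listed under Gene Information. `normalize_sequence` works on the protein sequence as well, for comparison with the reported 484 aa.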