Gene PICST_88627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_88627 
SymbolHDT2 
ID4838081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp750073 
End bp751788 
Gene Length1716 bp 
Protein Length571 aa 
Translation table12 
GC content41% 
IMG OID640389396 
Productconserved hypothetical protein 
Protein accessionXP_001383775 
Protein GI150864799 
COG category[R] General function prediction only 
COG ID[COG1272] Predicted membrane protein, hemolysin III homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00146356 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACTG AAATCAGACA CAGAGGAACA TCTTCCAATG ACAAGTCTTT CGAGTCTGTT 
CCTACAACTG AAGAGCTTTT GGTTGAAAAA TTAGACGTTT TCTTATCGTC TATCGAGTCT
CGTTTAGACA ACTTTGAGCA CTTTTTCAAA TTCAAGTCCC AAGAACTCAA AGAAGCTAAA
GAGCTCAACT TGTCAACTTT TGCGGAAAAT GTCTCTCGTT CAAGACGAGA TAGCACAGCT
AGCTTGTCCA ACATCAAGAA CTACTCCATC AACAATTTGA ACCTCATTCA CCAGAGACTT
CATCTCATCA AGGACTCCGT ATTGCGATCT TCTTTCACCA ACTTGGAGTA CTTGTACAAT
ACGCTTGATG ACCAGTATAA CTATTTATTC AACAGCACAT CTTCTGAAGA TAGTGATTCT
GCAAGTTCCG TAGACAAAGA ATTAATAGCT AGTTCAAGCA ACAGAAAGGA GATCTTGTCT
CAAAAAATCA TAGCCACCAT CCAGTATTTC GATGAAAAAT TGCTACTGAT CGATGCCTTC
ATCAACGACA ACAAGCCAGA AAGAAGACTC GACTACGAAG AAGACTCCAC ACTTCTTCAG
TTACGATTTT TTAACTTCAA TAGAGCTTTG AAGAATGCCA AAGACAGATA CTTGCACTAC
TACGAGTTGC CTCTAAGTTG GAGAGAAAAC AAGTACATAG TATATGGTTA TCGGTACTCG
TTGAACCACA GCACCATGCT AAAGTCGATC TTTGAATTCA ACCACAACGA ATCTATGAAC
ATATGGACCC ATATCATAGG ATTCTTTTTT GTGGGATATT TAACTCTCTG GCACTTCCCT
AATACCGATG TTTACAAGCT GAACTCGTTC AATGACAACT TGCCCGTCTT CGTCTTTTTT
GGAGCGGCCT TGAAGTGTCT TATCAGTTCT GTGACATGGC ATACGTATTC CTGTTTTGCC
CACTTGCCTA CCAGACAAAT GTGTGCATGT GTAGACTATA CTGGAATCAC TGTGTTGATT
ACTTGTTCCG TTATAGCTGC AGAATACTGT GCATTGTTCA ATTACCCCAA GATCCTAAAG
GTGTATATAA CGTTTTCCAC GATTTGTGGT ACCTCCGGGT TCGCATTTAA TTGGTCGCCT
TACTTCGATA AGCCCGAATG TAGATCCGTG AGAATCGGCT TCTTCATGGG GTTAGCATTT
TTGGGTGCAT CTGCCGGTGT TTGTATGGCG ATATACGAAG GCGTTCTCCC CACACTCAAG
TTCTTCTTTC CTCTTGTCTA CAAGAGTTTT GTGTGGTACT GGTTGGGGGT TATTTTCTAC
GGAGGTCTAA TTCCAGAAAG ATGGAGATAT GATGTGATCA TTGAAGAGAA CACTAAAGTA
TGCCACCACA ACTATGACAC CAGAGACGTC TTATTGGACA ATATCGAAAA TAGTGGTCGT
GAAGAAATCG AAGAAATTGA AGAGGAGTTT GAGAATATCG AGGAAAAGCA TTTACCAGAG
GACGAAGAAG AGGCCAGATT CAAAGATATT ATAGCCAAAC ATTTTCCCGA GAAGCCAACC
ACTACTCCCT ACAGTCATGA GTTCCTTTCG TTGTGGTGGG TCGACTACTT TTTGGCAAGT
CATAATATCT GGCATATCTG TGTCGTGTTG GGGGTAGTGG GACATTATTT TAGTTTGTTG
GACATGTATA GAAATATCGA AAGAGCTGCT ACATAG
 
Protein sequence
METEIRHRGT SSNDKSFESV PTTEELLVEK LDVFLSSIES RLDNFEHFFK FKSQELKEAK 
ELNLSTFAEN VSRSRRDSTA SLSNIKNYSI NNLNLIHQRL HLIKDSVLRS SFTNLEYLYN
TLDDQYNYLF NSTSSEDSDS ASSVDKELIA SSSNRKEILS QKIIATIQYF DEKLLSIDAF
INDNKPERRL DYEEDSTLLQ LRFFNFNRAL KNAKDRYLHY YELPLSWREN KYIVYGYRYS
LNHSTMLKSI FEFNHNESMN IWTHIIGFFF VGYLTLWHFP NTDVYKSNSF NDNLPVFVFF
GAALKCLISS VTWHTYSCFA HLPTRQMCAC VDYTGITVLI TCSVIAAEYC ALFNYPKILK
VYITFSTICG TSGFAFNWSP YFDKPECRSV RIGFFMGLAF LGASAGVCMA IYEGVLPTLK
FFFPLVYKSF VWYWLGVIFY GGLIPERWRY DVIIEENTKV CHHNYDTRDV LLDNIENSGR
EEIEEIEEEF ENIEEKHLPE DEEEARFKDI IAKHFPEKPT TTPYSHEFLS LWWVDYFLAS
HNIWHICVVL GVVGHYFSLL DMYRNIERAA T