Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_88627 |
Symbol | HDT2 |
ID | 4838081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 750073 |
End bp | 751788 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389396 |
Product | conserved hypothetical protein |
Protein accession | XP_001383775 |
Protein GI | 150864799 |
COG category | [R] General function prediction only |
COG ID | [COG1272] Predicted membrane protein, hemolysin III homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00146356 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACTG AAATCAGACA CAGAGGAACA TCTTCCAATG ACAAGTCTTT CGAGTCTGTT CCTACAACTG AAGAGCTTTT GGTTGAAAAA TTAGACGTTT TCTTATCGTC TATCGAGTCT CGTTTAGACA ACTTTGAGCA CTTTTTCAAA TTCAAGTCCC AAGAACTCAA AGAAGCTAAA GAGCTCAACT TGTCAACTTT TGCGGAAAAT GTCTCTCGTT CAAGACGAGA TAGCACAGCT AGCTTGTCCA ACATCAAGAA CTACTCCATC AACAATTTGA ACCTCATTCA CCAGAGACTT CATCTCATCA AGGACTCCGT ATTGCGATCT TCTTTCACCA ACTTGGAGTA CTTGTACAAT ACGCTTGATG ACCAGTATAA CTATTTATTC AACAGCACAT CTTCTGAAGA TAGTGATTCT GCAAGTTCCG TAGACAAAGA ATTAATAGCT AGTTCAAGCA ACAGAAAGGA GATCTTGTCT CAAAAAATCA TAGCCACCAT CCAGTATTTC GATGAAAAAT TGCTACTGAT CGATGCCTTC ATCAACGACA ACAAGCCAGA AAGAAGACTC GACTACGAAG AAGACTCCAC ACTTCTTCAG TTACGATTTT TTAACTTCAA TAGAGCTTTG AAGAATGCCA AAGACAGATA CTTGCACTAC TACGAGTTGC CTCTAAGTTG GAGAGAAAAC AAGTACATAG TATATGGTTA TCGGTACTCG TTGAACCACA GCACCATGCT AAAGTCGATC TTTGAATTCA ACCACAACGA ATCTATGAAC ATATGGACCC ATATCATAGG ATTCTTTTTT GTGGGATATT TAACTCTCTG GCACTTCCCT AATACCGATG TTTACAAGCT GAACTCGTTC AATGACAACT TGCCCGTCTT CGTCTTTTTT GGAGCGGCCT TGAAGTGTCT TATCAGTTCT GTGACATGGC ATACGTATTC CTGTTTTGCC CACTTGCCTA CCAGACAAAT GTGTGCATGT GTAGACTATA CTGGAATCAC TGTGTTGATT ACTTGTTCCG TTATAGCTGC AGAATACTGT GCATTGTTCA ATTACCCCAA GATCCTAAAG GTGTATATAA CGTTTTCCAC GATTTGTGGT ACCTCCGGGT TCGCATTTAA TTGGTCGCCT TACTTCGATA AGCCCGAATG TAGATCCGTG AGAATCGGCT TCTTCATGGG GTTAGCATTT TTGGGTGCAT CTGCCGGTGT TTGTATGGCG ATATACGAAG GCGTTCTCCC CACACTCAAG TTCTTCTTTC CTCTTGTCTA CAAGAGTTTT GTGTGGTACT GGTTGGGGGT TATTTTCTAC GGAGGTCTAA TTCCAGAAAG ATGGAGATAT GATGTGATCA TTGAAGAGAA CACTAAAGTA TGCCACCACA ACTATGACAC CAGAGACGTC TTATTGGACA ATATCGAAAA TAGTGGTCGT GAAGAAATCG AAGAAATTGA AGAGGAGTTT GAGAATATCG AGGAAAAGCA TTTACCAGAG GACGAAGAAG AGGCCAGATT CAAAGATATT ATAGCCAAAC ATTTTCCCGA GAAGCCAACC ACTACTCCCT ACAGTCATGA GTTCCTTTCG TTGTGGTGGG TCGACTACTT TTTGGCAAGT CATAATATCT GGCATATCTG TGTCGTGTTG GGGGTAGTGG GACATTATTT TAGTTTGTTG GACATGTATA GAAATATCGA AAGAGCTGCT ACATAG
|
Protein sequence | METEIRHRGT SSNDKSFESV PTTEELLVEK LDVFLSSIES RLDNFEHFFK FKSQELKEAK ELNLSTFAEN VSRSRRDSTA SLSNIKNYSI NNLNLIHQRL HLIKDSVLRS SFTNLEYLYN TLDDQYNYLF NSTSSEDSDS ASSVDKELIA SSSNRKEILS QKIIATIQYF DEKLLSIDAF INDNKPERRL DYEEDSTLLQ LRFFNFNRAL KNAKDRYLHY YELPLSWREN KYIVYGYRYS LNHSTMLKSI FEFNHNESMN IWTHIIGFFF VGYLTLWHFP NTDVYKSNSF NDNLPVFVFF GAALKCLISS VTWHTYSCFA HLPTRQMCAC VDYTGITVLI TCSVIAAEYC ALFNYPKILK VYITFSTICG TSGFAFNWSP YFDKPECRSV RIGFFMGLAF LGASAGVCMA IYEGVLPTLK FFFPLVYKSF VWYWLGVIFY GGLIPERWRY DVIIEENTKV CHHNYDTRDV LLDNIENSGR EEIEEIEEEF ENIEEKHLPE DEEEARFKDI IAKHFPEKPT TTPYSHEFLS LWWVDYFLAS HNIWHICVVL GVVGHYFSLL DMYRNIERAA T
|
| |