Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1029 |
Symbol | |
ID | 4616525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 928957 |
End bp | 930165 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 639784126 |
Product | threonine dehydratase |
Protein accession | YP_930546 |
Protein GI | 119872539 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR00260] threonine synthase [TIGR01127] threonine dehydratase, medium form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000000000010709 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATATTAC TAGAAGAGGC ACTTTCAGTA ATAAGAGAAA AACAGAAAGA GGGTAAGATT CACCGCACTC CTACCCTACG CTCTGAGTCG CTTTCTAGAA TATCTGGAGG CGATGTATAT CTAAAGTTGG AATCTCTGCA GAAGACGGGT AGTTTCAAAA TTAGGGGAGC TTACTTCGCC ATGTACAAAT ATATGCAAGA GGGGTATAGA GAGTTTATAA CTGCGTCTTC AGGAAACCAC GCACAGGGAG TTGCATATGC CGCACAATTA CATGGAGTGA AGGCGACTGT GGTAATGCCA GAGACTACGC CTTGGCTTAA GGTAAAGAAG ACACAAGACT ATGGCGCATC TGTAATACTA TATGGCGAGA GTTACTACGA AGCAGAGAAG AAGGCTTATG AGCTTTTAAA AAGTGATGCA AAATTCCTTC ACGCATATAA TGATTATTAC GTCATATCAG GACAAGCTAC ATTAGGAGTT GAAATAGTAG AAGATGTAAA GGATGTAGAT GTTGTCATAG TTCCCGTAGG AGGCGGCGGG TTAATCTCCG GCGTAGCATA TGCCGTCAAA AAAATGAGAC CAAACGCAAA GATAATAGGG GTTCAAGCCA GCGGAGCACC TGCTGTATAT CTTTCACTTA AAGAGGGGAA GCCTGTCTTA ATTGAGCGAG TTGACACAAT AGCCGACGGC ATTGCTGTAA AACGGCCAGG AGACATCACG TTAAAAATAA TCCAGGAGTA TGTAGACGAT GTGGTACTAG TAGACGATAA CGAAATCGCC GATGCTATAT TTTTACTGCT AGAAAGGACG AGAGTAATAG CGGAAGGCGC AGGAGCTGTG GCAGTTGCAG CTCTAATGTC CGGTAAGGTA AATGTAAGAG GGAAGAAGGC CGTTGCTGTA GTGTCGGGAG GTAATATTGA CGCGCCTATT TTAATGCGTG TGTTAATGAA AGCTTTAGCA AGACAGAGAC GTATCATAAA ACTAGTCGGC GAAGTACCCG ACAGGCCGGG TACGCTAGCT AAAGCCTCAT CTATTCTGGC GTCTCATAAT GTAAACATAC TTGAAGTTTA CCATGAACGC TACGACCCTG AACAAAGACC CAACTACGTC CGCCTTGTCT TCATAGTGGA GGTACCCGGC ACATTAGATA TGTCAAAACT TCTAGATGAA CTTGAAAAAA ACGGCTTCTA CTTTAAAGTA ACGGCTTAA
|
Protein sequence | MILLEEALSV IREKQKEGKI HRTPTLRSES LSRISGGDVY LKLESLQKTG SFKIRGAYFA MYKYMQEGYR EFITASSGNH AQGVAYAAQL HGVKATVVMP ETTPWLKVKK TQDYGASVIL YGESYYEAEK KAYELLKSDA KFLHAYNDYY VISGQATLGV EIVEDVKDVD VVIVPVGGGG LISGVAYAVK KMRPNAKIIG VQASGAPAVY LSLKEGKPVL IERVDTIADG IAVKRPGDIT LKIIQEYVDD VVLVDDNEIA DAIFLLLERT RVIAEGAGAV AVAALMSGKV NVRGKKAVAV VSGGNIDAPI LMRVLMKALA RQRRIIKLVG EVPDRPGTLA KASSILASHN VNILEVYHER YDPEQRPNYV RLVFIVEVPG TLDMSKLLDE LEKNGFYFKV TA
|
| |