Gene Pisl_1029 details


Gene Information

Locus tag: Pisl_1029
Symbol:
ID: 4616525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 928957
End bp: 930165
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 44%
IMG OID: 639784126
Product: threonine dehydratase
Protein accession: YP_930546
Protein GI: 119872539
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR00260] threonine synthase; [TIGR01127] threonine dehydratase, medium form
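
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are inclusive, 1-based coordinates and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 928957
end_bp = 930165


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1209 402
```

Both results match the Gene Length and Protein Length fields listed in this record.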


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000010709
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATATTAC TAGAAGAGGC ACTTTCAGTA ATAAGAGAAA AACAGAAAGA GGGTAAGATT 
CACCGCACTC CTACCCTACG CTCTGAGTCG CTTTCTAGAA TATCTGGAGG CGATGTATAT
CTAAAGTTGG AATCTCTGCA GAAGACGGGT AGTTTCAAAA TTAGGGGAGC TTACTTCGCC
ATGTACAAAT ATATGCAAGA GGGGTATAGA GAGTTTATAA CTGCGTCTTC AGGAAACCAC
GCACAGGGAG TTGCATATGC CGCACAATTA CATGGAGTGA AGGCGACTGT GGTAATGCCA
GAGACTACGC CTTGGCTTAA GGTAAAGAAG ACACAAGACT ATGGCGCATC TGTAATACTA
TATGGCGAGA GTTACTACGA AGCAGAGAAG AAGGCTTATG AGCTTTTAAA AAGTGATGCA
AAATTCCTTC ACGCATATAA TGATTATTAC GTCATATCAG GACAAGCTAC ATTAGGAGTT
GAAATAGTAG AAGATGTAAA GGATGTAGAT GTTGTCATAG TTCCCGTAGG AGGCGGCGGG
TTAATCTCCG GCGTAGCATA TGCCGTCAAA AAAATGAGAC CAAACGCAAA GATAATAGGG
GTTCAAGCCA GCGGAGCACC TGCTGTATAT CTTTCACTTA AAGAGGGGAA GCCTGTCTTA
ATTGAGCGAG TTGACACAAT AGCCGACGGC ATTGCTGTAA AACGGCCAGG AGACATCACG
TTAAAAATAA TCCAGGAGTA TGTAGACGAT GTGGTACTAG TAGACGATAA CGAAATCGCC
GATGCTATAT TTTTACTGCT AGAAAGGACG AGAGTAATAG CGGAAGGCGC AGGAGCTGTG
GCAGTTGCAG CTCTAATGTC CGGTAAGGTA AATGTAAGAG GGAAGAAGGC CGTTGCTGTA
GTGTCGGGAG GTAATATTGA CGCGCCTATT TTAATGCGTG TGTTAATGAA AGCTTTAGCA
AGACAGAGAC GTATCATAAA ACTAGTCGGC GAAGTACCCG ACAGGCCGGG TACGCTAGCT
AAAGCCTCAT CTATTCTGGC GTCTCATAAT GTAAACATAC TTGAAGTTTA CCATGAACGC
TACGACCCTG AACAAAGACC CAACTACGTC CGCCTTGTCT TCATAGTGGA GGTACCCGGC
ACATTAGATA TGTCAAAACT TCTAGATGAA CTTGAAAAAA ACGGCTTCTA CTTTAAAGTA
ACGGCTTAA
 
Protein sequence
MILLEEALSV IREKQKEGKI HRTPTLRSES LSRISGGDVY LKLESLQKTG SFKIRGAYFA 
MYKYMQEGYR EFITASSGNH AQGVAYAAQL HGVKATVVMP ETTPWLKVKK TQDYGASVIL
YGESYYEAEK KAYELLKSDA KFLHAYNDYY VISGQATLGV EIVEDVKDVD VVIVPVGGGG
LISGVAYAVK KMRPNAKIIG VQASGAPAVY LSLKEGKPVL IERVDTIADG IAVKRPGDIT
LKIIQEYVDD VVLVDDNEIA DAIFLLLERT RVIAEGAGAV AVAALMSGKV NVRGKKAVAV
VSGGNIDAPI LMRVLMKALA RQRRIIKLVG EVPDRPGTLA KASSILASHN VNILEVYHER
YDPEQRPNYV RLVFIVEVPG TLDMSKLLDE LEKNGFYFKV TA
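
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own method: it assumes the listed gene sequence is already in the coding orientation, uses the codon-to-amino-acid assignments of NCBI translation table 11 (which match the standard code for internal codons), and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, listed in the usual
# T, C, A, G order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon.

    Assumption: the sequence starts in frame and begins with ATG, so no
    special treatment of alternative start codons is needed.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above (60 nucleotides).
first_line = "ATGATATTAC TAGAAGAGGC ACTTTCAGTA ATAAGAGAAA AACAGAAAGA GGGTAAGATT"
print(translate(first_line))  # MILLEEALSVIREKQKEGKI
```

The output corresponds to the first 20 residues of the protein sequence; the final line of the gene sequence, `ACGGCTTAA`, translates to `TA` followed by a stop codon, matching the end of the protein.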