Gene Pars_1533 details


Gene Information

Locus tag: Pars_1533
Symbol:
ID: 5054031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1390617
End bp: 1391825
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 57%
IMG OID: 640469074
Product: threonine dehydratase
Protein accession: YP_001153739
Protein GI: 145591737
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR00260] threonine synthase; [TIGR01127] threonine dehydratase, medium form
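
The gene and protein lengths listed above are consistent with the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon not counted in the protein length (the function names `gene_length` and `protein_length` are illustrative, not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count of a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record.
length_bp = gene_length(1390617, 1391825)
print(length_bp)                  # prints 1209
print(protein_length(length_bp))  # prints 402
```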


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.331802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCTTC TGGAGGAGGC AACTGCTATT ATTAAAGAAG AGCAGAAAAG AGGGCGGATA 
CACAGAACTC CCCTGCTTAG GTCTGAGTCG CTTTCCAGGC TGGCGGGGGG CGACGTCTTT
CTGAAGCTGG AGGCCTTGCA GAAGACTGGC AGTTTCAAGA TTAGAGGGGC CTACTTCGCC
ATGCACAAGT ACATACAGGA GGGGTACAGA GAGTTTATCA CAGCCTCGTC GGGTAACCAC
GCCCAAGGGG TTGCCTACGC CGCACAGCTC CACGGCGTCA AGGCCACTGT GGTAATGCCC
GAGTCCACAC CTTGGCTGAA GGTAAAGAAG ACTCAGGACT ACGGCGCCAC TGTGATTCTG
CACGGCGAGA GTTACTACGA GGCGGAGCTT AAGGCCAGAG AGCTGTTGAG AGACGGCGTT
AAGTTCCTAC ATGCTTACAA CGACTGGTTC GTGATATCGG GCCAGTCCAC CCTCGGCGTG
GAGATAATTG AGGATCTCCA AGACGTCGAC TTGGTAGTAG TGCCGGTGGG GGGCGGCGGG
CTGATCTCCG GCGTAGCCTA TGCGGTGAAG CAGAGGCGGC CCAGCGCCAA AGTGATAGGG
GTCCAGGCCA GCGGAGCGCC CTCTGTCTAT CTGTCGTTGA AGGAGGGGCG GCCAGTCGTT
ATCGAGCGGG TAGATACCAT AGCAGACGGT ATTGCCGTGA AGAGGCCGGG TGACATAACG
CTTAAGCTTA TCCAGGAGTA CGTAGACGAC GTTGTGTTGG TAGACGATAA CGAGATTGTT
GACGCTATCT TTCTCCTAAT GGAGAGGACT AGAGTGGTGG CTGAGGGGGC GGGCGCAGCG
GCGGTGGCGG CCCTGATGTC TGGGAAGGTA AAGGCTGAGG GGCGGCGGGC CGTTGCCGTG
GTCTCCGGTG GGAACATAGA CGCCCCGATT TTGATGAGGG TGTTAATGAA GGCGTTGGCT
AGGCAGAGGC GGATTGTGAA ACTAGTAGGC GAAGTTCCGG ACCGGCCGGG TATGTTGGCA
AAGGCGTCTT CTATCTTGGC GTCGCGCCAG GTTAACATCC TCGAGGTTTA CCACGAGCGC
TACGACCCGG AACAGAGGCC TAACTACGTC CGCCTGTCTT TTGTAGTGGA GATACCGGCT
ACGCTGGACT TGTCAAAGGT GATAGAAGAG CTCGAGAAGG CCGGGTTCTA CTTCAAGGTG
TTAGACTAA
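
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. A short sketch of doing that and computing GC content follows. It assumes GC content is defined as (G + C) / total length; the record does not state how its 57% figure was computed, and the helper names are illustrative.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a display-formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases (assumed definition: (G + C) / length)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGATTCTTC TGGAGGAGGC AACTGCTATT ATTAAAGAAG AGCAGAAAAG AGGGCGGATA"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full sequence block to `clean_sequence` and then `gc_fraction` gives the value for the whole gene.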
 
Protein sequence
MILLEEATAI IKEEQKRGRI HRTPLLRSES LSRLAGGDVF LKLEALQKTG SFKIRGAYFA 
MHKYIQEGYR EFITASSGNH AQGVAYAAQL HGVKATVVMP ESTPWLKVKK TQDYGATVIL
HGESYYEAEL KARELLRDGV KFLHAYNDWF VISGQSTLGV EIIEDLQDVD LVVVPVGGGG
LISGVAYAVK QRRPSAKVIG VQASGAPSVY LSLKEGRPVV IERVDTIADG IAVKRPGDIT
LKLIQEYVDD VVLVDDNEIV DAIFLLMERT RVVAEGAGAA AVAALMSGKV KAEGRRAVAV
VSGGNIDAPI LMRVLMKALA RQRRIVKLVG EVPDRPGMLA KASSILASRQ VNILEVYHER
YDPEQRPNYV RLSFVVEIPA TLDLSKVIEE LEKAGFYFKV LD
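
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that, not necessarily how the database derived it. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons); since this gene starts with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Raises KeyError on codons with non-ACGT characters (for example N).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGATTCTTC TGGAGGAGGC AACTGCTATT"))  # prints MILLEEATAI
```

The output matches the first ten residues of the protein sequence above; translating the full gene sequence the same way can be compared against the full protein sequence.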