Gene Information |
Locus tag | Pars_1533 |
Symbol | |
ID | 5054031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1390617 |
End bp | 1391825 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640469074 |
Product | threonine dehydratase |
Protein accession | YP_001153739 |
Protein GI | 145591737 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR00260] threonine synthase; [TIGR01127] threonine dehydratase, medium form |
|
|
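The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon (the listed gene sequence ends in TAA). The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1390617, 1391825)
print(length, protein_length(length))  # 1209 402
```

Both values agree with the Gene Length (1209 bp) and Protein Length (402 aa) rows.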
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.331802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCTTC TGGAGGAGGC AACTGCTATT ATTAAAGAAG AGCAGAAAAG AGGGCGGATA CACAGAACTC CCCTGCTTAG GTCTGAGTCG CTTTCCAGGC TGGCGGGGGG CGACGTCTTT CTGAAGCTGG AGGCCTTGCA GAAGACTGGC AGTTTCAAGA TTAGAGGGGC CTACTTCGCC ATGCACAAGT ACATACAGGA GGGGTACAGA GAGTTTATCA CAGCCTCGTC GGGTAACCAC GCCCAAGGGG TTGCCTACGC CGCACAGCTC CACGGCGTCA AGGCCACTGT GGTAATGCCC GAGTCCACAC CTTGGCTGAA GGTAAAGAAG ACTCAGGACT ACGGCGCCAC TGTGATTCTG CACGGCGAGA GTTACTACGA GGCGGAGCTT AAGGCCAGAG AGCTGTTGAG AGACGGCGTT AAGTTCCTAC ATGCTTACAA CGACTGGTTC GTGATATCGG GCCAGTCCAC CCTCGGCGTG GAGATAATTG AGGATCTCCA AGACGTCGAC TTGGTAGTAG TGCCGGTGGG GGGCGGCGGG CTGATCTCCG GCGTAGCCTA TGCGGTGAAG CAGAGGCGGC CCAGCGCCAA AGTGATAGGG GTCCAGGCCA GCGGAGCGCC CTCTGTCTAT CTGTCGTTGA AGGAGGGGCG GCCAGTCGTT ATCGAGCGGG TAGATACCAT AGCAGACGGT ATTGCCGTGA AGAGGCCGGG TGACATAACG CTTAAGCTTA TCCAGGAGTA CGTAGACGAC GTTGTGTTGG TAGACGATAA CGAGATTGTT GACGCTATCT TTCTCCTAAT GGAGAGGACT AGAGTGGTGG CTGAGGGGGC GGGCGCAGCG GCGGTGGCGG CCCTGATGTC TGGGAAGGTA AAGGCTGAGG GGCGGCGGGC CGTTGCCGTG GTCTCCGGTG GGAACATAGA CGCCCCGATT TTGATGAGGG TGTTAATGAA GGCGTTGGCT AGGCAGAGGC GGATTGTGAA ACTAGTAGGC GAAGTTCCGG ACCGGCCGGG TATGTTGGCA AAGGCGTCTT CTATCTTGGC GTCGCGCCAG GTTAACATCC TCGAGGTTTA CCACGAGCGC TACGACCCGG AACAGAGGCC TAACTACGTC CGCCTGTCTT TTGTAGTGGA GATACCGGCT ACGCTGGACT TGTCAAAGGT GATAGAAGAG CTCGAGAAGG CCGGGTTCTA CTTCAAGGTG TTAGACTAA
|
Protein sequence | MILLEEATAI IKEEQKRGRI HRTPLLRSES LSRLAGGDVF LKLEALQKTG SFKIRGAYFA MHKYIQEGYR EFITASSGNH AQGVAYAAQL HGVKATVVMP ESTPWLKVKK TQDYGATVIL HGESYYEAEL KARELLRDGV KFLHAYNDWF VISGQSTLGV EIIEDLQDVD LVVVPVGGGG LISGVAYAVK QRRPSAKVIG VQASGAPSVY LSLKEGRPVV IERVDTIADG IAVKRPGDIT LKLIQEYVDD VVLVDDNEIV DAIFLLMERT RVVAEGAGAA AVAALMSGKV KAEGRRAVAV VSGGNIDAPI LMRVLMKALA RQRRIVKLVG EVPDRPGMLA KASSILASRQ VNILEVYHER YDPEQRPNYV RLSFVVEIPA TLDLSKVIEE LEKAGFYFKV LD
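The sequence fields can be cross-checked against the GC content and protein sequence listed in this record. The following is a minimal sketch, not the database's own method: it builds the codon table in the usual NCBI ordering (bases T, C, A, G), whose amino acid assignments are the same for translation table 11 and the standard code, and it assumes the gene starts with ATG, as this one does, so alternative start codons are not handled. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first five codons of the gene sequence, followed by its stop codon.
print(translate("ATGATTCTTC TGGAG TAA"))  # MILLE
```

The first five codons (ATG ATT CTT CTG GAG) translate to MILLE, matching the start of the listed protein sequence. Passing the full Gene sequence field to `gc_fraction` and `translate` gives values that can be compared with the GC content and Protein sequence rows.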