Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0793 |
Symbol | |
ID | 5055133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 706517 |
End bp | 707566 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640468354 |
Product | hypothetical protein |
Protein accession | YP_001153031 |
Protein GI | 145591029 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3425] 3-hydroxy-3-methylglutaryl CoA synthase |
TIGRFAM ID | [TIGR00748] hydroxymethylglutaryl-CoA synthase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTCG GCATAGTTAG CTGGGGAGCA TATATCCCCA AGTACCGTAT CCGGACCGAG GAGGTTGCGC GGATCTGGGG CGATGACCCG CTCCGCATAG TCGACGTCTA CCTCGTAGAT GAAAAAAGCG TGGAGGGCAT AGACGAAGAC GCGGTGACCA TAGCCGTGGA GGCTGCGAGG AGGGCCATAA GAAGGGCCGG CATAGACCCC AAGAAGATCG GCGCAGTATA CGCCGGCACC GAGTCGAAGC CATATGCCGT GAAGCCCATC TCCTCAATTC TCGTAGACGC CCTTGGCCTC AGTAACAACG TATTCGCGGT TGACATGGAG TTCGCTTGCA AAGCCGGTAG CGAGGGGTTA GTGGCAGCCA TTGGGCTGGT CAAGGCGGGG CAGGTGGAGT ACGGCATGAC CGTCGGCACC GACACCTCCC AAGGCGAGCC TGGGGAGCAC TTGGAGTACT CTGCAAGTAG CGGGGGCGTG GCATTGATAG TGGGCAGAGA CGGCGTCGCC GCCGAGCTTG AGGCCGTGTA TTCCTACGTC TCGGATACGC CCGACTTCTG GAGGAGGGAG GGCTCCCCCT ACCCCATGCA CGGCGAGGGC TTCACAGGCG AGCCGGCCTA CTTTAGACAC ATAATAGGTG CGGCGAAGGG GCTTATGGAG AAATACGGCT ACAAGCCCTC CGACTTTGCA TACGTGGTGT TCCACCAGCC CAACGGGAGG TTCCCCGTCC GCGCCGCATC TATGCTGAAC ATACCAATGG AGAAGATAAA GCCCGGCATT GTGGTGACTC ACATAGGCAA CACCTACAAC GCCTCTGCCC TCATGGGCTT TGCCAAGGTG CTGGACGTGG CGAAGCCCGG CGACAAGATC CTCCTAGTGC CCTTCGGCAG CGGCGCTGGG TCAAACGCCT TCGTCTTCAC CGTCACCGAC GTGGTGCAGG AGCGGCAGAA GACAGGTGTC CCCACGGTGG AAGACATGCT AAGAGACAAG ATCTACGTCG ACTACGCCCA GTACCTCAAA ATGCGTAAAA TGATCAAACT ATTTGACTAA
|
Protein sequence | MKVGIVSWGA YIPKYRIRTE EVARIWGDDP LRIVDVYLVD EKSVEGIDED AVTIAVEAAR RAIRRAGIDP KKIGAVYAGT ESKPYAVKPI SSILVDALGL SNNVFAVDME FACKAGSEGL VAAIGLVKAG QVEYGMTVGT DTSQGEPGEH LEYSASSGGV ALIVGRDGVA AELEAVYSYV SDTPDFWRRE GSPYPMHGEG FTGEPAYFRH IIGAAKGLME KYGYKPSDFA YVVFHQPNGR FPVRAASMLN IPMEKIKPGI VVTHIGNTYN ASALMGFAKV LDVAKPGDKI LLVPFGSGAG SNAFVFTVTD VVQERQKTGV PTVEDMLRDK IYVDYAQYLK MRKMIKLFD
|
| |