Gene Information |
Locus tag | Pars_0381 |
Symbol | |
ID | 5055192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 329288 |
End bp | 330523 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640467948 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001152635 |
Protein GI | 145590633 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
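The coordinate and length fields above are mutually consistent, and a little arithmetic confirms it. This is a minimal sketch, assuming 1-based inclusive coordinates (the convention under which the listed values agree) and a single terminal stop codon on an unspliced CDS. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(329288, 330523)  # Start bp and End bp from the record
print(length_bp)                  # 1236
print(protein_length(length_bp))  # 411
```

The 1236 bp correspond to 412 codons: 411 amino acids plus the stop codon.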
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACTATA GCGTTGTCGC ACACCGCTTC TGGGGCGACC CAGGTGGGGG GCAATTGGTA TGTGCCGCCG TTGCCTACTC GTTGGAGGGA CTTGGTCTAA CGCCTGTGTT GTCCGGGGTG TTCAAGTTTG ACCCAGCAAA GTACAAGGAA TGGTTCGGCA TAGATCTGTC GAGATACCCC GTCGTCACGT TACCGTTTGA GCTGAATGCC TTCGGTCTCT ACTCCCGCCT AGCTTCGTGG TGGCCGGCTA AAAAAGCTAT AGATAAATAC AAGCCGTCGT TGGTGTTTAT AGATGAGCCG ACTTATAAAC CTCTGGCTAA AGGGAGAATG TATCGGCTTA TAGAATACAT TCATTTTCCG CTGGAGGTGG TTCTCAGCCC CGAGATTAAG AAACGGGCGT ATGCGGAGGG CCGTGATCCT TACTTCGAAG AGCGGTACTC GAAATTCCCG CTCAACGTGT ACTGGTGGCT TTTCTCGAAG CTGTTGCCAA TGGTTAAAAG AGAGAATCCT TTCCACTCGG CCGATCTCGT CCTCGTGAAC TCCCGGTGGA CGGCCGACCT GGTGCAACTC GCCTTTGGGG AGAGGCCGGA GGTGCTCAAC CCGCCCATAG CGCCTAATGT CGACGTGATG GAGAGGCCGA GGCCCTTCGA GGAGCGTAAG CCTATCGTCG TCATGCTAGG CCGCTTCTCG CAGGAGAAGC GCTACCACTG GGTGGTAAGG GAGGTTGCGC CGCGCCTCGT TAAGGAGATC CCCGGCGCTA GGCTTGTTAT TTTCGGCGGG GCGGCCACGC CGACGCTGAG GGCCTACTAC GAGCGCGTCA AGAGCCTCGC CTCGGAGGCG GGGCTGAGGG TCTCAGACGA CTTGTCCAAG GAGGCCGATG TCTATCTTGT GGCCAACGCC CCCCGCCGCC TCATAAACGA GGTGATGGAC GGGGCTAGGG CGTTTCTCCA CGCGACGATA AACGAGCACT GGGGCATCGC GGTGGCAGAG GCCATGGCCC GTGGATTGCC AGTGGTTGTC CACAAAAGCG GCGGCGCCTG GACAGACCTG GCGGAAGAGG GCCGCGTCGG CTTGGGCTAC GAAGACGCCG GCGGGGCAGT AGACGCGGTG GCGCGGCTCC TCACAGACGG CAGGCAGTGG GCCGTCCTAT CGGCGAAGAG CGTGGAGAAA GCCAGGGGCC TGCGCCTAGA GATCTTTGCG CAGAAATTTG GCGAGTTTGT AAGAAGCTTG TCATAA
|
Protein sequence | MNYSVVAHRF WGDPGGGQLV CAAVAYSLEG LGLTPVLSGV FKFDPAKYKE WFGIDLSRYP VVTLPFELNA FGLYSRLASW WPAKKAIDKY KPSLVFIDEP TYKPLAKGRM YRLIEYIHFP LEVVLSPEIK KRAYAEGRDP YFEERYSKFP LNVYWWLFSK LLPMVKRENP FHSADLVLVN SRWTADLVQL AFGERPEVLN PPIAPNVDVM ERPRPFEERK PIVVMLGRFS QEKRYHWVVR EVAPRLVKEI PGARLVIFGG AATPTLRAYY ERVKSLASEA GLRVSDDLSK EADVYLVANA PRRLINEVMD GARAFLHATI NEHWGIAVAE AMARGLPVVV HKSGGAWTDL AEEGRVGLGY EDAGGAVDAV ARLLTDGRQW AVLSAKSVEK ARGLRLEIFA QKFGEFVRSL S
|
| |
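The protein sequence can be derived from the gene sequence under translation table 11. What follows is a minimal sketch rather than a reference implementation. It relies on table 11 using the standard codon assignments and differing only in which codons may initiate translation, so the GTG start codon of this gene is read as methionine. The helper `translate_cds` and its `is_start` parameter are hypothetical names, and only the first 30 bases and the last 6 bases of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in NCBI order (last base varies fastest); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate a CDS fragment, ignoring whitespace, up to the first stop codon.

    If is_start is True, the fragment is assumed to begin at the start codon,
    and that first codon is emitted as 'M' whatever it normally encodes.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if (i == 0 and is_start) else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACTATA GCGTTGTCGC ACACCGCTTC"))  # MNYSVVAHRF
# Last 6 bases: a serine codon followed by the TAA stop.
print(translate_cds("TCATAA", is_start=False))  # S
```

The two results match the first ten residues and the final residue of the listed protein sequence.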