Gene Information |
Locus tag | Pars_0597 |
Symbol | |
ID | 5056127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 532439 |
End bp | 533653 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468156 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001152841 |
Protein GI | 145590839 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
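The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1215 bp) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 532439
end_bp = 533653

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is assumed to be a stop codon,
# which contributes no residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1215 0 404
```

Both derived values match the Gene Length and Protein Length rows.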
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.287593 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCG CCGTAGTGGC GCCCCAGAGC TCCCACTGGG AGGACACATA CCGCGCCGCC GCCGTCTTGG TAAAGGCGTT TCTAAAGCTT GGGCACAAGT CGTGGCTAAT TACAAGCATC TTCCACGATG GAAGGCCGGC GGTCGACGTA GATGCCGTGG AGAAAAGCGA GGGTGGCTAC GTGGTGGTGG AGGGGGACGT CTCCGGGGTT CCTGCTATCC GGGTAATCAG TGGCAGGTCC CTAGTCCCGC CGTCTGTGAT ATATCTGAGA AACTTCCCAA GGGTGCTCAA CGCAATCGAC GAGGCCTACG GCCTAGACGC TGTGGTGGTC GTATCAAGCT TCTGGAACGG GCCGGAGGAC GTGGCGAGGT GGATTTCGAT AAAGAAGTCC CTCCTCACCA TCGGCGAGGT GTCTAAAAGG CCTTTTTTCG TATACGTGCC CGTACTAGGT GGAAGGGCGC CTTTGAAAAA ACCTATGGAG GCCGCCTCTA GAGTTATGTG GTCGACTCTC TACCTCCCAC AGGTTTTGCA ACAAGTCGAT GTTGTGGTGG CCGTCTCTAG CAACGAGTTC TACGACCTGC GCCAATACCG CGTTCCAGAA GATAAGATAG TTGAGTGCAG GGACTGGGTA GACCCCGACG TGGCCGAGCT AGCTGGGGGG CAACTGGAAA GGCCCAAGCA AGCGGAGGGA TACGACTTCT ACGTCTCTTA CATTGGCCCT CTTGACGAAG ACCGGAACAT ACGCGGCTTA ATAAAAGTCG CGGAGAGAAT CGCATCAATG GGAAACGGAG CACTAATAGT CGCGGGGGCG GGTGAGGCAG AGGAGAAATT TAGGCGGGAG GCAGAAGGCC GGAAGAACGT GATACTCATT AGAGAGCATG GTATTAGGAC CATAGCTTCT ATTATTAGAT GGTCGCTGGC AGGCGTGGAC TTAGCCTTTT ACGAGCCGAT GGGCATAAGG GCGCTGGAGT ACTTATACTT TGGAGTGCCG TACGCCGCTC CCCCGACCTC AAACGCGGCT TACTTTATTA CTAACGGCGT AGACGGCATA CACCTAGAAA GCGCCAATGA CATAGAAGGG TTTGTCAACT GGGTCTCAAC ATTATTGCGC GAGCCCGAGC TCAGAGACGA AATGAGCCTC AAGGCAAGGA AAAAGGCAAC CGAGCGAACT GCCGTTAAGC TGGCGGAGAC TTTACTAATG CGGCTGGCGT CATGA
|
Protein sequence | MNIAVVAPQS SHWEDTYRAA AVLVKAFLKL GHKSWLITSI FHDGRPAVDV DAVEKSEGGY VVVEGDVSGV PAIRVISGRS LVPPSVIYLR NFPRVLNAID EAYGLDAVVV VSSFWNGPED VARWISIKKS LLTIGEVSKR PFFVYVPVLG GRAPLKKPME AASRVMWSTL YLPQVLQQVD VVVAVSSNEF YDLRQYRVPE DKIVECRDWV DPDVAELAGG QLERPKQAEG YDFYVSYIGP LDEDRNIRGL IKVAERIASM GNGALIVAGA GEAEEKFRRE AEGRKNVILI REHGIRTIAS IIRWSLAGVD LAFYEPMGIR ALEYLYFGVP YAAPPTSNAA YFITNGVDGI HLESANDIEG FVNWVSTLLR EPELRDEMSL KARKKATERT AVKLAETLLM RLAS
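The two sequences above can be checked against each other by translating the gene sequence. The following is a minimal sketch using only the Python standard library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool. Only the first three and last two blocks of the gene sequence are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and dropping a trailing stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any incomplete trailing codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    return "".join(residues).rstrip("*")

# First 30 bases and last 15 bases of the gene sequence listed above.
first = translate("ATGAACATCG CCGTAGTGGC GCCCCAGAGC")
last = translate("CGGCTGGCGT CATGA")
print(first, last)  # MNIAVVAPQS RLAS
```

The output matches the first block and the last four residues of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 404 residues.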