Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0361 |
Symbol | |
ID | 5054869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 311458 |
End bp | 312522 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640467931 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001152618 |
Protein GI | 145590616 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATCC TTTTCGTGGC CCCCAGCTAC TACCCCCACG TAGGCGGCGT GGAGTACGTC GTGAAGAGTG TGGCCGAGAG GCTGGCCAAG TTGGGGCACG AGGTAGCTGT GCTGGCCGGG GAGCCCGGGG CGGAGGCGCC GAGGGAGGAG GAGGTAAACG GCGTGAGGGT CGTGAGGTGG CCCGTGTGGA GCCCGGGAGG GGCCTACCAC GTGCCGAGGG CACGGAGGAG GCTCGAGGCG CTCGTGAGGG ACGAGGCGCG GGTTGCTGAC GTGATCCACC TCCACAGCGT CCACAGCGTG TTCACCATGC ACGTGCTGAG GGCGGCGGGG GGCGTCGGCG CGAGGAAGGT GCTGACGCCG CATTACCACG GAACGGGCCA CACGCCGGTC AGGAGGGCTT TGTGGACCGC GTGGAGGATC GCCGTGAGGC GGCTACTGAA AGACGTCGAC GTGGTGCACG CGGTGTCTCC ATACGAGGCG GAGCTCGTGG AGAAACACTT CGGCCGGAGG CCGGTGGTCG TGGAACACGG CGTGGAGGAG TGGATAACGT CGGTGGAGTG GAGGCCCGAG AACTACGTCA TGTACAGCGG CCGCATCGAG AAGTACAAGA ACGTGCACAG GCTCGGAAAC ATCGTGAAGC TACTCAACGA GAGAGGACAC GACCTGGAGC TCAGGATATA CGGGGACGGC CCCTACAGGA GGGAGCTCGA GAGGCGCCTC AAGCGCGCAG GCGTGAAGCA CGTCGTGGAG CCGCCTCAGC CCTACGAGAA GTACATAGAG GCCTTGTCCC GCGCCGCGCT ATTCGCGCTA CTGTCAGAGA AGGAGGCGTT CGGCCAGACG ATCAACGAAG CAAACGCCGT GGGGACGCCG GCGGTGGCCG CGGAGCCCTG GGGGAAAAAC TTCGCCGGAA GGCCGAGGAC GCTCATCGTC CCGCTCCAGG GGCCCGACGA AATTATAGCT GATAAAATAA AGCGTTTCTT AGAGATAGTG CCGTCGAAAC CCAAGCCCAT AGTACCTACA TGGAGCCAAG TTGTGGCTCG ATATCTTTCC GCTCTCTATA TATAA
|
Protein sequence | MRILFVAPSY YPHVGGVEYV VKSVAERLAK LGHEVAVLAG EPGAEAPREE EVNGVRVVRW PVWSPGGAYH VPRARRRLEA LVRDEARVAD VIHLHSVHSV FTMHVLRAAG GVGARKVLTP HYHGTGHTPV RRALWTAWRI AVRRLLKDVD VVHAVSPYEA ELVEKHFGRR PVVVEHGVEE WITSVEWRPE NYVMYSGRIE KYKNVHRLGN IVKLLNERGH DLELRIYGDG PYRRELERRL KRAGVKHVVE PPQPYEKYIE ALSRAALFAL LSEKEAFGQT INEANAVGTP AVAAEPWGKN FAGRPRTLIV PLQGPDEIIA DKIKRFLEIV PSKPKPIVPT WSQVVARYLS ALYI
|
| |