Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0362 |
Symbol | |
ID | 5056306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 312632 |
End bp | 313726 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640467932 |
Product | glycosyl transferase family protein |
Protein accession | YP_001152619 |
Protein GI | 145590617 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.279909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTTCG AGCTTGGGCT GGCTCTAGCG GCGCTCCACT TCGGCGTCCC CCTGGGATAC TACGCCGCCG CAAAGAGGTG GCTGAGGAGG GACTGGGGCA TAAAGGAGGA CGTTAGATAC ACGCCGAGAG TCACCGTGAT AATACCCACC TACAACGAGG CCGACAACAT CGCCCAGAGA CTCGAAAACA TCTACCAACA GGACTACCCC CGGGACAGGC TGGAGGTGAT CGTGGCGGAC GGTGCGTCGA CCGACGGGAC GCCCGAGATA GCGGAGAGGT GGGCGAGGGA GCATCCAGAT TTAAAGGTCA AGTTAATCCG AGAACCCCAG CGAAGAGGGC TCGTTCCCTC CTTGAACGAG GCTCTGAAAT ACGTGTCAGA TGGCAGCGAA ATAGTAATCT TCGCCGATGC TGACGCCTTA TGGCCACACG ATGCCATCTC GAAAATTGTG AAATATTTCG CGAACCCCTC TATAGGAGCG GTTTCCTCAA CCATAGCGCC GCTGGATTAC GATGAAAACG AGAGTACATA TAGGAGCTAC TTTAATGCGA TAAGAGTCGC GGAGTCTAAA AAGCACAGCA CACCTATACA CAATGCGCCG CTCATGGCCT TTCGAGCGGA GCTAATACGA AAAGTAGGCC TGCCGCTCTA CACGGGAAAT AACGATAGCA CGCCTGCATC CATAATAGCC TTCATGGGAT ATAGAGCTAT CTTGGTGGAC GACGTAGTAG CAAAAGAAAT ACTGAGAAAT CAAACCATGA GGAAAATTAG AAGGGCGCAA CATCTAATAT TACATTTCCT TAAAACAAAA CAATATGCAA AGAAGCGTGG ATTTTATAAA AAATCAGAAT TCGATATAAT ATGGAAAATC GAATGGTGGC TCCACATAGT CAACCCCTGG CTATTGATAG CCGGCATAGC CTTGCTAGCT ACGGCTCTGG TGCTATATAG ATCGCTGCAT GCACTTGTAT TGTTGGCCAT CGGAATGGCG TTGCTGACAT TCAAACTTTA CCGAGTGTGG ATCCAAAACC AACTATACCT GGTAGCCGGC TTCATCAGAA ATCTCTGGAA CAAAGACCTG GTCTGGGAAA AATGA
|
Protein sequence | MLFELGLALA ALHFGVPLGY YAAAKRWLRR DWGIKEDVRY TPRVTVIIPT YNEADNIAQR LENIYQQDYP RDRLEVIVAD GASTDGTPEI AERWAREHPD LKVKLIREPQ RRGLVPSLNE ALKYVSDGSE IVIFADADAL WPHDAISKIV KYFANPSIGA VSSTIAPLDY DENESTYRSY FNAIRVAESK KHSTPIHNAP LMAFRAELIR KVGLPLYTGN NDSTPASIIA FMGYRAILVD DVVAKEILRN QTMRKIRRAQ HLILHFLKTK QYAKKRGFYK KSEFDIIWKI EWWLHIVNPW LLIAGIALLA TALVLYRSLH ALVLLAIGMA LLTFKLYRVW IQNQLYLVAG FIRNLWNKDL VWEK
|
| |