Gene Pars_0381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0381 
Symbol 
ID5055192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp329288 
End bp330523 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content59% 
IMG OID640467948 
Productglycosyl transferase, group 1 
Protein accessionYP_001152635 
Protein GI145590633 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTATA GCGTTGTCGC ACACCGCTTC TGGGGCGACC CAGGTGGGGG GCAATTGGTA 
TGTGCCGCCG TTGCCTACTC GTTGGAGGGA CTTGGTCTAA CGCCTGTGTT GTCCGGGGTG
TTCAAGTTTG ACCCAGCAAA GTACAAGGAA TGGTTCGGCA TAGATCTGTC GAGATACCCC
GTCGTCACGT TACCGTTTGA GCTGAATGCC TTCGGTCTCT ACTCCCGCCT AGCTTCGTGG
TGGCCGGCTA AAAAAGCTAT AGATAAATAC AAGCCGTCGT TGGTGTTTAT AGATGAGCCG
ACTTATAAAC CTCTGGCTAA AGGGAGAATG TATCGGCTTA TAGAATACAT TCATTTTCCG
CTGGAGGTGG TTCTCAGCCC CGAGATTAAG AAACGGGCGT ATGCGGAGGG CCGTGATCCT
TACTTCGAAG AGCGGTACTC GAAATTCCCG CTCAACGTGT ACTGGTGGCT TTTCTCGAAG
CTGTTGCCAA TGGTTAAAAG AGAGAATCCT TTCCACTCGG CCGATCTCGT CCTCGTGAAC
TCCCGGTGGA CGGCCGACCT GGTGCAACTC GCCTTTGGGG AGAGGCCGGA GGTGCTCAAC
CCGCCCATAG CGCCTAATGT CGACGTGATG GAGAGGCCGA GGCCCTTCGA GGAGCGTAAG
CCTATCGTCG TCATGCTAGG CCGCTTCTCG CAGGAGAAGC GCTACCACTG GGTGGTAAGG
GAGGTTGCGC CGCGCCTCGT TAAGGAGATC CCCGGCGCTA GGCTTGTTAT TTTCGGCGGG
GCGGCCACGC CGACGCTGAG GGCCTACTAC GAGCGCGTCA AGAGCCTCGC CTCGGAGGCG
GGGCTGAGGG TCTCAGACGA CTTGTCCAAG GAGGCCGATG TCTATCTTGT GGCCAACGCC
CCCCGCCGCC TCATAAACGA GGTGATGGAC GGGGCTAGGG CGTTTCTCCA CGCGACGATA
AACGAGCACT GGGGCATCGC GGTGGCAGAG GCCATGGCCC GTGGATTGCC AGTGGTTGTC
CACAAAAGCG GCGGCGCCTG GACAGACCTG GCGGAAGAGG GCCGCGTCGG CTTGGGCTAC
GAAGACGCCG GCGGGGCAGT AGACGCGGTG GCGCGGCTCC TCACAGACGG CAGGCAGTGG
GCCGTCCTAT CGGCGAAGAG CGTGGAGAAA GCCAGGGGCC TGCGCCTAGA GATCTTTGCG
CAGAAATTTG GCGAGTTTGT AAGAAGCTTG TCATAA
 
Protein sequence
MNYSVVAHRF WGDPGGGQLV CAAVAYSLEG LGLTPVLSGV FKFDPAKYKE WFGIDLSRYP 
VVTLPFELNA FGLYSRLASW WPAKKAIDKY KPSLVFIDEP TYKPLAKGRM YRLIEYIHFP
LEVVLSPEIK KRAYAEGRDP YFEERYSKFP LNVYWWLFSK LLPMVKRENP FHSADLVLVN
SRWTADLVQL AFGERPEVLN PPIAPNVDVM ERPRPFEERK PIVVMLGRFS QEKRYHWVVR
EVAPRLVKEI PGARLVIFGG AATPTLRAYY ERVKSLASEA GLRVSDDLSK EADVYLVANA
PRRLINEVMD GARAFLHATI NEHWGIAVAE AMARGLPVVV HKSGGAWTDL AEEGRVGLGY
EDAGGAVDAV ARLLTDGRQW AVLSAKSVEK ARGLRLEIFA QKFGEFVRSL S