Gene Pars_0362 details


Gene Information

Locus tag: Pars_0362
Symbol:
ID: 5056306
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 312632
End bp: 313726
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 50%
IMG OID: 640467932
Product: glycosyl transferase family protein
Protein accession: YP_001152619
Protein GI: 145590617
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
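
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 312632
END_BP = 313726


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1095 364
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.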


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.279909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCTTCG AGCTTGGGCT GGCTCTAGCG GCGCTCCACT TCGGCGTCCC CCTGGGATAC 
TACGCCGCCG CAAAGAGGTG GCTGAGGAGG GACTGGGGCA TAAAGGAGGA CGTTAGATAC
ACGCCGAGAG TCACCGTGAT AATACCCACC TACAACGAGG CCGACAACAT CGCCCAGAGA
CTCGAAAACA TCTACCAACA GGACTACCCC CGGGACAGGC TGGAGGTGAT CGTGGCGGAC
GGTGCGTCGA CCGACGGGAC GCCCGAGATA GCGGAGAGGT GGGCGAGGGA GCATCCAGAT
TTAAAGGTCA AGTTAATCCG AGAACCCCAG CGAAGAGGGC TCGTTCCCTC CTTGAACGAG
GCTCTGAAAT ACGTGTCAGA TGGCAGCGAA ATAGTAATCT TCGCCGATGC TGACGCCTTA
TGGCCACACG ATGCCATCTC GAAAATTGTG AAATATTTCG CGAACCCCTC TATAGGAGCG
GTTTCCTCAA CCATAGCGCC GCTGGATTAC GATGAAAACG AGAGTACATA TAGGAGCTAC
TTTAATGCGA TAAGAGTCGC GGAGTCTAAA AAGCACAGCA CACCTATACA CAATGCGCCG
CTCATGGCCT TTCGAGCGGA GCTAATACGA AAAGTAGGCC TGCCGCTCTA CACGGGAAAT
AACGATAGCA CGCCTGCATC CATAATAGCC TTCATGGGAT ATAGAGCTAT CTTGGTGGAC
GACGTAGTAG CAAAAGAAAT ACTGAGAAAT CAAACCATGA GGAAAATTAG AAGGGCGCAA
CATCTAATAT TACATTTCCT TAAAACAAAA CAATATGCAA AGAAGCGTGG ATTTTATAAA
AAATCAGAAT TCGATATAAT ATGGAAAATC GAATGGTGGC TCCACATAGT CAACCCCTGG
CTATTGATAG CCGGCATAGC CTTGCTAGCT ACGGCTCTGG TGCTATATAG ATCGCTGCAT
GCACTTGTAT TGTTGGCCAT CGGAATGGCG TTGCTGACAT TCAAACTTTA CCGAGTGTGG
ATCCAAAACC AACTATACCT GGTAGCCGGC TTCATCAGAA ATCTCTGGAA CAAAGACCTG
GTCTGGGAAA AATGA
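
The gene sequence is printed in blocks of 10 bases separated by spaces and line breaks. A minimal sketch of how the blocks might be joined into one string and how a GC fraction could be computed from it follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample input is only the first line of the sequence, so its GC fraction need not match the whole-gene figure reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCTCTTCG AGCTTGGGCT GGCTCTAGCG GCGCTCCACT TCGGCGTCCC CCTGGGATAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two helpers would give a value to compare against the GC content field.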
 
Protein sequence
MLFELGLALA ALHFGVPLGY YAAAKRWLRR DWGIKEDVRY TPRVTVIIPT YNEADNIAQR 
LENIYQQDYP RDRLEVIVAD GASTDGTPEI AERWAREHPD LKVKLIREPQ RRGLVPSLNE
ALKYVSDGSE IVIFADADAL WPHDAISKIV KYFANPSIGA VSSTIAPLDY DENESTYRSY
FNAIRVAESK KHSTPIHNAP LMAFRAELIR KVGLPLYTGN NDSTPASIIA FMGYRAILVD
DVVAKEILRN QTMRKIRRAQ HLILHFLKTK QYAKKRGFYK KSEFDIIWKI EWWLHIVNPW
LLIAGIALLA TALVLYRSLH ALVLLAIGMA LLTFKLYRVW IQNQLYLVAG FIRNLWNKDL
VWEK
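
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the tool that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons), and it builds the codon table by hand with bases ordered T, C, A, G; `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence in the record.
print(translate("ATGCTCTTCG AGCTTGGGCT GGCTCTAGCG"))  # MLFELGLALA
print(translate("GTCTGGGAAA AATGA"))                  # VWEK
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above.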