Gene Pars_0597 details


Gene Information

Locus tag: Pars_0597
Symbol:
ID: 5056127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 532439
End bp: 533653
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 55%
IMG OID: 640468156
Product: glycosyl transferase, group 1
Protein accession: YP_001152841
Protein GI: 145590839
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
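
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS is unspliced with a single terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # total codons minus the stop codon


length = gene_length_bp(532439, 533653)
print(length)                     # 1215
print(protein_length_aa(length))  # 404
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.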


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.287593
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATCG CCGTAGTGGC GCCCCAGAGC TCCCACTGGG AGGACACATA CCGCGCCGCC 
GCCGTCTTGG TAAAGGCGTT TCTAAAGCTT GGGCACAAGT CGTGGCTAAT TACAAGCATC
TTCCACGATG GAAGGCCGGC GGTCGACGTA GATGCCGTGG AGAAAAGCGA GGGTGGCTAC
GTGGTGGTGG AGGGGGACGT CTCCGGGGTT CCTGCTATCC GGGTAATCAG TGGCAGGTCC
CTAGTCCCGC CGTCTGTGAT ATATCTGAGA AACTTCCCAA GGGTGCTCAA CGCAATCGAC
GAGGCCTACG GCCTAGACGC TGTGGTGGTC GTATCAAGCT TCTGGAACGG GCCGGAGGAC
GTGGCGAGGT GGATTTCGAT AAAGAAGTCC CTCCTCACCA TCGGCGAGGT GTCTAAAAGG
CCTTTTTTCG TATACGTGCC CGTACTAGGT GGAAGGGCGC CTTTGAAAAA ACCTATGGAG
GCCGCCTCTA GAGTTATGTG GTCGACTCTC TACCTCCCAC AGGTTTTGCA ACAAGTCGAT
GTTGTGGTGG CCGTCTCTAG CAACGAGTTC TACGACCTGC GCCAATACCG CGTTCCAGAA
GATAAGATAG TTGAGTGCAG GGACTGGGTA GACCCCGACG TGGCCGAGCT AGCTGGGGGG
CAACTGGAAA GGCCCAAGCA AGCGGAGGGA TACGACTTCT ACGTCTCTTA CATTGGCCCT
CTTGACGAAG ACCGGAACAT ACGCGGCTTA ATAAAAGTCG CGGAGAGAAT CGCATCAATG
GGAAACGGAG CACTAATAGT CGCGGGGGCG GGTGAGGCAG AGGAGAAATT TAGGCGGGAG
GCAGAAGGCC GGAAGAACGT GATACTCATT AGAGAGCATG GTATTAGGAC CATAGCTTCT
ATTATTAGAT GGTCGCTGGC AGGCGTGGAC TTAGCCTTTT ACGAGCCGAT GGGCATAAGG
GCGCTGGAGT ACTTATACTT TGGAGTGCCG TACGCCGCTC CCCCGACCTC AAACGCGGCT
TACTTTATTA CTAACGGCGT AGACGGCATA CACCTAGAAA GCGCCAATGA CATAGAAGGG
TTTGTCAACT GGGTCTCAAC ATTATTGCGC GAGCCCGAGC TCAGAGACGA AATGAGCCTC
AAGGCAAGGA AAAAGGCAAC CGAGCGAACT GCCGTTAAGC TGGCGGAGAC TTTACTAATG
CGGCTGGCGT CATGA
 
Protein sequence
MNIAVVAPQS SHWEDTYRAA AVLVKAFLKL GHKSWLITSI FHDGRPAVDV DAVEKSEGGY 
VVVEGDVSGV PAIRVISGRS LVPPSVIYLR NFPRVLNAID EAYGLDAVVV VSSFWNGPED
VARWISIKKS LLTIGEVSKR PFFVYVPVLG GRAPLKKPME AASRVMWSTL YLPQVLQQVD
VVVAVSSNEF YDLRQYRVPE DKIVECRDWV DPDVAELAGG QLERPKQAEG YDFYVSYIGP
LDEDRNIRGL IKVAERIASM GNGALIVAGA GEAEEKFRRE AEGRKNVILI REHGIRTIAS
IIRWSLAGVD LAFYEPMGIR ALEYLYFGVP YAAPPTSNAA YFITNGVDGI HLESANDIEG
FVNWVSTLLR EPELRDEMSL KARKKATERT AVKLAETLLM RLAS
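
Both sequences are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the listing could be cleaned, its GC fraction computed, and the gene translated. All function names are hypothetical. The codon table is built from the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs only in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks from a sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon; stop codons appear as '*'."""
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as printed in the record.
first_line = "ATGAACATCG CCGTAGTGGC GCCCCAGAGC TCCCACTGGG AGGACACATA CCGCGCCGCC"
dna = clean_sequence(first_line)
print(translate(dna[:30]))  # MNIAVVAPQS
```

The first ten residues produced from the excerpt match the first block of the protein sequence. Running the same functions over the full listing would allow the reported gene length and GC content to be checked in the same way.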