Gene Pars_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1564 
Symbol 
ID5055130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1414234 
End bp1415238 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content50% 
IMG OID640469105 
Productsugar phospate transferase 
Protein accessionYP_001153770 
Protein GI145591768 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000011841 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGAAATTG CGTTGGTAGG ATCTCTTAGC CCTAGGCTGG CTGTACCTTC GCCCTATGTC 
ATAGCTGGCG GCTTTAGACT TGTAGAACTG GCTGCATTGT CCACGTGCGA AGATGGCAAG
GCGGTGATCT ACGTAGAGGA TAGTGCGGAT CTACCCCTAA TCTTAGAGGG CTGTAAGCTT
GAGATCAGGA GGGGGATTCC CAGAGACGTT CCGAAAGTTC ACGTGGGGTG CGTCCCGTAC
TTGTTGAGAT CGAGAAATTT AAAATGCGGC GGCTTTTACG TAAATCTCGG CGGCGAAGAG
ATGGTCGTGG AGCCCCTCAC ATCGCTTGCC GATATTATTG AGAAAAACGT AGAAATTATG
AATATGGCCT TTAACAAACT TAGGGAGCTT GGCGTGGAGC TTGTAAGAGG AGACGTGAAA
GGAGAGACAA GGGGTTTGGT ATATGTAAGA GGCAAAATAT ACGAATACAC ATACGTTGAG
GGCCCCGCCG TGGTTGGGCC CTCCTCTGCG GTTTTGCCTT TTACCTATGT GAGACCGGGC
ACTACGCTTT ATTTCGACTC GAAGATCAGG GAGGAGGCCA AAAATGCTAT TCTCGACGCC
TATACCCGGA AGCAACATAC GGGATATCTA GGCGATGCGT ATGTATCTGC CTTTGTCAAC
ATGGGGGCCG GCACCACGGT CTCTAACTTG AAGAACACCC TGGGCCTAAT TAGGCCTTCC
TACACCTCCA GGGCGTACCG AAAGCTGGGC CCTATACTAG GCGAGTTTGT GAAGACGGCT
ATCGGCACTT TAATATACGG CGGGCGGTAT GTAGGCCCTC TCTCGCATGT ATACGGGGTA
GTTGATAGGG ATGTGCCTCC TCTTTCCATA TTCAAAAACG GAGAAGTGAA GCCGATGGAT
AGAGACAAGG CTATTGAGTT ATTACAGCGC GACTTGGCCC AATTCGGCCG CTCTGATCTT
GGCCCCTATT TTAAAACACG CCTAATCGAG CAGTCTCTGT TTTAG
 
Protein sequence
MEIALVGSLS PRLAVPSPYV IAGGFRLVEL AALSTCEDGK AVIYVEDSAD LPLILEGCKL 
EIRRGIPRDV PKVHVGCVPY LLRSRNLKCG GFYVNLGGEE MVVEPLTSLA DIIEKNVEIM
NMAFNKLREL GVELVRGDVK GETRGLVYVR GKIYEYTYVE GPAVVGPSSA VLPFTYVRPG
TTLYFDSKIR EEAKNAILDA YTRKQHTGYL GDAYVSAFVN MGAGTTVSNL KNTLGLIRPS
YTSRAYRKLG PILGEFVKTA IGTLIYGGRY VGPLSHVYGV VDRDVPPLSI FKNGEVKPMD
RDKAIELLQR DLAQFGRSDL GPYFKTRLIE QSLF