Gene Pars_2113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2113 
Symbol 
ID5054951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1888723 
End bp1889667 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content57% 
IMG OID640469665 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001154311 
Protein GI145592309 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.661925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATTG TAGTGACGGG CGGAGCCGGC TTTATAGGTA GCCACATCGT GGATAGACTC 
GTCGAGGAGG GCCACGAGGT GGTGGTTGTT GACAACTTAT CCAGCGGCAG GAGGGAGTTT
GTGAACAAGT CTGCCGAGTT CCACGTAAGG GATCTAAAGG AAAGGGAGTG GGGTGTGGGA
ATCAGGGGGG ATGTCGTCTT CCACTTTGCG GCGAATCCGG AGGTTAGGAT CTCCACTACG
GAGCCCTCCG TCCACTTTAA CGAAAACGTG TTGGCAACGT TCAACGTCTT AGAGTGGGCG
AGGCAGACGG GGGTGAGGAC CGTGGTGTTT GCCTCCTCTT CCACGGTATA CGGCGACGCC
CAAGTTCTGC CCACCCCAGA GGAGGAGCCG CTTAGGCCTA TCTCGGTATA CGGCGCTGCA
AAGGCGGCAG GCGAGATAAT GTGCGGAACC TACGCCCGGC TCTACGGCAT TCGCTGTCTG
GCAATCCGCT ACGCCAATAT TGTTGGGCCG AGGCTGAGGC ACGGCGTCAT ATACGACTTC
ATTATGAAGC TGAAGAAGAA CCCAAACGTC CTCGAAGTTC TCGGAGACGG GACACAGAGG
AAGAGCTACC TCTATATAAA AGATGCCGTG GACGCCACGC TCCTTGCGTG GAGGAAATTC
GAGGAGTTGG GCGAGCCGTT CTTGGCGCTG AACGTCGGAA ATGTTGACGC CGTTAGAGTG
CTAGACATCG CCCAAATAGT GGCCGAAGTC CTCGGCCTCA AGCCTGAAAT AAAGCTAATC
CCTACAACTC CAGATGGGAG GGGGTGGCCT GGGGATGTGA AGTACATGAC CCTCTCTATC
AACAAGCTCT TAAAACTCAC TGGCTGGAAG CCGGCGATGA CAAGCGCCGA GGCGGTGCGA
AAGACCGCCG AGGAACTCGC CGGGGAGCTA TGGCGGACAC CGTAG
 
Protein sequence
MRIVVTGGAG FIGSHIVDRL VEEGHEVVVV DNLSSGRREF VNKSAEFHVR DLKEREWGVG 
IRGDVVFHFA ANPEVRISTT EPSVHFNENV LATFNVLEWA RQTGVRTVVF ASSSTVYGDA
QVLPTPEEEP LRPISVYGAA KAAGEIMCGT YARLYGIRCL AIRYANIVGP RLRHGVIYDF
IMKLKKNPNV LEVLGDGTQR KSYLYIKDAV DATLLAWRKF EELGEPFLAL NVGNVDAVRV
LDIAQIVAEV LGLKPEIKLI PTTPDGRGWP GDVKYMTLSI NKLLKLTGWK PAMTSAEAVR
KTAEELAGEL WRTP