Gene Pars_0552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0552 
Symbol 
ID5054519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp494019 
End bp495101 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content49% 
IMG OID640468114 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_001152799 
Protein GI145590797 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000617979 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTCC TGGCTGGGTT AGGCCTTGCT ATAGGGAGGG TGATAAATTT GTTAAGAAAA 
GAGGGGGTTG ATGCGGTTGA GTTTAAAACA TCTATCGTGG CCTCCGATTT AGAGGTTCCC
TGGTCTATTA CCCCTCTTGG AGGTAGGCGT TATCTAGTTA CAGAGCGCCC TGGTCGCTTA
GTGTTGATAA GCCCCAGCGG AAAAAAGCTC GTGGCTTCAT TTGACGTGGC AAGCGTCGGC
GAGGCAGGCC TGCTGGGTTT GGCGCTACAC CCTGATTTCC CTAAGAAAAC CTGGGTTTAT
CTCTACGCCT CCTACTTCGA CAGTGCGGGG CAGATAAAGA ATAAGTTAAT TAAAGGACGT
CTAGATCCAC TCACCTTTAG GCTTAGTGAA GTGAAGACTT TAATTGAGGA TATTCCGGGC
GCCTATATTC ATAATGGAGG GCGCATTAGG TTCGGTCCTG ACGGCATGTT ATACATAACT
ACAGGGGATG CGGCCAAGCC GCTACTTTCC CAAGACTTAT CCAGTCTAGG TGGTAAAATC
CTCCGCGTAG ATGACGATGG AAAACCTTCC CCTGATAACC CCTTCCCTAA CAGTCCCATC
TGGTCTTACG GCCACAGAAA TCCTCAAGGC ATTGACTGGC ACCCCGACAG TGGTGTGATG
GTAACAACTG AGCATGGCCC AGTAGGCCAC GACGAAGTAA ACGTAATAGT GAAAGGGGGC
AACTACGGGT GGCCGTTGGC AGTGGGGAAG GCCGATAGAG GCGAATTCAT AGATCCAATA
ATCGAATCGG GCGGAGATAC TTGGGCGCCT TCGGGGGCCT CCTTTGTGCA CGGAGATGCG
TTCCCAGAGC TTCGCAGTTG GTTGTTAATC GCATGTCTCA GAGGGAGTAT GATACTGGGA
GTTGAGTTTG TCAACCAAAT GAAAGTGTTT GGAATTCACA TGTTTTTTAA AAATGTCTTT
GGGAGACTCC GCGATGTTGT TATTGACGAA GACGGAGGTA TACTAATAAG TACCAGTAAT
AGAGATGGTA GAGGTAACCC GAGAGACGGA GATGATAAGA TTTTAAAAAT TGTCCCCGCC
TAA
 
Protein sequence
MALLAGLGLA IGRVINLLRK EGVDAVEFKT SIVASDLEVP WSITPLGGRR YLVTERPGRL 
VLISPSGKKL VASFDVASVG EAGLLGLALH PDFPKKTWVY LYASYFDSAG QIKNKLIKGR
LDPLTFRLSE VKTLIEDIPG AYIHNGGRIR FGPDGMLYIT TGDAAKPLLS QDLSSLGGKI
LRVDDDGKPS PDNPFPNSPI WSYGHRNPQG IDWHPDSGVM VTTEHGPVGH DEVNVIVKGG
NYGWPLAVGK ADRGEFIDPI IESGGDTWAP SGASFVHGDA FPELRSWLLI ACLRGSMILG
VEFVNQMKVF GIHMFFKNVF GRLRDVVIDE DGGILISTSN RDGRGNPRDG DDKILKIVPA