Gene Pars_0375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0375 
Symbol 
ID5055058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp323340 
End bp324626 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content59% 
IMG OID640467944 
Producthypothetical protein 
Protein accessionYP_001152631 
Protein GI145590629 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.427757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGTAG TAGTATTAGG TTGCGGCTGG TCTGGCGTCG TGGCGGCGCA TAGTCTCAAG 
TCCAAATATC CCTCGGCGGG CGTCGTCTGT CTCGACAGGT CTTTTGACGG TGGTCTTCTG
CGGACCGAGG CGGTCGGCGG CTACCTATTC GACGTTGGGG GTTCTCACGT GCTCTTCAGC
CGGGACCCGG CTGTCGTTAA CGCCATAACG GCTATGGGCG GCCGTTGGGT TGCTAAGGAG
AGGAGGGCCT TTGTGTTGTT AGACGGCGTC TTCATCCCGT ACCCCTTCGA GAACGGGATA
TACGTCTTGC CGCCCGAGAG GAGGGCTAGG TACGGGCTTT CGCTGATAAG GGCGCTTATG
CAAGGCGATA GGAGACCGGA GAGCTTCAAG GAGTGGATAC TCAACACCTT TGGCGAGGAG
GTGGCCAAGG ACTACCTAAT CCCCTACAAC GAGAAGATCT GGAAGAGGCC GTTGGAGGAG
CTTTCGGCCG ACTGGGTATA CACGCCGGGC CGCCTCCCCC TACCCTCCCT CGAGGACATA
GTGAAGGCCG TGGCGGGGCT GGAGACAGTG GGCTATAGGG AGCAAGCGGT CTTCCGCTAC
CCCGAGGGGG GTATAATCGC GCAGTACCGG TCGGCGCTTA GAAAAGCCGA GGAGGCCGGC
GTCTTGCTCG TCAAGGAGGA AGTCAAGGAG GTGAAAAAGA GGACCGACGG CTTTTTGATA
AATGGAAGAC TCAGAGCGGA TCATATCGTC TCCACGCTTC CGCTCCGCGA TCTTCCGGCG
ATGCTGGACC CGCCTCCTCC CGAGGAGGTG TTTAAAGCGG CCGGGAGGCT GGACTACAAC
TCAGTGGCGG TGGTGGGGCT GGGGCTTAGG GCTAAGGCCC CGCCGCAGCA CTGGGTCTAC
GTGCCGGATA GGCGCGTCGT CTTCCACCGC TACGCCTGGA TATCCAACTA CCTCCCGGAG
CCTCCCGAGG ATAGGTCGGC TCTTATCGCG GAGATAACAA TACCGCCAAG CCGCGAGGTG
GATACGGAGG CTCTGGCGGC CGAGGCTGTG AGGGGGCTTT CAGAACTGGG CATTGTGAGG
GAGAAAGACG TGGAGGTCGT CAAGGTTTGG CTTCACAAAT ACGGCTATCC CATATACACG
AGGACCCACC GGCAGGACCG AGAAGCCGTG GAGAGGTACC TAGCCGAGGT CGGCATAGCC
ACCTTCGGCA GATGGGGAAA CTGGCACTAC TGGAACACCG ACGCGATATA CAAGAGGGCT
ATGGAAATTC GTAACTTAGT GTCTTAA
 
Protein sequence
MKVVVLGCGW SGVVAAHSLK SKYPSAGVVC LDRSFDGGLL RTEAVGGYLF DVGGSHVLFS 
RDPAVVNAIT AMGGRWVAKE RRAFVLLDGV FIPYPFENGI YVLPPERRAR YGLSLIRALM
QGDRRPESFK EWILNTFGEE VAKDYLIPYN EKIWKRPLEE LSADWVYTPG RLPLPSLEDI
VKAVAGLETV GYREQAVFRY PEGGIIAQYR SALRKAEEAG VLLVKEEVKE VKKRTDGFLI
NGRLRADHIV STLPLRDLPA MLDPPPPEEV FKAAGRLDYN SVAVVGLGLR AKAPPQHWVY
VPDRRVVFHR YAWISNYLPE PPEDRSALIA EITIPPSREV DTEALAAEAV RGLSELGIVR
EKDVEVVKVW LHKYGYPIYT RTHRQDREAV ERYLAEVGIA TFGRWGNWHY WNTDAIYKRA
MEIRNLVS