Gene Pars_0929 details


Gene Information

Locus tag: Pars_0929
Symbol:
ID: 5054268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 822955
End bp: 824085
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 54%
IMG OID: 640468485
Product: cytochrome c biogenesis protein, transmembrane region
Protein accession: YP_001153161
Protein GI: 145591159
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0785] Cytochrome c biogenesis protein
TIGRFAM ID:
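
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(822955, 824085)
print(length, protein_length(length))  # prints: 1131 376
```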


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.505775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTTCT TGGCGATTTT GCTGGTGTTG TCTCTTCTCC TCTCTGCGGC ATATGTGGAG 
AAGGGGGCTC TTCAATTCAA GCTCGTGGAC TCGCCTTACG ATGTGGAGGG CAAGTTTTTC
CTCTACATAT ACCAGCCTCT GTGCGAGGAG TGTAAGAAGT TAGAGGCTTT GTTCGGTAGG
GAGGACGTTG CTGCGGCGTT GTCGGGGTAT AGGTTGTATG CTCTTGACTT GTCGAAATCG
GGCGTTGCCG CAGTCGACGT CTATGTGGAG GGGCAGGTAG TATATGTAGA TCACGGAGTT
GTTAAGTACT TCGCCGCCTC TGGGAGGAGG AGTTTTCTAA TTCCCGGAAC GCCCACAGTG
TTGATCGGCA TTAAGGAGAA CGGCACGGTG AGACTTCTAG GTTTTTGGAC TGGGGCTGAT
ATGCCGCCTG GCGAGGACTT GGGTGCGGCG TTTGTGCGTT TTCTAGGCGA GGTGTCGGAG
TCCGGTAGTG CGGCTGAGTC GAGTTACCGC GTTGATGTAT TATGGGTGTT TCCGCTTGCG
TTTATCATGG GCGCCGTAAG CGCTTTTTCG CCCTGCGTAT TGCCTGTTCT CGGCATAGCG
GCGGTTACGC ACTTCGCGAG GAGGAGTCTG GGCAGAGTTC TAGCAGGCAT GGTGGTTTCT
TACGCGGCTA TCGGCGCGTT AGTCGCGGCG TCGGGCGTGG CGGCGGCTTC CGTAAGGCCC
TTGCTTGCGG CTATAGGCGG CGTCCTACTC GTGGCGCTGG GCGCAGTTCT CTTAGTAGAG
AGACTTAATG TAAAATACGC CACTGCTATG TCTAGGATTC AGACCTCTGC TTATAAAAAG
CTCTCCAAGG CTGGGGACTT CTTGGTAGGA GTGTCGCTCG GCGCCGTGTG GGTTCCATGT
ATCCTCCCCT ATATGGGCTT TGCCACTGTG CTGGCCCTCA TATCGCTTGC AGGGGATTAT
CTCTTGTTGC TAACGGCTCT GCTGATGTAC GGAGGGGGAC TTGCGTTGAT GGTATACGTA
ATTGTTAGGG GACTTCTTAA ACGTGTGAAA CCAAAGAGGT GGTATGAGAA AGCCGTGGGT
GTTTTATCGA TTCTAATCGG CTTTTACCTC GTGGCGTCGG TATGGATTTA G
 
Protein sequence
MRFLAILLVL SLLLSAAYVE KGALQFKLVD SPYDVEGKFF LYIYQPLCEE CKKLEALFGR 
EDVAAALSGY RLYALDLSKS GVAAVDVYVE GQVVYVDHGV VKYFAASGRR SFLIPGTPTV
LIGIKENGTV RLLGFWTGAD MPPGEDLGAA FVRFLGEVSE SGSAAESSYR VDVLWVFPLA
FIMGAVSAFS PCVLPVLGIA AVTHFARRSL GRVLAGMVVS YAAIGALVAA SGVAAASVRP
LLAAIGGVLL VALGAVLLVE RLNVKYATAM SRIQTSAYKK LSKAGDFLVG VSLGAVWVPC
ILPYMGFATV LALISLAGDY LLLLTALLMY GGGLALMVYV IVRGLLKRVK PKRWYEKAVG
VLSILIGFYL VASVWI
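
The sequences above are printed in space-separated groups of ten, so they need cleaning before any calculation. The following is a minimal sketch of how that could be done and how a GC percentage like the one in the Gene Information table could be recomputed from the gene sequence. The helper names are hypothetical, and the rounding the source database uses for its GC content field is not known.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full "Gene sequence" block between the triple quotes.
# Only the first two groups are shown here to keep the example short.
gene = clean_sequence("""
ATGAGGTTCT TGGCGATTTT
""")
print(len(gene), round(gc_percent(gene), 1))
```

With the full block pasted in, the length should come out as the 1131 bp reported under Gene Length.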