Gene Pars_0591 details


Gene Information

Locus tag: Pars_0591
Symbol:
ID: 5056379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 526093
End bp: 527709
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 55%
IMG OID: 640468150
Product: cytochrome d1, heme region
Protein accession: YP_001152835
Protein GI: 145590833
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
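
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about how the record is encoded, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 526093
END_BP = 527709
REPORTED_GENE_LENGTH = 1617      # bp
REPORTED_PROTEIN_LENGTH = 538    # aa


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print("gene length matches:", computed_gene == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the three fields agree: 527709 - 526093 + 1 = 1617 bp, and 1617 / 3 - 1 = 538 aa.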


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGCCG AGGACAAACA ACCGAGGCGT GACTTCCTCA AGGCGGCCGC CATGGCGGGG 
ATTGGCTTCG CGGTGGGTAG CTGGGCAGTG GCGCTGAGCC GCGGAAAAGC CGTTGTGGAG
GTGACCCAGG AGAAAGTTGT GGAGACCAGA GTAATACCAC AGGTACAAGT AATGCAGACG
GCTCCCACCG CGCCGGCGGC GCCGCCGCAC CCAGCTTTTG AAAAGAGGGG GCTGGCGTAT
TTAAACCCTG AGACAATTCA GAACACGCTC AGGGTGCTGG TGCCGGAGGA CTCCCTATCC
CCCAAGCCTA CTGCCTACAA TATAAATGAC TTGGACTGGA TAGCTATATT GATTGAGAGC
CGCTACCACG AGCCGGGGGT CGAGATGGTG GGCGCCTACA CATTCCTAGA CATGAAAAAC
TTCAATGTGT TGAAGAGGCT TAAGAACGCC GGCGACCGCG TCCACGTGGT GAGGTTCGGC
CGGGAGGAGT GGCCCGAAAA CAAGAGGAGG TTTGCTCTAG GCATGTCGAG GGACTGCTGG
CTTTCCAAGA TAGATCTCTA CACCATGCAG ATTGTGAGGC AGATAAAAAT AGGCGTCGAC
TGCCGTAGCG CTGCATATGA CAAGGACGGG AAATACATAA TAGCAGGCTC CAAAGACCCG
GGACACGTGG TTATACTAGA TGCAGACACC TTCAAGGTGT TGAAGGTGAT ACCATTCCTG
GGCGTCTCTA AGTTCTTCCC AACCCCCATG ATGGGGCGCC AAGGGGCTAT ACTCACCACA
GACCTGGGCT ACTGGCTTGT CAATGTAAAA GATGCCGAGA TGGTCCTAGT TATAGATTAC
AGAGACCCGG AGTTTCCCAT TGTCCACGCA TTCACGAGCT ACGACAACAA CTCCAAGAAC
AGAAGCGTGA AAGTGCAGAT AGGCGACAAG ACATATGAGG TCACTGGAAT TGGCAAGAGC
CCACACGAGC TAAACAAACT GGATAAGGAG GGCCGCTACG TAGCCGTCAC CGGGCAGGAG
AGCAACACCA TATCCATACT CGACATGAAG AACTTCGAAA TTATCAACGT CGTCCCGTGC
GGCAAGAAGC CGCACCCAGG GCCTGGGACC TTAGTCCCCG GCAAGTACTT CCTAACCAAC
GCAATTGCGG AGGGGAAGAT CACCGTTATA AACCTCCAGA CGATGGACGT GGAGAAGTAC
ATCACCTACC CCAAGGAGTT CCCCGCCGAC ACAGGAGGGG GGCTATACTC CACTCCGCCG
CTGCCAGACG GCAGAATCCC CAAGGGGCTG GCCTGGTTCG ACACGTCGTT TAACATAAAC
AAGGGCGTAT TCGCCGTTGA CATTAACCTC ATGGACGTGG CCACCGCCCC GCCCAAGCCT
GCCGTGTTCT CTACCAACAA GCCAGGCAAG TGGGCAATGC ACCCGGGCTA CACCCCCGAC
GGGCGCTATG TGATAAGCGC CTTGGAGAGG ACAGACTCTG TGTATAGAGT AGACGCCGAG
ACTGGCGAGA TCGTGGGAAC AATAAAGCTA AAGGAGATAG AGCCTGTCCA GTTGCTAGAA
GAGCCCTCTC CCACCGGCAT ATTCCCAGCC TGGAGGATAA AGGCGCCTTG GTTCTAA
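
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from such a block; the helper names are hypothetical, and the rounding used by the source database is not known, so a recomputed value may differ slightly from the reported 55%.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as sample input;
# paste the full block to evaluate the whole gene.
sample = "ATGGGCGCCG AGGACAAACA ACCGAGGCGT GACTTCCTCA AGGCGGCCGC CATGGCGGGG"
seq = clean_sequence(sample)
print(len(seq), "bases, GC fraction:", round(gc_content(seq), 3))
```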
 
Protein sequence
MGAEDKQPRR DFLKAAAMAG IGFAVGSWAV ALSRGKAVVE VTQEKVVETR VIPQVQVMQT 
APTAPAAPPH PAFEKRGLAY LNPETIQNTL RVLVPEDSLS PKPTAYNIND LDWIAILIES
RYHEPGVEMV GAYTFLDMKN FNVLKRLKNA GDRVHVVRFG REEWPENKRR FALGMSRDCW
LSKIDLYTMQ IVRQIKIGVD CRSAAYDKDG KYIIAGSKDP GHVVILDADT FKVLKVIPFL
GVSKFFPTPM MGRQGAILTT DLGYWLVNVK DAEMVLVIDY RDPEFPIVHA FTSYDNNSKN
RSVKVQIGDK TYEVTGIGKS PHELNKLDKE GRYVAVTGQE SNTISILDMK NFEIINVVPC
GKKPHPGPGT LVPGKYFLTN AIAEGKITVI NLQTMDVEKY ITYPKEFPAD TGGGLYSTPP
LPDGRIPKGL AWFDTSFNIN KGVFAVDINL MDVATAPPKP AVFSTNKPGK WAMHPGYTPD
GRYVISALER TDSVYRVDAE TGEIVGTIKL KEIEPVQLLE EPSPTGIFPA WRIKAPWF
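
The protein sequence can be related back to the gene sequence by codon translation. The sketch below builds the codon table in the usual NCBI T, C, A, G ordering; the amino acid assignments of translation table 11 are the same as those of the standard code, and the tables differ only in which codons may act as start codons. Alternative start codons are not handled here, which is a simplification. This gene begins with ATG, so it should not matter for this record. The function names are illustrative only.

```python
from itertools import product

# Amino acids for codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
first_blocks = clean_sequence("ATGGGCGCCG AGGACAAACA ACCGAGGCGT")
print(translate(first_blocks))  # MGAEDKQPRR
```

The ten residues produced match the first block of the protein sequence above, MGAEDKQPRR.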