Gene Information |
Locus tag | Pars_0591 |
Symbol | |
ID | 5056379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 526093 |
End bp | 527709 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468150 |
Product | cytochrome d1, heme region |
Protein accession | YP_001152835 |
Protein GI | 145590833 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCCG AGGACAAACA ACCGAGGCGT GACTTCCTCA AGGCGGCCGC CATGGCGGGG ATTGGCTTCG CGGTGGGTAG CTGGGCAGTG GCGCTGAGCC GCGGAAAAGC CGTTGTGGAG GTGACCCAGG AGAAAGTTGT GGAGACCAGA GTAATACCAC AGGTACAAGT AATGCAGACG GCTCCCACCG CGCCGGCGGC GCCGCCGCAC CCAGCTTTTG AAAAGAGGGG GCTGGCGTAT TTAAACCCTG AGACAATTCA GAACACGCTC AGGGTGCTGG TGCCGGAGGA CTCCCTATCC CCCAAGCCTA CTGCCTACAA TATAAATGAC TTGGACTGGA TAGCTATATT GATTGAGAGC CGCTACCACG AGCCGGGGGT CGAGATGGTG GGCGCCTACA CATTCCTAGA CATGAAAAAC TTCAATGTGT TGAAGAGGCT TAAGAACGCC GGCGACCGCG TCCACGTGGT GAGGTTCGGC CGGGAGGAGT GGCCCGAAAA CAAGAGGAGG TTTGCTCTAG GCATGTCGAG GGACTGCTGG CTTTCCAAGA TAGATCTCTA CACCATGCAG ATTGTGAGGC AGATAAAAAT AGGCGTCGAC TGCCGTAGCG CTGCATATGA CAAGGACGGG AAATACATAA TAGCAGGCTC CAAAGACCCG GGACACGTGG TTATACTAGA TGCAGACACC TTCAAGGTGT TGAAGGTGAT ACCATTCCTG GGCGTCTCTA AGTTCTTCCC AACCCCCATG ATGGGGCGCC AAGGGGCTAT ACTCACCACA GACCTGGGCT ACTGGCTTGT CAATGTAAAA GATGCCGAGA TGGTCCTAGT TATAGATTAC AGAGACCCGG AGTTTCCCAT TGTCCACGCA TTCACGAGCT ACGACAACAA CTCCAAGAAC AGAAGCGTGA AAGTGCAGAT AGGCGACAAG ACATATGAGG TCACTGGAAT TGGCAAGAGC CCACACGAGC TAAACAAACT GGATAAGGAG GGCCGCTACG TAGCCGTCAC CGGGCAGGAG AGCAACACCA TATCCATACT CGACATGAAG AACTTCGAAA TTATCAACGT CGTCCCGTGC GGCAAGAAGC CGCACCCAGG GCCTGGGACC TTAGTCCCCG GCAAGTACTT CCTAACCAAC GCAATTGCGG AGGGGAAGAT CACCGTTATA AACCTCCAGA CGATGGACGT GGAGAAGTAC ATCACCTACC CCAAGGAGTT CCCCGCCGAC ACAGGAGGGG GGCTATACTC CACTCCGCCG CTGCCAGACG GCAGAATCCC CAAGGGGCTG GCCTGGTTCG ACACGTCGTT TAACATAAAC AAGGGCGTAT TCGCCGTTGA CATTAACCTC ATGGACGTGG CCACCGCCCC GCCCAAGCCT GCCGTGTTCT CTACCAACAA GCCAGGCAAG TGGGCAATGC ACCCGGGCTA CACCCCCGAC GGGCGCTATG TGATAAGCGC CTTGGAGAGG ACAGACTCTG TGTATAGAGT AGACGCCGAG ACTGGCGAGA TCGTGGGAAC AATAAAGCTA AAGGAGATAG AGCCTGTCCA GTTGCTAGAA GAGCCCTCTC CCACCGGCAT ATTCCCAGCC TGGAGGATAA AGGCGCCTTG GTTCTAA
|
Protein sequence | MGAEDKQPRR DFLKAAAMAG IGFAVGSWAV ALSRGKAVVE VTQEKVVETR VIPQVQVMQT APTAPAAPPH PAFEKRGLAY LNPETIQNTL RVLVPEDSLS PKPTAYNIND LDWIAILIES RYHEPGVEMV GAYTFLDMKN FNVLKRLKNA GDRVHVVRFG REEWPENKRR FALGMSRDCW LSKIDLYTMQ IVRQIKIGVD CRSAAYDKDG KYIIAGSKDP GHVVILDADT FKVLKVIPFL GVSKFFPTPM MGRQGAILTT DLGYWLVNVK DAEMVLVIDY RDPEFPIVHA FTSYDNNSKN RSVKVQIGDK TYEVTGIGKS PHELNKLDKE GRYVAVTGQE SNTISILDMK NFEIINVVPC GKKPHPGPGT LVPGKYFLTN AIAEGKITVI NLQTMDVEKY ITYPKEFPAD TGGGLYSTPP LPDGRIPKGL AWFDTSFNIN KGVFAVDINL MDVATAPPKP AVFSTNKPGK WAMHPGYTPD GRYVISALER TDSVYRVDAE TGEIVGTIKL KEIEPVQLLE EPSPTGIFPA WRIKAPWF
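The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed above (526093 to 527709, 1617 bp, 538 aa).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-mer blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    length = gene_length(526093, 527709)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

To check the GC content field, pass the full gene sequence string from the record to `gc_fraction` and compare the rounded percentage with the listed value.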
|
| |