Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0754 |
Symbol | |
ID | 5056374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 672336 |
End bp | 674267 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640468313 |
Product | hypothetical protein |
Protein accession | YP_001152992 |
Protein GI | 145590990 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.39993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCTA GGTATGTAGA GTACCTAGGA GTGCCTCATT ACACTTTTTT AATTGTCTAC ACGCTGGCGG CGCTGTCGGT TCTCGCCCTC TTATACGGCC TTTCAAAGAC GTATAGAGGA GTTGGTCTCT TGGCGGTGAC TAAGCAGGGC GTGAGAAGAC TTGACAAGTC TTTGAGATTT GTGGCCGAGT TTCTGTCTCA TTACCGCTTC CTGTCCCGAC AGGTATTTGG AGGAACTGTC CACCTTCTTC TTATAGTTGG GCTTGGCATA TCTCTCATCG GCACTATTAT AGTGTCTGTG GTGCATTACA CGGGCGTGGA GTACGGAGGC TTATTCTTCT TAGTCATGAG GTTTTTGCTC GACGTCGCCG CCGTGTTTAT AATCTACGGG TCGTTAGCCG GCATATACAG GGTATATGCC GGTAGGGAGA GGTACGGGAA GCTGGCGAAC CAATACATCT TGGTGTTGTT AGGCTTTCTC GCAATAGCCG TAACTGGTAT GATAATGAGG AGATACCGCG TAGACTACTA CCTTGGGGGG CCCTCGCCGT GGTCTCCACT ATCGTATTTA GTGCCTCCTG TGACGCATGA GGTGTACCTC GCCGCCTACT TTGTGCACAT TGCCGTTGCA TTCGCGTTAA TAGCCGTGTC GCCTCTGGTA ATGCTACGAC ACATGTATTT GGCCTACGCC AACTACCTCC TCGTTGACAG GCCGCTGGGC GAGTTGACCA CGCCGTTTGA GCTTGAAAAG GTTATGGAGA GCGGCGAGGC GGAGATCACC GTGGGAGTTA AAAAGAAGGG GGAGTGGCGG GGAATACACG GCATAATGTT CGACGCCTGT ACTAGGTGTA GTAGATGCCA AGACGTCTGC CCCGCGTTTG CCGCCAAGAG GCCGCTCAGC CCTATGTCCC TAATAACAAA AATTGCAGAG GCTAAAAACG ACGCCGATTT ATTCGAGGTG GGGATAACGG AAGACGAGGT GTGGGCGTGT ACCACCTGCG GCGCTTGTAT GTACCAGTGC CCCGTATATA TCAGACACGT CGACTATATA GTAGATCTAA GGAGGGCTCT GGTGTTTGAG TCTAAGGTTG ACCAGAAAAA GGCAGATCTG CTCATGTCTG TAAGCCAGTA CCACAACACG CTGATGCAGG CAAATGTGGG GAGGCACGAC TGGCTTCGCG AACTTGGGGT TAAGCACATA TCGGAGAATC CCCAGGCCGA GTACCTCCTC TGGGTTGGGT GTATGGGTAG CTTCGACGGG CGGGCGAGGG AGATAGTCAA GGCTTTTGTA AAGATTCTCG AGAAGGCAGG CATGTTGGAC AAGGTGGCTG TGCTCGGAGA CGAGGAGACG TGTTGTGGAG ACCCCGTGAG GAGGCTGGGG GAGGAGAGCA GATTCCAAGA GCTTGTTCTA AACAACAAGC AGATATTTGA AAAGTACGGG GTGAAGAAGC TAGTTACGAT ATGCCCCCAC GGGTACAATA CTTTCAAAAA CGAGTATCCC AGATTCGGCG TAAAGCTGGA GGTGTACCAC CACGTAGAGG TGTTGCAACG CCTCGTAGAA GAGGGCAAGA TAACAGTGAG GGGGTCCTTG GAGAGTCTGA CAATACACGA CCCCTGCTAC TTATCTCGCC ACAACAAGGT GGTAGAGCCG CAGAGAAAGA TAGTAGTAAA GCTGGGGGCC TTGAAAGAGC CTCCTAGACA CGGGGAGAGG ACCTTCTGTT GCGGCGCGGG TGGGGCGAAC TATTGGTACG ACGTCCCCGA GGAGAAGAGG ATAAGCCACA TCAGGTTCGA GGAGCTGGCC GGCACCGGGG CTGAGACAAT AGTTACGCAA TGTCCCTTCT GCAACGCCAT GTTAACCGAG GCTAAGAGAG CCAAAGACAG TGCAGTAAAC GTGAAGGACA TAGCGGAGGT GGTAGCTGAG AAGCTGGCGT AA
|
Protein sequence | MDARYVEYLG VPHYTFLIVY TLAALSVLAL LYGLSKTYRG VGLLAVTKQG VRRLDKSLRF VAEFLSHYRF LSRQVFGGTV HLLLIVGLGI SLIGTIIVSV VHYTGVEYGG LFFLVMRFLL DVAAVFIIYG SLAGIYRVYA GRERYGKLAN QYILVLLGFL AIAVTGMIMR RYRVDYYLGG PSPWSPLSYL VPPVTHEVYL AAYFVHIAVA FALIAVSPLV MLRHMYLAYA NYLLVDRPLG ELTTPFELEK VMESGEAEIT VGVKKKGEWR GIHGIMFDAC TRCSRCQDVC PAFAAKRPLS PMSLITKIAE AKNDADLFEV GITEDEVWAC TTCGACMYQC PVYIRHVDYI VDLRRALVFE SKVDQKKADL LMSVSQYHNT LMQANVGRHD WLRELGVKHI SENPQAEYLL WVGCMGSFDG RAREIVKAFV KILEKAGMLD KVAVLGDEET CCGDPVRRLG EESRFQELVL NNKQIFEKYG VKKLVTICPH GYNTFKNEYP RFGVKLEVYH HVEVLQRLVE EGKITVRGSL ESLTIHDPCY LSRHNKVVEP QRKIVVKLGA LKEPPRHGER TFCCGAGGAN YWYDVPEEKR ISHIRFEELA GTGAETIVTQ CPFCNAMLTE AKRAKDSAVN VKDIAEVVAE KLA
|
| |