Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1282 |
Symbol | |
ID | 5055215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1160075 |
End bp | 1161043 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640468829 |
Product | hypothetical protein |
Protein accession | YP_001153498 |
Protein GI | 145591496 |
COG category | [R] General function prediction only |
COG ID | [COG5643] Protein containing a metal-binding domain shared with formylmethanofuran dehydrogenase subunit E |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.692652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.303536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGCT GTAATCCGCA TAACGTAGCC TGCGCCTTCA AGGAGCTTGG CAGAAAGCTC CTGGCCCTAA TCGAGGAGAA GATAGGCGCT TTAGGCCCGG AAAACGCCTT CGTCGCCACT AACGCGAAGT TTCTGACCTC TAGAGGATAC AGCGTAGCTC CGGTCATGAC GGCGTTTCTA GAAAAAGGGC TCGAGCTTTA CTACAACGTG GTGCCGGTCC ACTCGCCGTA TTATGCCGAT TTACATCTGG TGGCATACAA CAAGGATTCC TCCAAAGGAG TATACGCGAG GGTCGCTAGA AGCGCCGTGG AGAAAATCGC CGCATGCGAC CGGGCGACTG CGGATTACAT CAAGTGGGAG CCCCTCTCGC CGGGAGGAGC TGAGGGGTGG AGCGATAAGG CCCGATATTT GACGATACTC GCGGCATGGC TCTCGGGCGC TTCACATGAG CTGTTGCTAG GCGCGGAGTT GCACAATCAC GTGTGCCCGG GCCTAATATC GGGCTATTTG ATCATACGCC GCCTGGAGGA ACTCGGCCTT CTCAAACAAG GCGTCGAGAT AAAACTCGTC TCCGCACCGC CGTGGTGCAA GGACGACTTG TTGATACAAG TCCTCGACGC GACGCCCGGC AAGAGGAACT TCTCCGTTAA GTTCTTAACG GAAGATCGCC GGCAAGAGCT GAGGAGATCG TTGGGAGGAG ATCCCGCATA TATAGCGTTT ATACACGACA GAGGGCGCGA GGAGGTTCTC GTGATCTCCT TCGACTGGGA GAAGGCTAGA GAAATCGCCG GCGTGGTGGG GGACACCCCA AGTTCGAAGC TGGCCATGTC CGCCGCCTTG TTGAGACACT TGGATGAGGC CGCGCAACTG ATCCACGTTA AACACGTAGA GGCAGGCGAT CCGGAGCTCT TCGAGAAGGC ATCTCTTTCG GGAACCGATC CATATAAGAC AGTATTACTA CATGAGTAA
|
Protein sequence | MKGCNPHNVA CAFKELGRKL LALIEEKIGA LGPENAFVAT NAKFLTSRGY SVAPVMTAFL EKGLELYYNV VPVHSPYYAD LHLVAYNKDS SKGVYARVAR SAVEKIAACD RATADYIKWE PLSPGGAEGW SDKARYLTIL AAWLSGASHE LLLGAELHNH VCPGLISGYL IIRRLEELGL LKQGVEIKLV SAPPWCKDDL LIQVLDATPG KRNFSVKFLT EDRRQELRRS LGGDPAYIAF IHDRGREEVL VISFDWEKAR EIAGVVGDTP SSKLAMSAAL LRHLDEAAQL IHVKHVEAGD PELFEKASLS GTDPYKTVLL HE
|
| |