Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0133 |
Symbol | |
ID | 5055581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 119927 |
End bp | 121225 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640467712 |
Product | hypothetical protein |
Protein accession | YP_001152400 |
Protein GI | 145590398 |
COG category | [R] General function prediction only |
COG ID | [COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAGC CCTGCTTGGT ACATATTAAT AGGCGGTCGC TCATGGCAAC CGTGCTGGTG CTCATAGGCA TCGACGACAC GGACAGCTAT ACAGGTGGGT GCACCACCCA CGTGGGCTAC CTATTAGCAA AAGAGGTGCT GAGGAAGTGG GGGGAAGAGG CGTTTCTGGA TTTCCCCCGC CTCGTTAGGC TGAACCCCAA CATCCCGTTT AAGACTAGGG GAAACGCCGC AGTGGCGCTT GCGCTGGAGG TGCCGGAGGG CGACGTGGAG GAGGTGTGGA GGCTCGCCTT GGACGTTGTT AAGAACAACG CAAGGCGAGA GGGCAAGACA GACCCGGGCC TCGCAATGGC TGTGAGGGAG GTGCCGAAGA GGGCGCGTGT GATCTACAAA ATGGCCTTGA CGCAGGTGGT GAGCAGAAGC GCCGCGGAGA GGGCTGGGCT GATTACCTGG GGCGGCCGCG GCGTTGTGGG GGCAGTAGCG GCAGTTGGGG CCGACTTGTC TAGATCCACC TTTGAGCTGA TTGCATACCG CGAGGGGGAG CGCGCGGCTA TCCCCGCCGA GGTTGTGAGG CTGATGGAGG CGCTGACGTA CCCCTTCACC TTCCACAACT TAGACGGCAG GCGGGTCCTA ATCCAGCCAA GGGGGCCGGA CCCCGTGTAC TACGGCATAA GAGGTCTTTC CCCTCCCCAT CTGCTACTCG CCCAGAGCAT CCTCGAAGCC CACGGCTACA AGCCCGCCGG CTGGGTGATA TACCGCACCA ACCAGGCGAC AGACGCACAT CTTGCACATG GGGTAATCCT GGCCGAGCCA ACCCCATACT CCTACTACCG TGTCAGGGGC GTTGTGGTCG AGGCCAGGAG GGTGGCGGGA AGGCACGTGG TGGGTAGGCT AGACAACGGG TTGGCGTTCG TGGCGTATAG ACACTTAGGG AGGCTGGCGA GCGAGCTGGA GAGGTGTGTT ATGTGCGACG TGGAGCTGTA CGGCGGCCTC AAGCCTAGGT TCGGCCAACT ATTTCTCTAC GTAGAGCGGG CATACGTCTT GGGGAGGTAC GCCCCGTCGC CTTCCCGCTG TCCCTACTGC TGGGGGTCGC TAGAGAGCTT GGGAAGGGAC AGGGGGTGGA GGTGCCGCCG CTGCGGCGTT GTGTTCAGGA GCAGAGAGGT GAGGTGGCTC TACGACAGCT CGATCGCGAA GGCAGTCTTC CCTAGACCAG GGGAGTGGAG ACACCTATTA AGGCCCCCGG ACCTGGACGT TGAGGTGGCC GGTTTCTTCT CGCCTAGATC TGTGGAGTGG ATTAGCTAA
|
Protein sequence | MSEPCLVHIN RRSLMATVLV LIGIDDTDSY TGGCTTHVGY LLAKEVLRKW GEEAFLDFPR LVRLNPNIPF KTRGNAAVAL ALEVPEGDVE EVWRLALDVV KNNARREGKT DPGLAMAVRE VPKRARVIYK MALTQVVSRS AAERAGLITW GGRGVVGAVA AVGADLSRST FELIAYREGE RAAIPAEVVR LMEALTYPFT FHNLDGRRVL IQPRGPDPVY YGIRGLSPPH LLLAQSILEA HGYKPAGWVI YRTNQATDAH LAHGVILAEP TPYSYYRVRG VVVEARRVAG RHVVGRLDNG LAFVAYRHLG RLASELERCV MCDVELYGGL KPRFGQLFLY VERAYVLGRY APSPSRCPYC WGSLESLGRD RGWRCRRCGV VFRSREVRWL YDSSIAKAVF PRPGEWRHLL RPPDLDVEVA GFFSPRSVEW IS
|
| |