Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0134 |
Symbol | |
ID | 5055601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 121873 |
End bp | 122883 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640467713 |
Product | hypothetical protein |
Protein accession | YP_001152401 |
Protein GI | 145590399 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGAAAA TTAGGCTGGC CCTCGCGCCT GGGCTCAAGG TCCAGTTCTC TGACCGCGAA CAAGCCCTCA GGAAGGTGGA GGAGTGGGCC AAGGAAAGCA CCCGGCTCCC CGAAGTGGTT TTCGGCCCCG AGGGTTGCGG TAAAACCGCT TGGCTAAGGC AGTCCGCTGT GCTCCTCAAG GAGCTGGGCT ACCACGTGGT GTACGTTGAC CCTCTTCGAA AATACTTCGA GGCGTATACA GACGTAAAGG AGGTGGTGAG GAGACTGGCC GATGCGGCGG CTGAGGCGCT GGGGGAGGCC AAGGTTAAGC TTGCCTCCCT CGTCATTGAC GTGACAGTGG AGCTTATAAA GCGGAGGAGG AAGAAGGTGG CTGTTTTGGT AGACGACGTA TTCCAAGCAG TAGGACTGAA CGAAGCCGCC TCGTACGTGA AGGCGCTCCT GGGGCTGATC GAGTATCCCC CTAGGAGCGG GGACTCTATT GTCGTTGTCG TGGCCACCGG TGAAGGGATT ACTAGGAGGG AGATTGGGAG GCACCGCTGG GCCGACTTAA TGCCGATGTG GAATATGCCG AGAGACGGCT TCAAACAGCT GTATGATCAG ATCCCGGGAG AGAAGCCGCT GTTTGAAGTG GTGTGGAAAG CCGCCGGCGG CAACCCAGGC CTATTGGCGT GGCTGTACAA GACAAAGTGG AGGTCAGACA GTGTCGTTAA GAAGCTGATA AGAGAAAAGA GGATAAAAGC CTTCATCAGC AAGTTGGGAG ACGAGGAGAG GGAGTTGCTG AGGAGAGCTG TGGAGGATCC CGACGTCCTC TACACGAGGG AGGGCATACC CCTAATGGAG AGATTAGTAG ACATGTACTT AATTGTAGAT ACCCTACCCG AAAGAGAGCC CTGGTTCTGG GCAGGTGAGT CGCCGTCGGA GAGGGATCCA GAGCTAGGCG TTGGGAAGGA CGTCGCCTGG CAGACCCCCC TCCACCGTGA GGCGGTGAAG AGGGCGCTGG AGGGAGCGTA G
|
Protein sequence | MKKIRLALAP GLKVQFSDRE QALRKVEEWA KESTRLPEVV FGPEGCGKTA WLRQSAVLLK ELGYHVVYVD PLRKYFEAYT DVKEVVRRLA DAAAEALGEA KVKLASLVID VTVELIKRRR KKVAVLVDDV FQAVGLNEAA SYVKALLGLI EYPPRSGDSI VVVVATGEGI TRREIGRHRW ADLMPMWNMP RDGFKQLYDQ IPGEKPLFEV VWKAAGGNPG LLAWLYKTKW RSDSVVKKLI REKRIKAFIS KLGDEERELL RRAVEDPDVL YTREGIPLME RLVDMYLIVD TLPEREPWFW AGESPSERDP ELGVGKDVAW QTPLHREAVK RALEGA
|
| |