Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0333 |
Symbol | |
ID | 5055939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 286953 |
End bp | 287963 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640467909 |
Product | hypothetical protein |
Protein accession | YP_001152596 |
Protein GI | 145590594 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGAGGA TTAAACTGCC TTTTGTGCCG GGGCTGGAGG TGGAGTTTAC AGACAGGGAA ATGGCGCTCA AAAAGATAGA GGAGTGGGCC CGGGAGAGCA CCCGCCTGCC CCAGCTGGTG TTTGGCCCCG AGGGCTGTGG GAAGACGGCG TGGCTTAGGC AGTCGGCCGT GTTGCTTAGG GAGCTGGGGT TTCACGTAAT ATACGTCGAC CCTCTGCACA GGTACTTCGA GGCGTATACC GACGTGAAGG AAGTGGCTAG GAGGCTGGCC GAGGCCGCGG CGGGCGTTCT GGGAGATGCG GAGGCGTGGC TCGCCACCCT AGCCATCGAC CTCACCAAGC TCCTCATCGA GAGGATGAAG AGGAGGGTGG CCGTCCTCGC CGACGACGTA TTCCAAGCCA TAGGGCTGGG TGAGGCGGCG AAGTACGTCA AAGCCCTCCT CGGCCTCATA GAATACCCGC CGAGAAGCAT AGACGCCATA GTAGCCGTCG TGGCGACTAG CGAAGGCATA ACGCGGCGTG AAATAGGCCG CCACAGATGG GCTAACCACA GGCCCATGTG GAACATGCCC AGAGACGGCT TCCGAGAGCT CTACGACCAG TTGCCGGGCG AAAAGCCGCC GTTTGAAGAC GTATGGAAAG CCACCGGCGG GAATCCCCAC CTCCTGGGGC AACTCTACAA GGCCAAATGG GGCGCCGACA AGGTGTTGAA GGAATTAGCC GATGTCAAGA AAATAGCAAC TCTCGTCAGT GCGTTGGGAG AGGAAGAGAG GGAGTTGTTG AGACGGGCTG TCGACGATCC AGATGTCCTC TACACGAGGG AGGGCATACC GCTTATGGAT AAGCTCGTCG ACATGAACCT CGTAGTCGAC ACGTTGCCAG AGCGCGATCC CTGGTTCTGG GTCGGGGAGC TGCCGCCCGA GAAGGACTTA GAGCTTGGGA TCGGGAGGCG CGTGGCTTGG CAGACGCCGC TCCACAGAGA GGCGGTGAGG AGGGCGCTGG AGGGCGCCTA G
|
Protein sequence | MKRIKLPFVP GLEVEFTDRE MALKKIEEWA RESTRLPQLV FGPEGCGKTA WLRQSAVLLR ELGFHVIYVD PLHRYFEAYT DVKEVARRLA EAAAGVLGDA EAWLATLAID LTKLLIERMK RRVAVLADDV FQAIGLGEAA KYVKALLGLI EYPPRSIDAI VAVVATSEGI TRREIGRHRW ANHRPMWNMP RDGFRELYDQ LPGEKPPFED VWKATGGNPH LLGQLYKAKW GADKVLKELA DVKKIATLVS ALGEEERELL RRAVDDPDVL YTREGIPLMD KLVDMNLVVD TLPERDPWFW VGELPPEKDL ELGIGRRVAW QTPLHREAVR RALEGA
|
| |