Gene Information |
Locus tag | Pars_0515 |
Symbol | |
ID | 5054448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 466225 |
End bp | 467865 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468077 |
Product | hypothetical protein |
Protein accession | YP_001152762 |
Protein GI | 145590760 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3848] Phosphohistidine swiveling domain |
TIGRFAM ID | |
|
|
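The coordinate and length fields above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions, and if the CDS ends in a single stop codon that is not counted in Protein Length. Both readings are assumptions, not statements made by this record, and the helper names below are hypothetical. A minimal Python sketch of that check:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(466225, 467865)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values match the Gene Length (1641 bp) and Protein Length (546 aa) fields.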
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.292882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTACC CACTGAGGCA GAGCAGAAAG CCGGAAGAGG GCTTCTGGTA CCGCGACGTG GTGCACTTCG GCGACGCTCC CCTATACCCC TTGGACTCCT ACTTCACGGT GTCTATGATG GACTTAGCCC AGTCCTATTA CTACGGCAGG TATTTCTCCA TGCCCACCTC CTCGGGGAGG GACACGGCGT TGGTGGAAGG GAGGCCCTTT AGGACTTCAT ACCCCCCGAG GCCTTTTGCG GATATATTTG AGCGGAGGGC CAGGGAGTAT TTGGAGAATT GGGACGCGAA ATACGCCGAG TGGAAAAAAG AAGTCGTGGC GATAATTGAG GAGATGTCTA AACTGCCCGT GGATCTCACC GAGGGCGTGG ATTTGAACGG CGCGGCGCCG TATCGGGTAA TTGAGAGCTG GCTCAAGCTC TACCTCCTTT GGCTGAGGCT TTGGTTTAAA CACTACGAAT TCCTAATGTT GGGCTACCTC ATTTATCAAT TGTTTTATAA GTTTATAAAG ACGTTTTTCC CCGACGCGCC AGATCACCAC ATTTCAGAAA TGCTGGCCCA ACGCGACATT GACACCTTTA GGCCTACAAA AGAGCTGGAG AGGTTGGCGG AGCTGGCGCG CGAGTTGGGA ATCGCCGAGA GATTAGCTGC GTTTAGCAAC GCGGCTGAGA TGGAGAGATC TTTCGCCGAG AGCGGCGATC CTAAGGAGAG GAAGTGGCTT GAGGAGTGGA ACGCCGTGAA ATACCCCTGG TTCTATATAT CCACAGGGAC GGGCTTTTTA CATTGGGAGG AGAGGTGGAT AGACAACCTC GACATCCCCT TTACCTATTT AAAAAAGTTG TTGAAGGAAG GGCCTTCCCG CAAGCACGGG GAGAGGGGCG GGGTGCTGGC GAGGGGCTAC GCCGATTTAT TGCCCGAGGG CTACCGAGGG GTGTTTTATA AATACCTGGA GGCGGCCCGG AGGGCCTACC GCTACATAGA GGAGCACAGC TTCTACGTGG AGCACCTGGG GTTCACGGTG GGGTACAGGA AGATAAGGGA GTTCGGCCTC TTGCTGGCCA AACTGGGCGT GTTGGAGAGG GAGGACGACA TATGGTATTT GACCTGGGGC GAGGTTCTGG AGGCGTTGTT AGACGGGCTG ACCGGCTGGT GCAACCTCAC GGGACCGGCG GCGCATAAAG TTCTCCGTAT GCGCATAGCC GAGAGGAAGG CGTTGTTGGA GAAGATGGCC TCCTCCCAGC CGCCGACGCA TATAGGGGAG CAGGGGGAGG TGTCGGACGC CAATTTGGCC TTGCTCCACG GCGTCGGCAG GAAGACCGGC GGCGACGTGG TGGCCGGGAT AGCCGCATCG CCCGGCAGAG CCAGGGGGCG GGTAGTGGTG GTTAAAAGCC CGCGGGACTT GGAGAAGGTG GTCGAGGGGT GCGTAGTGGT GACGTCTACT ATTTCGCCCA CTTGGATCCC TGCGTTGAGG CTGGCCGCGG CCGTGGTGTC GGAAAGCGGA GGCGCCATGT CGCACGCGGC GATAATAGCT AGAGAGCTCG GGAAGCCGGC CGTCGTCGGG GCAGCGGGGG CCACCTCTCT TTTCAAAGAT GGCGACGAAG TGGAGGTCGA CGGCAATATA GGCGTGGTGA GGCGGGTATG A
|
Protein sequence | MMYPLRQSRK PEEGFWYRDV VHFGDAPLYP LDSYFTVSMM DLAQSYYYGR YFSMPTSSGR DTALVEGRPF RTSYPPRPFA DIFERRAREY LENWDAKYAE WKKEVVAIIE EMSKLPVDLT EGVDLNGAAP YRVIESWLKL YLLWLRLWFK HYEFLMLGYL IYQLFYKFIK TFFPDAPDHH ISEMLAQRDI DTFRPTKELE RLAELARELG IAERLAAFSN AAEMERSFAE SGDPKERKWL EEWNAVKYPW FYISTGTGFL HWEERWIDNL DIPFTYLKKL LKEGPSRKHG ERGGVLARGY ADLLPEGYRG VFYKYLEAAR RAYRYIEEHS FYVEHLGFTV GYRKIREFGL LLAKLGVLER EDDIWYLTWG EVLEALLDGL TGWCNLTGPA AHKVLRMRIA ERKALLEKMA SSQPPTHIGE QGEVSDANLA LLHGVGRKTG GDVVAGIAAS PGRARGRVVV VKSPRDLEKV VEGCVVVTST ISPTWIPALR LAAAVVSESG GAMSHAAIIA RELGKPAVVG AAGATSLFKD GDEVEVDGNI GVVRRV
|
| |
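The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal Python sketch for joining those blocks and computing length and GC fraction. It assumes only that the whitespace is layout and not part of the sequence. The function names are hypothetical, and the sample input is the first two blocks of the Gene sequence field.

```python
def join_blocks(blocked: str) -> str:
    """Remove layout whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the Gene sequence field.
sample = join_blocks("ATGATGTACC CACTGAGGCA")
sample_gc = gc_fraction(sample)
print(len(sample), sample_gc)
```

Run on the full Gene sequence field, the same functions give values that can be compared against the Gene Length and GC content fields of this record.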