Gene Information |
Locus tag | Pars_0107 |
Symbol | |
ID | 5054775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 93482 |
End bp | 94792 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640467686 |
Product | hypothetical protein |
Protein accession | YP_001152374 |
Protein GI | 145590372 |
COG category | [R] General function prediction only |
COG ID | [COG4882] Predicted aminopeptidase, Iap family |
TIGRFAM ID | |
|
|
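The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with an untranslated stop codon; neither convention is stated in the record, although the listed values are consistent with both.

```python
# Values copied from the Gene Information table.
start_bp = 93482
end_bp = 94792

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1311 436
```

Both results agree with the Gene Length (1311 bp) and Protein Length (436 aa) rows.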
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGAGG TTTTTAGAAA ATGCACCGCC TACAAGGATT TAACGGCTGG AAGCCCCAGG GAGAGGGAGT TTCTGCACTG GCTTCTTACA TTTTTAGACA CGCCGAGCGT CTGGTTCCAC CTGTCCCCGG TGGAGGTCTT GGCCTGGGAG GAGGTAGAGA CTAGGCTCGA GGTGGGCGAC GTGGCCCTTT CAGGCTTGGC CTTGCCCTAC TCGAGGTCGG CATCTGTGGA AGGGAGGCTA GTGCCCGTTG ACGGGGATGT GGAGGGGAAT ATCGCCGTGG CCAAGTTTCC CGAGGATGTT GACGACGCCA AGTACTTGCT TTTAGAAGCG GCTCGGAGGG GGGCACAAGC CGTCCTCTTC ACTGGGAGGC CCCAGCGGCG TATTGTAATC TCCGGCGAGC CAGGGTTTAA GCTAGACTCG GCCCCCGCGC CCATACCAGC CGCAAGTTTC GAAGATTTAC AGAGCCACAT AGGGAAGAGG GCCCGGCTTG TGATGGACGT AAATATGCAG ACCACGTACA GCTACAACCT CGTAGCTTTC AATAGTCTAG ACAACACGCC GATGATATCT GCACATTGGG ACCATTGGCT TGTAGGCGCC GCGGACAATT GCGCCGGGGT TGAGGCGGCT GTTCTGGCGT TTAGCGAGCT GGTCGCCGAC GATGTGCCGA TAGCCCTGGG GCTATTTACA GCAGAGGAGG GGGTAGCCCC CCACCTGCCG TCTTTCTACT GGGCGTGGGG GTCGCTCAAC TACTTCAAAA GGTGGCAACC TACGCTTCTA GTCAACATTG ACGTAGTTGG GGCCGGCACG CCGAGGATTT ACGCAATGCC TTACCTGCAC AGGTATTTGA CGGGGTTGGG CCCCGTGGAG GACCCCGTAC CATATTTCGA CAGCGTGCAC CTCGAGAGGT GGGGGCTTCC ATCGGTAACC ATATCCTCGC TGAGGGACAC GTGGGGGATC TACCACAGCC CTCTAGACGC ATATGCTGAG CCTGAGAGTA TCCTATACGC GGCGGAGTTG GCAAAAAGGA TAGCCAAGAT TAAGCCGACG CCTCCCGAAG TGTCTTTGCA CGAGTACGGC ATTCCGCACG TGTATAGCCC GTACGAGGCT TGGTCTATTG TGTACAACTA TCTTGTGCTC TTCCGCGACT TTACGCACTC CGACATAGTC TACACCAATG TATTTAAATT TCTGAGAAGC GGCGCTGGGT ATCGCCGGAT AGATCTACTC GGCGGCCCAA CTCTCTGTGT GGAGGACTGC AAAAATGCTG TGGAGATTTA TCGAGAGCTA GTTCTGCTTA GACTGCTCTA A
|
Protein sequence | MAEVFRKCTA YKDLTAGSPR EREFLHWLLT FLDTPSVWFH LSPVEVLAWE EVETRLEVGD VALSGLALPY SRSASVEGRL VPVDGDVEGN IAVAKFPEDV DDAKYLLLEA ARRGAQAVLF TGRPQRRIVI SGEPGFKLDS APAPIPAASF EDLQSHIGKR ARLVMDVNMQ TTYSYNLVAF NSLDNTPMIS AHWDHWLVGA ADNCAGVEAA VLAFSELVAD DVPIALGLFT AEEGVAPHLP SFYWAWGSLN YFKRWQPTLL VNIDVVGAGT PRIYAMPYLH RYLTGLGPVE DPVPYFDSVH LERWGLPSVT ISSLRDTWGI YHSPLDAYAE PESILYAAEL AKRIAKIKPT PPEVSLHEYG IPHVYSPYEA WSIVYNYLVL FRDFTHSDIV YTNVFKFLRS GAGYRRIDLL GGPTLCVEDC KNAVEIYREL VLLRLL
|
| |
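The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the pipeline that produced the record. It assumes translation table 11 (as listed above), whose amino acid assignments match the standard code, and it assumes the first codon is a valid initiator and is therefore read as methionine even when it is GTG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS on its coding strand, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is emitted
    as 'M' regardless of which amino acid it would encode internally.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append("M" if i == 0 else residue)
    return "".join(protein)


# First 20 bases of the gene sequence, as formatted in the record.
prefix = "GTGGCGGAGG TTTTTAGAAA"
print(translate(prefix))  # MAEVFR
```

The output matches the first six residues of the listed protein sequence (MAEVFR...). Passing the full gene sequence to `gc_percent` would give a figure to compare against the GC content row.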