Gene Information |
Locus tag | Pars_1897 |
Symbol | |
ID | 5055507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1703282 |
End bp | 1705918 |
Gene Length | 2637 bp |
Protein Length | 878 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469446 |
Product | cell wall anchor domain-containing protein |
Protein accession | YP_001154100 |
Protein GI | 145592098 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
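
The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1703282
END_BP = 1705918
GENE_LENGTH_BP = 2637
PROTEIN_LENGTH_AA = 878


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

The strand does not affect these checks, since the length of a span is the same on either strand.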
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTAA GAACAATCCT ATTGGCAATA CTAGTATTCG GTATAACGCT TTTGGCTCAA TACACGCCTC CAACCACAAA CCCCGGCCCC GCCGTTGAGA GGATAGTGGG TAAGTCTGTA CCAATAGCTC AGGCAGCGGC CGCGGTCAAG GCGGGAGATA TCGATGTGTA CATATTCGGA ATGAGAGCAT CCCTAGCGGC CCCGCTAAAG GGAGACCCCG CCCTCGCGTT GTACACGGCG GCTGCGGGCT TCAACGACAT AATCCTTAAC CCAGCCCCCG GCAAAGCGCC GTGCGAAAAC CCGTTCTCAG ACCGCGAAAT TAGGTTTGCC ATGCAGTTTG TGATCGACAG AGACTACGTA GCTAATGAAA TCTTCAAAGG CTTCGCGGTG CCCATGTATA TTTGGTTGTC GCAGTACGAC CCAACGTACT CGGTAGTGGC AGGTATTATA TCCCAGCTCG GCATCAAATA CGACCTAGAC TACGCCAGGT CGGTAGTGGA GAGAAGAATG CCCGCCCTCG GCGCCACCAA GGGGGCTGAC GGCAAGTGGT ATTGCCAAGG CAAGCCGGTG ACAATAATTG GCTTGATACG TGTTGAAGAC GAGCGTAAGG ACATCGGCGA CGCCTTCGCC TCGGCTCTGG AACAGCTAGG CTTCACAGTC GACAGGAAAT ACGTGACTTT TGATGTGGCG ATACAGACCA TCTACGGCAC AGATCCCGCC CAGTTCCAGT GGCACTTCTA CACAGAGGGC TGGGGCAAGG GTGCGCTGGA CAGATGGGAC ACCAGTTCGA TATCCCAGTA CTGTGCTTCT TGGTTCGGCT ATATGCCGGG TTGGGGCACC ACCGGGTGGT ATAACTACGC CAACGCCACC ATTGACCAGC TCACGGAAAA GTTGTATAAA GGCGCCTTCA AATCCTTCCA AGAATATATA GAGACGTACA GAAAGGCCAC ACTACTGTGT CTCAGCGAGT CTATAAGAGT TTTTGTGAAT ACTAACCTAA ACGTCTATGT GGCCTCGCAG CAACTCAAGG GAGTGACTGT GGATCTGGGC GCAGGTCTGA GGGCTTCTGT GTTCAATGCT CGTAATTGGT ACGTCCCAGG CAAGGACGTG GTTAACGTGG GCCACCTCTG GGTATGGACT GCATCAAGCG CTTGGAACCC AGTACCGCAG GGCGGATTCA CAGACGTTTA CTCGGTGGAT TGGTTCAGGA TGATGTACGA CCCAGCTATG TGGAACCACC CGTTCACAGG CGAGCCCATG CCCTTCAGGG TAACTTACAC TGTGGAGACC AAGGGGCCCG ACGGCGCCTT TGATGTGCCG GCTGACGCCT ACAGATGGGA CGCGAGACAA AAGGCTTGGG TTAGCGCCGC TGGCGCAAAG GCTAAGTCTA AGGTGGTCTT CAACTACGCC AAGTACACCA GCTCTAAGTG GCACCACGGC CAGCCCATAA AGCTGGCCGA CGTGATGTTC ATATACGCCT TCCTCTGGGA CATAGCCAAC GACCCCGCGA AAGTGGCTAG GGAGTCCGGA GTCGCCTCAT ACGTGAACAG CACGATGTCG CTAATAAAAG GCATAAGAGT GTTGAACGAC ACGGCGATAG AGGTATATGT GGACTACTGG CACTTCGATC CCAACTACAT AGCCTATCAG GCGGTGATTA CGCCAGATAT GCCGTGGGAG GTGTACTACG CAGTTGACCA GCTGGTGTAC GTAAAGCAGA CATATGCCGC TTCTAGATCA AGTGCTACAA AATACAACGT GCCTTGGCTC TCGCTGATAC TGAAAGACCA TGCCAAGGCA 
GTTGCCGACG TACTGCAAGG AGCTGCAGAT CAGGGCGCAT TTCCTGAAAG CTGGTTCAAG ATAGGGAACA AGTTACTGCT CACAAAAGAT GAGGCGCTGG CTAGGTACAA GGCAGCAGTG GACTGGTTTA ACAAGTACGG TCACATGGTA ATATCACAGG GGCCGTTCTA CCTATACGCC ATAGACACCG CGAAGCAGTA CATCGAGCTG AGGGCGTATA GAGATCCCAC ATATCCGTAT AAGCCCGGGG CGTTCTACTT CGGCGTGGCG ACGCCTGTGT CTGTAAAGGC TATAAACGTC CCCACGTTAG CTGTAGGCCA GTCTGGCACG GTCTCTGTGT CGCTTGAGGT GCCTCAGGGC GCCGGAAAGA TCTACTATAA GTGGGGCATA ATAGACCCGA CCACTGGCAA GTTCGTGTAT ATATCCGACG AAGCCTCCGC CACCACAACG CCTATATCCA TTACAGTGCC GGCAGACGTG ATGTCTAAGC TAACCGTCAA CAAGCCGTAT AAATTCTGGC TCTACGCATA CGCCGAGAAT GTGCCCGTAG TGGCAGAGGC GACGCAAGTA TTTGTGCCCA AGACAGCGGC CCCATCTCCT TCGCCAACAG CTCCGCCTCC AACTTCGCCG ACCACTCCAT CTCCTTCGCC AACAGCTCCG TCACCTTCTC CAACTACGCC CGGCGCCACA GCCACGACTG TGACCACGGT TGCGCCTGCA ACTGGAACTA CTGAGGCTCT TGCGGCCGGG GTTGTCGGAA TACTTGCCGT GTTGGCGGCG CTTGCCTTTG CGCTTAGGAG AAGAGGCGGA GGCGGCGAGG AGACAAAAAC TAGATAG
|
Protein sequence | MKLRTILLAI LVFGITLLAQ YTPPTTNPGP AVERIVGKSV PIAQAAAAVK AGDIDVYIFG MRASLAAPLK GDPALALYTA AAGFNDIILN PAPGKAPCEN PFSDREIRFA MQFVIDRDYV ANEIFKGFAV PMYIWLSQYD PTYSVVAGII SQLGIKYDLD YARSVVERRM PALGATKGAD GKWYCQGKPV TIIGLIRVED ERKDIGDAFA SALEQLGFTV DRKYVTFDVA IQTIYGTDPA QFQWHFYTEG WGKGALDRWD TSSISQYCAS WFGYMPGWGT TGWYNYANAT IDQLTEKLYK GAFKSFQEYI ETYRKATLLC LSESIRVFVN TNLNVYVASQ QLKGVTVDLG AGLRASVFNA RNWYVPGKDV VNVGHLWVWT ASSAWNPVPQ GGFTDVYSVD WFRMMYDPAM WNHPFTGEPM PFRVTYTVET KGPDGAFDVP ADAYRWDARQ KAWVSAAGAK AKSKVVFNYA KYTSSKWHHG QPIKLADVMF IYAFLWDIAN DPAKVARESG VASYVNSTMS LIKGIRVLND TAIEVYVDYW HFDPNYIAYQ AVITPDMPWE VYYAVDQLVY VKQTYAASRS SATKYNVPWL SLILKDHAKA VADVLQGAAD QGAFPESWFK IGNKLLLTKD EALARYKAAV DWFNKYGHMV ISQGPFYLYA IDTAKQYIEL RAYRDPTYPY KPGAFYFGVA TPVSVKAINV PTLAVGQSGT VSVSLEVPQG AGKIYYKWGI IDPTTGKFVY ISDEASATTT PISITVPADV MSKLTVNKPY KFWLYAYAEN VPVVAEATQV FVPKTAAPSP SPTAPPPTSP TTPSPSPTAP SPSPTTPGAT ATTVTTVAPA TGTTEALAAG VVGILAVLAA LAFALRRRGG GGEETKTR
|
| |