Gene Information |
Locus tag | Pars_1494 |
Symbol | |
ID | 5055386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1351403 |
End bp | 1352983 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640469036 |
Product | extracellular solute-binding protein |
Protein accession | YP_001153702 |
Protein GI | 145591700 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
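The reported gene and protein lengths agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information table.
start_bp = 1351403
end_bp = 1352983

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS should contain a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# The final codon is a stop codon and contributes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1581 0 526
```

The result matches the Gene Length (1581 bp) and Protein Length (526 aa) fields.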
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.69615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTGTAA TACTAGCAGT GTTGGCTGCT ATTTTGTTCA CCTCTCAGCC GCCTCCGCAG ACGCCTACCC CAACCCCACC TGCCACATCT TCACCAACCA CGCCTCAAAC CACCACGACC CCCGCTGAGA TTACTCTCAC CATAGGCGTT ACAGATAAGG TGACCGACTT AGACCCGGCG AATGCCTACG ACTTCTTCAC GTGGGAGGTC TTGTACAACA CAATGGCAGG CCTCGTACGG TATAAACCGG GGACTACGGA GATCGAGCCC GACCTTGCGG TGAGTTGGAC CACGTCGGAG GGGGGCAGGG TTTGGACATT TAAGCTGAGA CCTGGCTTGA AATTCTGCGA CGGCACCCCG CTTACGGCGC AAGACGTCAA GAGATCAATT GAACGCGCCA TGAAGATAAA CGGCGACCCC GCGTGGTTAG TGACCGATTT TGTTGAGAAG GTCGAAGCGC CAGACGACGC CACTGTTGTT TTCTACCTTA AAAAGCCCGT GTCGTATTTC TTAGCCCTCG TAGCGACCCC GCCATATTTC CCAGTCCATC CAAAATACGC ACCTGACAAG GTAGACTCTG ACCAGACAGC CGGTGGGGCG GGTCCCTACT GTATAAAGAA TTTTGTCAGA GACCAGCAGA TCGTGCTTGA GGCTAACCCC TACTACTACG GAGGCAAGCC CCAAGTCTCC AAGGTGGTGA TTAGGTTTTA CAAAGACGCC ACGACGCTGA GACTCGCCCT AGAGAGGGGC GAAATAGATC TGGCTTGGAG AACGCTTAAT CCGCCCGACT TGGAAGCCCT AAAAGCCTCC GGCAAGTACA AAGTTGTCGA AGTTCCCGGC TCTTTCATTA GATACATCGT CCTCAACCTG AACATGCCAG AGCTAAAAGA CGTGAACGTC AGGCGCGCCC TTGCCGCGGC GGTTTGTAGA AAAGATATCG CAACCGTGGT TTTCCACGGA ACTGTAACGC CGCTGTTTAC GCTCGTGCCT GAGGGAATGT GGTCTTCCTA CCCCGCTTTT AAGGAGAAAT ACGGCGACTG CAACACCGAC CTTGCTAAAC AACTCCTGCA ACAAGCCGGC TTCAGCCCCA GCAAGAAGCT CAATATCGAG CTGTGGTACA CGCCTACGCA TTACGGCGAC ACCGAGAAGG ACCTAGCTGC CATGTTGAAA GATCAGTGGG AGGCCACAGG CATAATCTCG GTTAGCGTAA AGTCTGCGGA GTGGGCCACA TACGTACAGC AGCTCAGAAG CGGGGCGATG ATGGTGTCGT TGCTAGGCTG GTACCCCGAC TACATAGACC CAGACGACTA CACCACGCCG TTTTTAAGAA GCGGGTCTAA TAAATGGCTT GGCAACGGGT ACAGCAATCC AACAATGGAC GATATTTTAG ACAAAGCCGC CCTCGAGCTT GACCAGACTA AGAGGGCTCA GCTGTACAAA GAAGCTCAGC TACTCTTAGC CGACGACGTG CCTATAATCC CGCTGATACA AGGCAAGCTG TTTATAGTCA CAAAGCCGAA TATACAAGTG GTAGTAGACC CCACGATGAT ACTTAGATAC TGGGCCATAA GGGTTTCTTA A
|
Protein sequence | MVVILAVLAA ILFTSQPPPQ TPTPTPPATS SPTTPQTTTT PAEITLTIGV TDKVTDLDPA NAYDFFTWEV LYNTMAGLVR YKPGTTEIEP DLAVSWTTSE GGRVWTFKLR PGLKFCDGTP LTAQDVKRSI ERAMKINGDP AWLVTDFVEK VEAPDDATVV FYLKKPVSYF LALVATPPYF PVHPKYAPDK VDSDQTAGGA GPYCIKNFVR DQQIVLEANP YYYGGKPQVS KVVIRFYKDA TTLRLALERG EIDLAWRTLN PPDLEALKAS GKYKVVEVPG SFIRYIVLNL NMPELKDVNV RRALAAAVCR KDIATVVFHG TVTPLFTLVP EGMWSSYPAF KEKYGDCNTD LAKQLLQQAG FSPSKKLNIE LWYTPTHYGD TEKDLAAMLK DQWEATGIIS VSVKSAEWAT YVQQLRSGAM MVSLLGWYPD YIDPDDYTTP FLRSGSNKWL GNGYSNPTMD DILDKAALEL DQTKRAQLYK EAQLLLADDV PIIPLIQGKL FIVTKPNIQV VVDPTMILRY WAIRVS
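The gene and protein sequences can be cross-checked against each other. The following is a rough Python sketch, not the database's own method: it strips the 10-character grouping, computes GC content, and translates the CDS. The function names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares. The start-codon handling is a simplification that treats only ATG, GTG, and TTG as initiators read as methionine.

```python
from itertools import product

# Amino acids listed in NCBI's T, C, A, G codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only these initiators are recognized.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text):
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a CDS, reading an initiator codon as M and stopping at a stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("GTGGTTGTAA TACTAGCAGT GTTGGCTGCT"))  # prints: MVVILAVLAA
```

The gene begins with GTG rather than ATG, which is why the first codon needs the initiator rule to give the leading M of the protein sequence. Passing the full Gene sequence field to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content fields.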
|
| |