Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1865 |
Symbol | |
ID | 5055882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1667542 |
End bp | 1668906 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640469411 |
Product | extracellular solute-binding protein |
Protein accession | YP_001154068 |
Protein GI | 145592066 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAA ACCTTCTTAT AGGCACAGTG GTAGTCCTAT TAGCCATTCT TGCCGCTGTC GGTTATATAA TGTCACAGTC ACAACCTGCA CAAACTCCCA CGACCCAATC ACCGCCGCCA GCTCCCTCAA CGCCGACGAC CACACCGAGC CCCACACAAA CCCCCACGGC TCCCCCCACA CCCACCTCAC AACCAACGAC AACTACGCCG ACTCCCCCCC CACAAACTCC CAGCCCCACG CCGAGCCCAA GCCCAACGCA ACCGCCTGCC CAGAAAGTGA TAATTAAGAT CTGGCACGCC CTAAACCCAG AAGAGGAGGC CGTGTTCAAG GAGATCGCCT CGCTGTATAT GAAATCAAAC CCAAACGTCC AGATAGTGTT TGAAAACAAG GCCCCCGACC TACAAACCGC CGTATTGGCG GCAGTGGCTA CAGGTGAGAA ATTCGACTTG TTCATATGGG CCCACGACTG GGTTGGGCTA ATGGTAGAGG CAGGGGTGTT AAAGCCTGTA GATAATGATG TGGCCGACGT GTTGCCTAAA TTCGCAGTGC CGATCCCGCA GTACCAAGGA CACGTATACG GTTTGCCCTT TACGGCTGAG ACAGTGGCGC TGATCTGCAA CAGGAAAATG GTGCCGGGGC CACCCAAGAC TTTTGATGAG CTACTCGGTA TTATGAAGAG TTACAACAAT CCGCCGAAGA CATACGGCAT AGCTTATGTG GTGAACCCAT ACTTCATATC AGCGTGGATC CACGGAGCCG GCGGCTACTA CTTTGACGAC AAGACAGAGA AGCAGGGACT AAACAACCCC AAGTCTATAG CGGGGTTTGA GTTCTTTAAG AAGTACGTAA TGCCATATGT TGGGCCCAAC CCCACGGACT ACAACACGCA AGTTAGCCTA TTCCTAGGCG GACAAGCCCC CTGCATGGTA AACGGGCCGT GGAGCATAGG CGCTGTGAAG AAAGCCGGCA TAGACTTCTT CGTGGCGCCA CTCCCGCCGG TGAACAACAC ATTTGTGCCC AGGCCGTACG GCGGATTGAA GATGTTCTAC GTCACGATCT ACGCCTCGAA AGAGGCTATC GACTTCATGA AGTGGTTCAC CACAGATCCC CAGGTGGCTA AAATCTTGGT GGACAAGCTG GGCTACGTAC CTGTTATAAA GGACGTCCAA ATTCAAGACC CAGTGGTACA AGGCTTCTAC GAAGCTATTA AGAACGTCTA CTTAATGCCC GTGTCTCCGA AAATGCAACC GGTGTGGGGC GCCGTCGATT TAGTAATACA AAACGCCATA GTGTCCGACC AAAAGACCAT CCCTCAGGCG ATCAACGACG CAGTTAAGGA TCTCTGCGCC AGAGGCCTCT GTTAA
|
Protein sequence | MNKNLLIGTV VVLLAILAAV GYIMSQSQPA QTPTTQSPPP APSTPTTTPS PTQTPTAPPT PTSQPTTTTP TPPPQTPSPT PSPSPTQPPA QKVIIKIWHA LNPEEEAVFK EIASLYMKSN PNVQIVFENK APDLQTAVLA AVATGEKFDL FIWAHDWVGL MVEAGVLKPV DNDVADVLPK FAVPIPQYQG HVYGLPFTAE TVALICNRKM VPGPPKTFDE LLGIMKSYNN PPKTYGIAYV VNPYFISAWI HGAGGYYFDD KTEKQGLNNP KSIAGFEFFK KYVMPYVGPN PTDYNTQVSL FLGGQAPCMV NGPWSIGAVK KAGIDFFVAP LPPVNNTFVP RPYGGLKMFY VTIYASKEAI DFMKWFTTDP QVAKILVDKL GYVPVIKDVQ IQDPVVQGFY EAIKNVYLMP VSPKMQPVWG AVDLVIQNAI VSDQKTIPQA INDAVKDLCA RGLC
|
| |