Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0998 |
Symbol | |
ID | 5054336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 890317 |
End bp | 891312 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640468555 |
Product | periplasmic solute binding protein |
Protein accession | YP_001153230 |
Protein GI | 145591228 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0851489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACGGCG TCGGAATTAG TCCCGCGTAT ATTAACCGTC GTTGTGCACA AAATTTAAAA ATGTACATAG CCATAATGCT CATGCAGTGG AGCAAGGCCC TTTTGTGGTC GGTGTTGGTC GTAGCGTTAG CGGTGGCGGC TATAGTAACC ATCTTAGCTT CTCAGCCACA GACTCAGCCA CCTGTTGTCT CTCAGACCCC GGTAAGGCCC AAGATTATAG TCAGCTTTCC TGCATATGAT AAGATTTTAG CCCAAGCATT TCCAGAGGCT GAAGTCGTCC TCTTGACAAA AGGGATCTCC GACCCCCACG AATACCAGCT AACTCCCCAA GATCTTCAAT TGCTAAGAAG CCTTACAGAT AAGGACGTCG TGGTGGATAC TATGCACGCA TCCTTTGAGC TGAAGATAGC GGAGATGGCC CAGAGGGGCG AGATAAAGGC AAAGGTGATA AAGACGCCCG ACTTCGAGAC GTACCTTACC TGGGACGGGA AAGAGGTTAA GCTAACCAGC TACGGACAAG AGCAGGGAGG CGTGAACATG CACAGCCACG GGCTCTACCC CCCAGACGTG TTGAAGCTAA TTGACGCTGT GTCCAGTGCC TCAGGGCTTA CTCCAAACGC CACATTTGTC AATGGGCTTA GGCAGCTACA GGACAAGTAC GCCGGGAAAC TCAGCGGCAA GGCTGTTGCA CTGACCCCCG CGGCCCAGTA CATATTATAC TGGCTAGGCT ACAGAGACAT CGCGGTGTTT ATAAAAGAGC CCGGGGTGCC TCCTTCACAA GAAGACGTAG CAAAGGCGCT CCAATACGCG AAAGAAGGAG CCCCAGTGCT GGCAGCTGTG GTAAGTGGTG AGGCTCTACG TGTCGTTGAT ATGTTCAAGA ATAAGGCAGA GGAGGCAGGC ATCAACGCGA AGGTTATAGT AGCGGACTTC TCAAAGGGCT ACCTAGAAGT TCTTAGAGAA GTGACAGAGC AGATAGCCAG GTCACAAGGG GGTTAA
|
Protein sequence | MHGVGISPAY INRRCAQNLK MYIAIMLMQW SKALLWSVLV VALAVAAIVT ILASQPQTQP PVVSQTPVRP KIIVSFPAYD KILAQAFPEA EVVLLTKGIS DPHEYQLTPQ DLQLLRSLTD KDVVVDTMHA SFELKIAEMA QRGEIKAKVI KTPDFETYLT WDGKEVKLTS YGQEQGGVNM HSHGLYPPDV LKLIDAVSSA SGLTPNATFV NGLRQLQDKY AGKLSGKAVA LTPAAQYILY WLGYRDIAVF IKEPGVPPSQ EDVAKALQYA KEGAPVLAAV VSGEALRVVD MFKNKAEEAG INAKVIVADF SKGYLEVLRE VTEQIARSQG G
|
| |