Gene Information |
Locus tag | Pars_2140 |
Symbol | |
ID | 5056178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1916517 |
End bp | 1917533 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469692 |
Product | ABC transporter periplasmic-binding protein |
Protein accession | YP_001154338 |
Protein GI | 145592336 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4143] ABC-type thiamine transport system, periplasmic component |
TIGRFAM ID | [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily |
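
The coordinate and length fields in this record agree with each other, and a short sketch can confirm it. The sketch assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1916517, 1917533)
print(length_bp, protein_length(length_bp))  # 1017 338
```

Both results match the Gene Length and Protein Length fields reported above.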
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCATAA GAAATTTATT GATTGTGCTC ATAGCGGTCG CTGGGGTGAT AGTGGCATTC CTAACGATAT CTGCTTTTCA GCAACCGCAG ACAAAGAAGC TCGTTATTGT CGGCCCCGCG GGCATAAGCG ACTTAGGGCA GGCGTTGGCG AAAAAGTTCA GTGAGAAGTA TGGGGTGAAC GCCACGTTTA TCCCACTTGG TGGGGCGGTA GAGATGGTGA ACGAGCTGGT GAGGAACAAG GACAACCCAC CATGGGACGT CGCCATTGGG ATTCCGGAGT TTTACTACAC TGTGTTAGTG GAAAGGGGTG TTTTACACTG TCCCAAGTTG TCTGTGGAGG GAGTCCCCCC CGAGGAGTTT TGGGATCCCA ACGGTTGCGT GTACCCGCTG GATAAGTCCT ACATCGGCAT TGTATACAAC GCCACTGCCC TCGAGAGGCT TGGGCTAAGG CCTCCAGAGA CGCTGGACGA CTTACTTAGG CCGGAGTACA AGGGGCTAGT AACATATCCC AACCCAGTCC AGTCGGGCAC CGGGCTTGCC GTCCTCTCGT GGATAATGTC AGTAAAGGGA GAAGAGGCAG GGTGGACATA TTTGAAACAG CTCAGCGGCC AAATAGCCAA GGTGGGGTAT CCCAGCGGCT TTACGGCTTT GAGAAGCGCC CTAAAACGTG GCGACGTGGT TATAGCGCTT TCATGGTACA GCCACGTGAT TGATCCTGGA ACCCCTCACA TGAGGGCTGC CACTTACAGC GCCTTTTTGT ATAGAGAGGG CGTGGCAGTG CTTAAAAATG CCAAAAACCG CGACTTAGCC GTGGAATTTG TTAAATTCGC CCTAAGCAAA GAGGGGCAGG ACCTAGTAGA TCCTTACAAC TACATGCTCC CCGTCAGGGC AGACGCCGTG GTGAAGAACA ACGTGGGCCT TCCCCAGCCG CGGTCGGTGG TGGTCTACAA CCCTGCCCTC GGCTCCAAGG CAGACGAGTG GAGGCTGAGG TGGCAGAGGG AGGTCGCTTC AGGTTAA
Protein sequence | MSIRNLLIVL IAVAGVIVAF LTISAFQQPQ TKKLVIVGPA GISDLGQALA KKFSEKYGVN ATFIPLGGAV EMVNELVRNK DNPPWDVAIG IPEFYYTVLV ERGVLHCPKL SVEGVPPEEF WDPNGCVYPL DKSYIGIVYN ATALERLGLR PPETLDDLLR PEYKGLVTYP NPVQSGTGLA VLSWIMSVKG EEAGWTYLKQ LSGQIAKVGY PSGFTALRSA LKRGDVVIAL SWYSHVIDPG TPHMRAATYS AFLYREGVAV LKNAKNRDLA VEFVKFALSK EGQDLVDPYN YMLPVRADAV VKNNVGLPQP RSVVVYNPAL GSKADEWRLR WQREVASG
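
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming the blocks contain only sequence letters and whitespace, that joins the blocks and computes a GC percentage like the one reported in the GC content field. The helper names are hypothetical, and the short example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence shown above.
example = clean_sequence("ATGTCCATAA GAAATTTATT")
print(len(example), round(gc_percent(example)))  # 20 20
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared against the GC content field.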