Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0149 |
Symbol | |
ID | 5056072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 139264 |
End bp | 140406 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640467728 |
Product | extracellular solute-binding protein |
Protein accession | YP_001152416 |
Protein GI | 145590414 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.489376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACTA GGAGGACACT CCTCGTGGCG GTCGGGGCGG CTGCGGTCCT GGCAGTAGCC GGGGGGTATA TCTACCTGGC ACAGCAACAG GGCGGGGCGG TGCCCACTCA GACAACGCCG CGGACAACGA CCGCGGCGGG GAGGAAGCTG GCTGTGTATA ACTACTCGTA CTACATCGAT AAGGCGCTTC TCGACGATTT TAAAAAGGAG TACGGCATTG AGGTGATATA CCAAGAGTAT GAAAGCGGCG AGGAGGCGTA CGCCGCGTTG TTGAGGGGAG GCGGCGGATA CGACTTAATA GTAGTTCCCG ATACGTACAT CAAGGAGGTG ATAGGGAAGG GCTATGTGAG GAAGATCGAC CACGCCAAGC TTTCCAACTT CGCCAACGTA GACCCTGTCT TCTTCCAGAA CCCCAACGAC CCCGGCCTCC AGTACTCGGT GCCGTACGCC TATGGAACCA CCGGCATTGC GGTGAACTAC TACGACATGA AAGCCGACGT TGGGAAAATA GAGAGCTGGG GCGACCTCTT TGACGAGACC AAGCTGGAGA AGGTCAAGGG GAGGATAGCC ATGTTGGAGG AGTTCGTGGA GCCCATAATG GCGGCGAAAT ACGCGCTGGG CATAGACCCA GACGACTGGA GCGACGACGC AGTCAACAAG ATAGTAGACC TCCTAAAGCG GCAAAAGGAG TACATCAGGG GGTACATGGG CATTAGCCAG ATTGTCCCCG TAATAGCCGC CGGGGAATTG TGGATATCCC AGATCTGGTC TGGCGACGCT CAATACGCCA AGGAGGAGTT TATAAAGAGG GCGGGGGAGG CAAACGCCGA CAAGTTCCAG TACGTATTGC CGAAGCCTAT GACCCACCGC TGGGTGGACT TCATGGTCAT CCCCCGCGAC GCTAAAAACG TCGAGGAGGC CTACCTCTTC ATGGACTTCT TACTGAGGCC TGAGAACTCG GCGAAGATAG TCGAGGCTAC GTACTACCCC ACCTCCTTGA AGAAGCAGCT ACTTGAGAAG TACGTGGACC CCAACCTGCT CAACGCCATA ACCCCGCCGG AGACAGCCAA GGTCATATAC CTAAACTACA CCGAGCAGAT GCTTAGGGCA ATTGAGAGGA TTAGCTACGC GGTTAAGGGC TGA
|
Protein sequence | MVTRRTLLVA VGAAAVLAVA GGYIYLAQQQ GGAVPTQTTP RTTTAAGRKL AVYNYSYYID KALLDDFKKE YGIEVIYQEY ESGEEAYAAL LRGGGGYDLI VVPDTYIKEV IGKGYVRKID HAKLSNFANV DPVFFQNPND PGLQYSVPYA YGTTGIAVNY YDMKADVGKI ESWGDLFDET KLEKVKGRIA MLEEFVEPIM AAKYALGIDP DDWSDDAVNK IVDLLKRQKE YIRGYMGISQ IVPVIAAGEL WISQIWSGDA QYAKEEFIKR AGEANADKFQ YVLPKPMTHR WVDFMVIPRD AKNVEEAYLF MDFLLRPENS AKIVEATYYP TSLKKQLLEK YVDPNLLNAI TPPETAKVIY LNYTEQMLRA IERISYAVKG
|
| |