Gene Information |
Locus tag | Pars_0351 |
Symbol | |
ID | 5054714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 301359 |
End bp | 302618 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640467924 |
Product | sulfatase |
Protein accession | YP_001152611 |
Protein GI | 145590609 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
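The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and that the annotated span includes the terminal stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 301359
END_BP = 302618


def gene_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS.

    ASSUMES the gene span includes one terminal stop codon,
    which is not translated into a residue.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1260, matching "Gene Length"
print(protein_length(length))  # 419, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.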
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAAT ACAACGTCGT CCTCATCGTC CTCGACACCC TAAGGGCCGA CCACGCCCAG GGCCTAGACA AGCTACTCGA CCTCGGCTTC GTCAAATACG AAGACGTCTA CGCCACCGCC CCCTGGACCC TCCCCAGCCA CGCCTCCATG TTCACCGGCA TGTACCCCTC CGAACACGGG ATACACGAGA CAAGGGAATA CCAGTTGGAT GTAGCCAAGA TTGCCAGGTT GCGCATGGCT AAGCTAAACG GCGGCATATT GGGCCAGCTA AAAGAGGAAG GATACAACAC GTATCTTATA TCGGCGAATC CAATAGTCTC AAGCAACTTC GGCTTTAACG CCGACTACGA ATACATCATA GATCCCATAT ATACCTTGCT CATTACGTCA ATCGACATAA TACTAGATAA AATCTATGCC GAGAGCGGCT CCAGAGCAAA GGTATTATCA AAACTAATTG AAGAACGTAG ACTCGATATG TTACTTCATG GGATCAAGAT ATTTGTAGAA AGAAGAATTC GCGTAATCCC AAAATATCTT TCTGAAAAGG CGACCAGAAA TAAGGGAGGC GGAAAAATTG TAGGGTTACT AGGGAGATTA AAATTGGAAA CTCCATTTTT CCTCTTTGTC AACATAATGG AAGCACATGA CCCCTATAAT AAACCACTCG TTGATAGGCG TAGGCTGAAA TACATCGGCA AGTGGCTCGC AACAGGGTTG ATAGACCCCG AAGCGGTGAG GTTGTGGCGG AACTACCCGG CCCACGCCGA GGAAGCCGTC AAGAGGGCCC TGGAGGCCGT GGAGACGCTG AAGGCGAGGG GCTACTGGGA CGATACACTG ATAATCGCGA CGTCCGACCA CGGGGAGCTA CTGGGAGACG GCGGGCTCTA TCACATCTAT TCGCTCCTCG ACGGGAACCT CCGGGTCCCC CTCTACGTCA AGTACCCAGG GAAGCCCAAG AAGCAGAGAG GTCCCATCAC GCTCGCCGAC GTGCCCCGGC TGATCGACCC CTCGGCGGAG GAGGTAGGGC GCCCCCTAGT CATGGCGGAA ACGTTCGGCA TAAGCTCTCC GCCTAAAGCC CTCGGCATAG AGCCGGAGGA GAGGTTCTTC CACCACAAGA TAAGGGTAAT CGGCCGCAAG CTCGACTTCA TATACGACGC AACGGCCGGC GTCGTGGAGA GGGTCTTCCG CGGCGATAAG GAGGACGCGG CGAGGCTGCT CGAGGACGCG GGGGCCAAGC TCGGCGGCCG TAGTATATAA
Protein sequence | MRKYNVVLIV LDTLRADHAQ GLDKLLDLGF VKYEDVYATA PWTLPSHASM FTGMYPSEHG IHETREYQLD VAKIARLRMA KLNGGILGQL KEEGYNTYLI SANPIVSSNF GFNADYEYII DPIYTLLITS IDIILDKIYA ESGSRAKVLS KLIEERRLDM LLHGIKIFVE RRIRVIPKYL SEKATRNKGG GKIVGLLGRL KLETPFFLFV NIMEAHDPYN KPLVDRRRLK YIGKWLATGL IDPEAVRLWR NYPAHAEEAV KRALEAVETL KARGYWDDTL IIATSDHGEL LGDGGLYHIY SLLDGNLRVP LYVKYPGKPK KQRGPITLAD VPRLIDPSAE EVGRPLVMAE TFGISSPPKA LGIEPEERFF HHKIRVIGRK LDFIYDATAG VVERVFRGDK EDAARLLEDA GAKLGGRSI
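The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a hedged illustration, not the pipeline that produced this record: it builds the codon table in the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard table and differs only in permitted start codons), strips the 10-base spacing used in the record, and stops at the first stop codon. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(dna: str) -> str:
    """Remove the whitespace used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 21 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGCGTAAAT ACAACGTCGT C"))  # MRKYNVV
print(translate("CG TAGTATATAA"))            # RSI
```

The two fragments translate to the first seven and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.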