Gene Pars_1897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1897 
Symbol 
ID5055507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1703282 
End bp1705918 
Gene Length2637 bp 
Protein Length878 aa 
Translation table11 
GC content54% 
IMG OID640469446 
Productcell wall anchor domain-containing protein 
Protein accessionYP_001154100 
Protein GI145592098 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAA GAACAATCCT ATTGGCAATA CTAGTATTCG GTATAACGCT TTTGGCTCAA 
TACACGCCTC CAACCACAAA CCCCGGCCCC GCCGTTGAGA GGATAGTGGG TAAGTCTGTA
CCAATAGCTC AGGCAGCGGC CGCGGTCAAG GCGGGAGATA TCGATGTGTA CATATTCGGA
ATGAGAGCAT CCCTAGCGGC CCCGCTAAAG GGAGACCCCG CCCTCGCGTT GTACACGGCG
GCTGCGGGCT TCAACGACAT AATCCTTAAC CCAGCCCCCG GCAAAGCGCC GTGCGAAAAC
CCGTTCTCAG ACCGCGAAAT TAGGTTTGCC ATGCAGTTTG TGATCGACAG AGACTACGTA
GCTAATGAAA TCTTCAAAGG CTTCGCGGTG CCCATGTATA TTTGGTTGTC GCAGTACGAC
CCAACGTACT CGGTAGTGGC AGGTATTATA TCCCAGCTCG GCATCAAATA CGACCTAGAC
TACGCCAGGT CGGTAGTGGA GAGAAGAATG CCCGCCCTCG GCGCCACCAA GGGGGCTGAC
GGCAAGTGGT ATTGCCAAGG CAAGCCGGTG ACAATAATTG GCTTGATACG TGTTGAAGAC
GAGCGTAAGG ACATCGGCGA CGCCTTCGCC TCGGCTCTGG AACAGCTAGG CTTCACAGTC
GACAGGAAAT ACGTGACTTT TGATGTGGCG ATACAGACCA TCTACGGCAC AGATCCCGCC
CAGTTCCAGT GGCACTTCTA CACAGAGGGC TGGGGCAAGG GTGCGCTGGA CAGATGGGAC
ACCAGTTCGA TATCCCAGTA CTGTGCTTCT TGGTTCGGCT ATATGCCGGG TTGGGGCACC
ACCGGGTGGT ATAACTACGC CAACGCCACC ATTGACCAGC TCACGGAAAA GTTGTATAAA
GGCGCCTTCA AATCCTTCCA AGAATATATA GAGACGTACA GAAAGGCCAC ACTACTGTGT
CTCAGCGAGT CTATAAGAGT TTTTGTGAAT ACTAACCTAA ACGTCTATGT GGCCTCGCAG
CAACTCAAGG GAGTGACTGT GGATCTGGGC GCAGGTCTGA GGGCTTCTGT GTTCAATGCT
CGTAATTGGT ACGTCCCAGG CAAGGACGTG GTTAACGTGG GCCACCTCTG GGTATGGACT
GCATCAAGCG CTTGGAACCC AGTACCGCAG GGCGGATTCA CAGACGTTTA CTCGGTGGAT
TGGTTCAGGA TGATGTACGA CCCAGCTATG TGGAACCACC CGTTCACAGG CGAGCCCATG
CCCTTCAGGG TAACTTACAC TGTGGAGACC AAGGGGCCCG ACGGCGCCTT TGATGTGCCG
GCTGACGCCT ACAGATGGGA CGCGAGACAA AAGGCTTGGG TTAGCGCCGC TGGCGCAAAG
GCTAAGTCTA AGGTGGTCTT CAACTACGCC AAGTACACCA GCTCTAAGTG GCACCACGGC
CAGCCCATAA AGCTGGCCGA CGTGATGTTC ATATACGCCT TCCTCTGGGA CATAGCCAAC
GACCCCGCGA AAGTGGCTAG GGAGTCCGGA GTCGCCTCAT ACGTGAACAG CACGATGTCG
CTAATAAAAG GCATAAGAGT GTTGAACGAC ACGGCGATAG AGGTATATGT GGACTACTGG
CACTTCGATC CCAACTACAT AGCCTATCAG GCGGTGATTA CGCCAGATAT GCCGTGGGAG
GTGTACTACG CAGTTGACCA GCTGGTGTAC GTAAAGCAGA CATATGCCGC TTCTAGATCA
AGTGCTACAA AATACAACGT GCCTTGGCTC TCGCTGATAC TGAAAGACCA TGCCAAGGCA
GTTGCCGACG TACTGCAAGG AGCTGCAGAT CAGGGCGCAT TTCCTGAAAG CTGGTTCAAG
ATAGGGAACA AGTTACTGCT CACAAAAGAT GAGGCGCTGG CTAGGTACAA GGCAGCAGTG
GACTGGTTTA ACAAGTACGG TCACATGGTA ATATCACAGG GGCCGTTCTA CCTATACGCC
ATAGACACCG CGAAGCAGTA CATCGAGCTG AGGGCGTATA GAGATCCCAC ATATCCGTAT
AAGCCCGGGG CGTTCTACTT CGGCGTGGCG ACGCCTGTGT CTGTAAAGGC TATAAACGTC
CCCACGTTAG CTGTAGGCCA GTCTGGCACG GTCTCTGTGT CGCTTGAGGT GCCTCAGGGC
GCCGGAAAGA TCTACTATAA GTGGGGCATA ATAGACCCGA CCACTGGCAA GTTCGTGTAT
ATATCCGACG AAGCCTCCGC CACCACAACG CCTATATCCA TTACAGTGCC GGCAGACGTG
ATGTCTAAGC TAACCGTCAA CAAGCCGTAT AAATTCTGGC TCTACGCATA CGCCGAGAAT
GTGCCCGTAG TGGCAGAGGC GACGCAAGTA TTTGTGCCCA AGACAGCGGC CCCATCTCCT
TCGCCAACAG CTCCGCCTCC AACTTCGCCG ACCACTCCAT CTCCTTCGCC AACAGCTCCG
TCACCTTCTC CAACTACGCC CGGCGCCACA GCCACGACTG TGACCACGGT TGCGCCTGCA
ACTGGAACTA CTGAGGCTCT TGCGGCCGGG GTTGTCGGAA TACTTGCCGT GTTGGCGGCG
CTTGCCTTTG CGCTTAGGAG AAGAGGCGGA GGCGGCGAGG AGACAAAAAC TAGATAG
 
Protein sequence
MKLRTILLAI LVFGITLLAQ YTPPTTNPGP AVERIVGKSV PIAQAAAAVK AGDIDVYIFG 
MRASLAAPLK GDPALALYTA AAGFNDIILN PAPGKAPCEN PFSDREIRFA MQFVIDRDYV
ANEIFKGFAV PMYIWLSQYD PTYSVVAGII SQLGIKYDLD YARSVVERRM PALGATKGAD
GKWYCQGKPV TIIGLIRVED ERKDIGDAFA SALEQLGFTV DRKYVTFDVA IQTIYGTDPA
QFQWHFYTEG WGKGALDRWD TSSISQYCAS WFGYMPGWGT TGWYNYANAT IDQLTEKLYK
GAFKSFQEYI ETYRKATLLC LSESIRVFVN TNLNVYVASQ QLKGVTVDLG AGLRASVFNA
RNWYVPGKDV VNVGHLWVWT ASSAWNPVPQ GGFTDVYSVD WFRMMYDPAM WNHPFTGEPM
PFRVTYTVET KGPDGAFDVP ADAYRWDARQ KAWVSAAGAK AKSKVVFNYA KYTSSKWHHG
QPIKLADVMF IYAFLWDIAN DPAKVARESG VASYVNSTMS LIKGIRVLND TAIEVYVDYW
HFDPNYIAYQ AVITPDMPWE VYYAVDQLVY VKQTYAASRS SATKYNVPWL SLILKDHAKA
VADVLQGAAD QGAFPESWFK IGNKLLLTKD EALARYKAAV DWFNKYGHMV ISQGPFYLYA
IDTAKQYIEL RAYRDPTYPY KPGAFYFGVA TPVSVKAINV PTLAVGQSGT VSVSLEVPQG
AGKIYYKWGI IDPTTGKFVY ISDEASATTT PISITVPADV MSKLTVNKPY KFWLYAYAEN
VPVVAEATQV FVPKTAAPSP SPTAPPPTSP TTPSPSPTAP SPSPTTPGAT ATTVTTVAPA
TGTTEALAAG VVGILAVLAA LAFALRRRGG GGEETKTR