Gene Pars_0734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0734 
Symbol 
ID5055160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp650948 
End bp654016 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content55% 
IMG OID640468292 
Productglycoside hydrolase family protein 
Protein accessionYP_001152972 
Protein GI145590970 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.983175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.257164 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGGG CATTGTTGGT ACTAGTTCTG TTAGTTGTAG TTGTTTCGGC TCAAAATGTT 
GTGTTTGTGT GGCATATGCA CCAGCCCCCG TATTATATAC CGAAAGAGTC TGTCCCCACG
GATACGGGCA AGGCGGTTGT TGAGGCCCCC TGGGTGAGGC TTTGGACCGC CAAGGCCTAC
TACCCCATGC TCCTGCTCGT GGAGGAGACC GGGGTTAAAG TCACGTTTGA CATAACGCCT
ACGTTGCTCG AGCAGATAGA GATGTATGCC CAGGGCAGGC TGACGGACAA GTATCTGCAA
ATATCTTTGA AAAAAGCAGA GGACCTAACG GAGGAGGAGA AATCCTTTAT ATTAGAGCGG
TTCTTCGACA TTAGCTGGGA GGTGCAAATT CCCAAATTTC CGAGATATCA ATGCCTTCTT
GAAAAGAGGA ACAACGCGCC GGGAGCTTTC GACACCGCCG ACTACCGGGA TCTCCAAGTT
CTGTTCAACC TGGCTTGGAT AAACGAGAAG CTCCTCAACG AGGACCCAGA CCTACGCCCA
ATATATCAAA AAGCTAAGGG GAGCGACTGC GACACGCACT TCACAGACGG GGACAAGATG
GCCGTGCTTG CAAAACACCT GCAGTACCCC AAGGCGTTTC TAGACAAGTT GGCGCAGCTG
TACAAGAAGG GGCAAATAGA GGTGATAATG ACCCCCTACT ATTACCCCAT AGCCCCCCTC
GTGGAGAACA CCAACAACGC CCTGTCCACA GACCCGGGAA TTATAACGCT ACCTAAGCCC
TTCGCACATC CAGAAGACGT CTCCGCGCAG GTACAGCTGT CTCGGCACAA GTTCAAGACC
TTCTTCAAGG CCGAGTTGTT GGGAATATGG CCGCCGGAAC TCGCCGTAGA TGACCAATTC
ATACAAATCC TCGCCCAGAG CGGCATAAAA TACACCGTAG CCGACCAAGT AGCGCTGGAG
CGCTACCTGG GGAGACAGCC GACGCAGGAG GAGCTCTACA CGCCGTGGGA GAGATACGGA
GTGCTAATCT TCTTCAGAGA TAAAGAGCTG TCGGACTGGA TAGGCTTCAC AGGCTCTTCA
ACTTCTAGGC AGTACGGCGA AAAATATGCC GCTGAGCAAT TCCTCTCGCT CCTCGCTGGC
AAGGCTCAGG GCGGCCTCGT GGTGATAGCC CTAGACGGCG AGAACCCCTG GGAGTGGTAC
CCCAACGACG GCTACGTCTT CCTCACCGAG ATATACAAAG CAGTGAAAAA TAGTACAAAA
ACACTTAGAG AAGCAACAAG TTCGGCGACG CCGCGGAGAT CAGAGTCGCC GCTACCGACG
TCAAGCTGGG CCGGCGGAAG CCTCTCTGTG TGGGTAGGCG AGTGGGAGGA GAACTTGGCG
TGGAGGATAC TAGAAGGGGC GCGGCAAGTT GCGAAAAACA AGGCGTGGCA ACAGCTACTC
TACCCAGCAG AGGCGGGCGA CTGGTTCTGG TGGTACGGCC GCGACAGGGA AAGCCCCCGA
GAAGAGATCT TCGACTCACT GTTCCGCGAA ATTGTGAAAA AGTTCTACGA GAAAATAGGG
CTTACATACA ACTATACTTG GCCCCTAGAC GAGCCCATAC ACTACAGAAC CGCCTACACG
GTAGACTGGG CAGGTGCCCC ATTCAGAAGG CTTATCATCA GCGAAGTGGA GAAGATAAAC
ATAACAGTGG AGGTTTACTC CCAGTCTGCA AACACCGCGC AACATGGGAG AGCCCCCGGA
ATTAGGGTAA TCGCCCACTG GGGCCCAGTG GCCTACTGGG GCGGGCCGTG GGGAGATCTC
TACTTCGCCC CAATGACATA CGCAGGAGAT GTGGGTAACA ACGACCTCTA CGTCCTTACG
CCCAAGCTCG CCCCGGGGAG GTATGAGTTT ACATTCATAG CACAAGGCTC CAACGAAGTC
TTCGCCACGG CGCTTGGCCA AAACTACCAA GTCGAGGTAC TGCCTAGAAC AGACGGCCGC
GTCTGCGGCG TAGAGCTCAA GCGAGTGGAG GTATACGACG GCGGTCACCT CCTTGCCGTG
TTCACGGGAA ACGCCTTGGC TTACGTAGGA AACGTCATTA AAGCTTACTA CGAGGTCTGC
GGCGAGGGGC CAATCTACGT AGCGGCGTCG ATGGCCCTTA TCCGCCAGGA CTCAAGCGCG
CCCTGGGACG AGGTCTTCGT CCTAGCCAGC CCCACGGCGG AGGGGCTCTA CGTGGCTGAG
TTTAGGGTAA ACTACAGCGG CGTGTTTGAG CTGAGGGCCA AGGCCATGGG AGGCAATGTG
TATTACTCGA CGCCTGTGTA CATAAGAGTA GAGGGAGGGC CCGGCCCAAG AGTTGTGGAC
GGCAGAGAAG ACGACTGGGT AGGCCAGCCT CCCTTGCAGA CGCCTGGCGC CGCGGTGAGC
ATGTACGAGC TAATAGTGAC AGATCCACAA GACGACCAAT ACAGGTTCTA CCGCCCCGAT
TGGAGCTGGC CGCCGACAGA AGACCTAGAC GCCGTAGAGC TGAGGCTCTA CTCAGACGGC
CAAAACCTCT ACGGTCTGGT AAAACTCAAG CAGCTGAGCA ACATATACGC GCCGTATGTA
ATAATAGCCA TAGGCGTACC GGGCGGCGGC TTCAGCGAGT GGCTACCAGA CTGGAGCGAC
ACAAGGCTCG CGTTTAAGTG GGATTACATT ATAGGCATAA ACTACGGCAA AGGCTCCCCA
CTATTCCTCT TCGACCACGA TTGGTCGCCC AGGCCCACCG GTCAAATCGC CAGGTCTGGC
AACGTAATAG AATTCGCAAT ACCCCTAGAC CAGCTACCCC TCCTAAAAAC AGCCGCCGAG
ACTTACATAA CGGCGGTAGT CTTTGCTAAT AACTACGGAG GCGTATGGGA CCCCGGCAAG
AATAACGCAT ACGACCCAGT GAGCGGCAGG TACATAACCG AGGACAACTA CGCGTCTAAC
GTATATGATG TATTCGGCCA GGCACCTACC TCAGCCGAGG TCTACGGAGG CTGGGACGGC
GGCGACCAAA CAGTAGATTT CTACGTGAGA GCTGGCCTAG ACAACGGGAG AATAGTAGCG
GTTACCTAA
 
Protein sequence
MSRALLVLVL LVVVVSAQNV VFVWHMHQPP YYIPKESVPT DTGKAVVEAP WVRLWTAKAY 
YPMLLLVEET GVKVTFDITP TLLEQIEMYA QGRLTDKYLQ ISLKKAEDLT EEEKSFILER
FFDISWEVQI PKFPRYQCLL EKRNNAPGAF DTADYRDLQV LFNLAWINEK LLNEDPDLRP
IYQKAKGSDC DTHFTDGDKM AVLAKHLQYP KAFLDKLAQL YKKGQIEVIM TPYYYPIAPL
VENTNNALST DPGIITLPKP FAHPEDVSAQ VQLSRHKFKT FFKAELLGIW PPELAVDDQF
IQILAQSGIK YTVADQVALE RYLGRQPTQE ELYTPWERYG VLIFFRDKEL SDWIGFTGSS
TSRQYGEKYA AEQFLSLLAG KAQGGLVVIA LDGENPWEWY PNDGYVFLTE IYKAVKNSTK
TLREATSSAT PRRSESPLPT SSWAGGSLSV WVGEWEENLA WRILEGARQV AKNKAWQQLL
YPAEAGDWFW WYGRDRESPR EEIFDSLFRE IVKKFYEKIG LTYNYTWPLD EPIHYRTAYT
VDWAGAPFRR LIISEVEKIN ITVEVYSQSA NTAQHGRAPG IRVIAHWGPV AYWGGPWGDL
YFAPMTYAGD VGNNDLYVLT PKLAPGRYEF TFIAQGSNEV FATALGQNYQ VEVLPRTDGR
VCGVELKRVE VYDGGHLLAV FTGNALAYVG NVIKAYYEVC GEGPIYVAAS MALIRQDSSA
PWDEVFVLAS PTAEGLYVAE FRVNYSGVFE LRAKAMGGNV YYSTPVYIRV EGGPGPRVVD
GREDDWVGQP PLQTPGAAVS MYELIVTDPQ DDQYRFYRPD WSWPPTEDLD AVELRLYSDG
QNLYGLVKLK QLSNIYAPYV IIAIGVPGGG FSEWLPDWSD TRLAFKWDYI IGINYGKGSP
LFLFDHDWSP RPTGQIARSG NVIEFAIPLD QLPLLKTAAE TYITAVVFAN NYGGVWDPGK
NNAYDPVSGR YITEDNYASN VYDVFGQAPT SAEVYGGWDG GDQTVDFYVR AGLDNGRIVA
VT