Gene Pars_1866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1866 
Symbol 
ID5055880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1668938 
End bp1671940 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content54% 
IMG OID640469412 
Productglycoside hydrolase family protein 
Protein accessionYP_001154069 
Protein GI145592067 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCGG TCCTAGCGCT TCTACTAGCC GCAGTGGCTA TCCTGGCAGG GCCTGTTAAT 
GTCGTGTTTA TCTTGCACAA CCACCAGCCT TGGTACGTCG ATTTTGAGAA AAATGAACTC
ATACTCCCAT GGGTCAGGAT GCACGCGGTT GGCAACTATT TGAAAGTCCC CCTCTTGATT
AACGAAAGCG GAGTCCCTGT GGCCTTCACA CTCTCAGGAA GCTTAATCGA ACAGCTCAAC
TGGTACGCCA ACGGGACCTA CATAGACGTG AGGTATCGCA TATCGGAGAA AATCGCCCGG
GGCGAGCCGC TGACGGCTGA GGAAAAGTAC GCAATGCTGG CAATACCAGG CGGTTTCTTC
GATATTAACT GGCAGAACAT ACTGTACAAA CACCCGCGGT ATGCCGTCCT CCTGGGTCTA
AGGAACGATG CATTTAGCAA ATGCCCTCCT GGCAACGTTA CGTGTGTAAT ATCTAGGTTC
TCCGACCAAG ACTTTGTCGA CCTAGCCACC TTGTTTAATC TACTCTGGAT TGACCCCTAC
ATCGCGAAGA AGTATTCCGA CATCTGGACA CTGGTAAACA AGACGTCGTA TACCCGCGAC
GATTTGAAAA AGGTTTTGGA CTTACACAGA GAGCTTGTAG GCAAGGTCTT GTCTCTCTAC
AATGAGTTGG CACGACAAGG CAAAATAGAG CTCGTCCCAG TCCCCTACTC TCACCCCCTC
ATGCCGTTGC TGGCGGACAT GGGCGCTCTG GAAGACCTAA GACTCCACAT ATCCCTCTCT
GAGGGGCTGT TCAAGAGATA TTTGGGAACC ACGCCGACAG GTGTGTGGCC ACCCGAGCAA
GCCGTAAATG ACGAAGTGCT AAGGCTCTTC GCCGAGGCTG GCTACTTGTG GACGGTCACA
GACGAAGATG TCCTAAAAAC CACGGCCCCT GGCTTCAGTC ATTACAGCCT TTACTATGCT
GACTACGGCG GGCGCCGGCT CTACGTCTTT TTCAGAGACA AGACGTTGTC AGACGCGCTG
GGCTTCAGAT ACTCGTCTAT GTCCCCCCAG GTCGCCTTAG CGGATTTTGT AAACTACCTG
AAGAAAGTCC CCCAGGGCGA GTGCAACGTG GTAGTGGTGG CGCTTGACGG AGAAAACCCA
TGGGAGAACT ACCCCAACTT CGGCGACGAC TTTCTGAAGA CGTTCTTCCA AGGCCTGGCG
GAGCTAGAGA AAAACGGTAC TGTGAAAATA TGGAAGCCTA CAGATTTTGT GAACCACTGC
AAGGACAAGG CCACGCCTCT CCCCCAGCGC CAGTTTAGAT ACTTCGACTT GAAGGTAGAC
ATATCGATTT ACAAATCTAT AAGAGATCTC CCCACCCGCC CAGTGGCTGG CAAAATAGCG
GAAGGCTCTT GGTCAGGCGG AGGAAGCCTT GCAGTGTGGA TAGGAGACCC AGACGAAAAC
GTCTGGTGGA TGTGGCTGAA AAAGGCAAGA GAAGACGTCG GCATAAACCG GACGTGGGAT
GTGCTCTTCC CACTACTCGT TGCAGAAGCG TCGGATTGGC CGTTTTGGTA CGGCAACGAC
ATGGGCTCGC CGCAGACCTT CGACCCAGTG GCTAAGTCCG CGCTTAGGTC ATACTACACG
CGCGCTGGTC TGCAACCCCC ACAGTATCTC CAGACATCTG CATACCCCGC CGGTACTCCT
CGCGAGGATA AGATAGTGGG CAGGGGAGAG GGCAAAATCG CCATGTACGG CGGAGCTGTA
ATATACGCAA ATACAACCCA TATATGGATC GAGGGAGGGC CCTGCGGAGT TGTCTACATA
TCTAACCCAG ACGTGGCTAG GTCGCCCTAC ATGTTCAGAG GCGCGGCCAA GGGGCTCGGC
GGGGAGGCTC TTGACGTATA CGCCGACATG GCCATAGACA CCTGCAAAGG ACTTGTGTAC
CTCTCCGACA GCGGCAGGTT TTACCCAGTA GGCAACACGG CGGCGCTGGG CTTTATAGGA
GCAAGGCCGG GGGGACTGCT TTACGTAGAG TTTAGAGGCC TTGTATACGC GCTGAGAGTG
CCGGAGAGCT ACGCAGCCCA GAACCTCCTA TTAAGGGCAG TAGATCCTCT AGGCGACGAT
TTCGGGCCTG GGAAGTACCA ATATCCCAAA AACCCAGCCT TCAAGCCGGG AGTGTTCGAC
CTAACCCTGT TCGAGCTATA CGACTTGGGG GACAAGTTCA GGTTCCAGTT CAGGGTTAAA
GAGCTAGGCG GCAATCCGTG GGGCGGGCCA GCCGGCTTCT CCCTGCAGTT CTTCCACGTG
TACATCAATA GAGGAGGCGG CGTCCGGAAC GACACGCTTG GCTTGAGGGT TTCCCTCTGC
AAAGAAGCGT CATGGGACGT CGCATTGTTA ATTGGGCCGG GCTGGACCGG CGGTAATAGG
ATAGTCTACG CTGACGGCGT CTACGTCGAC GACGCCATGT CTATAAAGCT GGGCACCAAC
AATACCATAA TTGCCGATGT GCCCAAGAAA TACCTCGGAA ACTACGACAA TAAGTGGAAA
ATAACCGTGT TCCTAACTTC GTGGGACGGC TACGGCCCAG ACAACATTAG AAACTTTGGA
GTAATGGCCG ACGAGTGGAC TGTAGGAGGC GCCGATCCCG TCGCTGTGTT GGCAGGCGTT
GCGCCGCGGG TCTTTGACGT CTTGGCGGAA ACTGCAGAGA TGCAAGTAAA AGCCCTCACT
TCCTACAAAG TCGTCAGACT TCCCAACGGC ACATATATTG GAGCACCCGC CGTTGTGTGC
GCGTATCTCA CAGGTTCGGC GGAGATGTGC ACCGCCACCG TCACCCAGTT CGTCACAATC
ACGGCGACAG CCACAGTAAC TCGTACATTC ACCGAGACGT ACACCACTGT GTCCACTCAA
GTAGCCACCA CGGTGACCTC TACTGAGAGG GTCAAGGAAA TCGATTGGCC CACCACAGCG
GCGCTGACAG TAGCCGCCCT CGTAGCCGGA CTCCTCGCCG GCATTTTAAC AAGACGAAAA
TAA
 
Protein sequence
MRAVLALLLA AVAILAGPVN VVFILHNHQP WYVDFEKNEL ILPWVRMHAV GNYLKVPLLI 
NESGVPVAFT LSGSLIEQLN WYANGTYIDV RYRISEKIAR GEPLTAEEKY AMLAIPGGFF
DINWQNILYK HPRYAVLLGL RNDAFSKCPP GNVTCVISRF SDQDFVDLAT LFNLLWIDPY
IAKKYSDIWT LVNKTSYTRD DLKKVLDLHR ELVGKVLSLY NELARQGKIE LVPVPYSHPL
MPLLADMGAL EDLRLHISLS EGLFKRYLGT TPTGVWPPEQ AVNDEVLRLF AEAGYLWTVT
DEDVLKTTAP GFSHYSLYYA DYGGRRLYVF FRDKTLSDAL GFRYSSMSPQ VALADFVNYL
KKVPQGECNV VVVALDGENP WENYPNFGDD FLKTFFQGLA ELEKNGTVKI WKPTDFVNHC
KDKATPLPQR QFRYFDLKVD ISIYKSIRDL PTRPVAGKIA EGSWSGGGSL AVWIGDPDEN
VWWMWLKKAR EDVGINRTWD VLFPLLVAEA SDWPFWYGND MGSPQTFDPV AKSALRSYYT
RAGLQPPQYL QTSAYPAGTP REDKIVGRGE GKIAMYGGAV IYANTTHIWI EGGPCGVVYI
SNPDVARSPY MFRGAAKGLG GEALDVYADM AIDTCKGLVY LSDSGRFYPV GNTAALGFIG
ARPGGLLYVE FRGLVYALRV PESYAAQNLL LRAVDPLGDD FGPGKYQYPK NPAFKPGVFD
LTLFELYDLG DKFRFQFRVK ELGGNPWGGP AGFSLQFFHV YINRGGGVRN DTLGLRVSLC
KEASWDVALL IGPGWTGGNR IVYADGVYVD DAMSIKLGTN NTIIADVPKK YLGNYDNKWK
ITVFLTSWDG YGPDNIRNFG VMADEWTVGG ADPVAVLAGV APRVFDVLAE TAEMQVKALT
SYKVVRLPNG TYIGAPAVVC AYLTGSAEMC TATVTQFVTI TATATVTRTF TETYTTVSTQ
VATTVTSTER VKEIDWPTTA ALTVAALVAG LLAGILTRRK