Gene Pars_2149 details

Gene Information

Locus tag: Pars_2149
Symbol:
ID: 5056034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1925371
End bp: 1926969
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 56%
IMG OID: 640469701
Product: alpha amylase, catalytic region
Protein accession: YP_001154347
Protein GI: 145592345
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
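
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1925371, 1926969)
print(length)                     # 1599
print(protein_length_aa(length))  # 532
```

Both values agree with the Gene Length and Protein Length fields of this record.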


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTTGCT CTGTGGTGAA GTGGCGGAGC GACCCCTTCT ACGGCCGCGT GGCGGTCGTC 
AAGGCGGGTA ACGGATACGT CGTGGGAGAC TTCACCGGCT GGATACACGG GGCCTTTAAA
GAGGTGGTGG AGCTACCGCC CGGCAGATAC GCCGTTGCCT CCAGCGACGG CGCCGAGGAG
TGCCTAGTAG AGCCGCCGGA GTATCCATGG CACTTCTCCG TGCCCTATAT GGGGGTGGAC
TGGGGGGACG TGGCCGAGAT TAGGATATAC GCCCCGGAGC CCCCCGAGGT AAGCGGGGGC
CGTGTCGTGA AGTTATTGGA GGGGGAGCCC TTTTCAATAT ACGCGGCGGT TATAAAAAGT
AGAAGGTATG AAGTGAGGTG TTGCGGCAAG GTGAAGAGGT ATAGACGGCC TCCTCTAGTT
GAGGGGCATG GCATCTACGC CATGTACGAG GTCCTACCCG ATAGAGCCGC CAATAGAACG
GGGTGTAGGG ATCTTAGGCG CCAGTTCTGC GGCGGGACTT TAAAGGACGT TGCCGAGATT
GCCTTGTCCG CTTCTGAATT TGCAGATGCG CTGTATCTGC ATCCAATATA CCCCGCAATG
AGCTACCACA GATACGACGT GGTGAACCAC TTGGATGTGG ACGAGAGGCT TGGGGGGTGG
GCTGCGTTCG CCGCACTTAA GGACGCACTC AACGGGCGGG GGATGAAGCT TGTTCTTGAC
TTGGTACTAT ACCACGTTGG CCTCCGCAAC CCGCTCTTCC CCAACGGTCC CTTCATCATA
AGAGACCAAT CCTTCACAAC GCTTGTCAAG TCTCTGGCTG ACATTATGCC TAGGAACGCC
TTGACGGGAC TCCTGCTTGG AAAACCCCCG TATGATACGT TTCTAAAAGT TTGGCTTATG
CCTCGGCTAG ACTACTCAGA CCGCCGTGCG GTCCAATACG CGAGGAGCGT TGTGGAGTTC
TGGACGCCTA AAGTCGACGG GTTTAGGCTT GACGTTGCCC ACGGCATGCC CCCATCTGCG
TGGGACGAGA TACTAGAACC GGCGCGGCAT CGGTACATTC TAGGAGAACA TGTGGGCAAC
CCCGCTCCAT TTTACAAGTC AATTAAGGGC TTCACTGCCT ATATCTTATA TGGAGAATTG
GTGAAGTCCG GTTCTTTTTC CACAATTTCG GAGGCGATTA ATAGGTACCT TGCACTGACG
CCGCCGGGCG CCTTGCCTTA TATGAACACG TTTATTGAAA ACCACGACAC TGATAGGGCA
GTCACTACTA TGGGGGGCTT GGTGACTGTG GGATATGCGG TGATATTCAC GCTACCTGGG
GTCCCCTCCG TGTACGCAGG TGGCGAGTGC GGCGTAGGTG GTAGGGCCAG TGACCACACT
AACCGGGCCC CTTATAAGCC ATGTCCCGGG TCTCCCATTG CCGACACGCT CCGTGCGCTG
TACTCGGCGA GAAGAGAGTT TGGCCTATGG CGTGGGCCTG CGTGGGCAGA GCAGAAAAGA
GGGCGTATTA TCATAAACAG GCCGGGCACT AGAGCGGAGA TAGACTTAAA TAAAATTGCC
ATACTCGGCG CGGGGCGACA GCAAGAGATT CCTTTATAA
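
A GC content figure such as the one listed for this gene can be recomputed from a sequence printed in 10-base blocks like the one above. The following is a minimal sketch: the function name `gc_fraction` is hypothetical, and it assumes that whitespace and any non-ACGT characters should simply be ignored. The example runs on the first 10-base block only, not on the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and other characters are skipped (an assumption of this sketch).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


first_block = "GTGGCTTGCT"  # first 10 bases of the gene sequence
print(gc_fraction(first_block))  # 0.6
```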
 
Protein sequence
MACSVVKWRS DPFYGRVAVV KAGNGYVVGD FTGWIHGAFK EVVELPPGRY AVASSDGAEE 
CLVEPPEYPW HFSVPYMGVD WGDVAEIRIY APEPPEVSGG RVVKLLEGEP FSIYAAVIKS
RRYEVRCCGK VKRYRRPPLV EGHGIYAMYE VLPDRAANRT GCRDLRRQFC GGTLKDVAEI
ALSASEFADA LYLHPIYPAM SYHRYDVVNH LDVDERLGGW AAFAALKDAL NGRGMKLVLD
LVLYHVGLRN PLFPNGPFII RDQSFTTLVK SLADIMPRNA LTGLLLGKPP YDTFLKVWLM
PRLDYSDRRA VQYARSVVEF WTPKVDGFRL DVAHGMPPSA WDEILEPARH RYILGEHVGN
PAPFYKSIKG FTAYILYGEL VKSGSFSTIS EAINRYLALT PPGALPYMNT FIENHDTDRA
VTTMGGLVTV GYAVIFTLPG VPSVYAGGEC GVGGRASDHT NRAPYKPCPG SPIADTLRAL
YSARREFGLW RGPAWAEQKR GRIIINRPGT RAEIDLNKIA ILGAGRQQEI PL
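
The relationship between the two sequences can be illustrated by translating the first and last codons of the gene. This is a sketch under stated assumptions: it uses the standard codon assignments (which translation table 11 shares), reads an initial GTG, TTG or ATG as methionine (table 11 lists further start codons not included here), and stops at the first stop codon. The `translate` function is written for illustration and is not a library API.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 uses the same assignments.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Subset of the table 11 start codons (assumption: enough for this example).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGGCTTGCT CTGTGGTGAA GTGGCGGAGC"))  # MACSVVKWRS
print(translate("CCTTTATAA"))                         # PL
```

The first call reproduces the first ten residues of the protein sequence, and the second shows that the gene ends with the codons for P and L followed by a TAA stop.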