Gene Pars_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0199 
Symbol 
ID5054521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp178344 
End bp179801 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content58% 
IMG OID640467778 
Productglycoside hydrolase family protein 
Protein accessionYP_001152466 
Protein GI145590464 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.400833 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAGTGC TCAGTTTTAT AATTACACGT CGGGCTGTGG GTGTGATAGT GTTTCATCTT 
CACCTGTATC AGCCGCAGAG GGAGGATCCT TGGCTGGAGA TTATACTACC GGAGCCCTCT
GCCTCTCCGT ATAGGCACTG GAACGAGAGG GTCTCACGGG AGTGTTACGA GCCTAACGCG
GAGCTGGGCA ACTACCAGTG GGTGAGCTTC GACGTAGGAC CTACGCTGAT GAGCTGGCTG
AGGGCAAACA AGCCTCTGGT CTACAAGGCG CTTTTCGAGG CAGATAAGGC GGGGCTGGAG
AGGTGGGGCC ACGGAAACGC ACTTGCTCAT CCATACTACC ACGTAATCCT CCCGTTGGTC
TCCCGGCGGG ATCGGGACAT CCTCGTCTAC TGGGGGGTGG AGTACTTCAG GAGGGTGTTT
AAGAGGAGCC CCGAGGGGAT GTGGCTCCCC GAGATGGCGG TGGATCTCGA GACCTTGGAG
GTGTTGGCGG ACAACGGAGT TACGTACACA GTCCTCACCC AGAGCCAAGT AAAGGGGCGC
CTCGCAGGCG GGCCCTACAA GGTGGTTCTG CCTAGCGGGA GGAGCATTGC GGTTTTCGTC
CGCGACGAGG CGTTGTCCAA TGCCCTCGCC TTCTCGGGCT TTGAGAGATT TGGGGAGATG
CTGAGAGGCG TCTCGGGCGA CGTCGTCGTT GCCCTCGACG GTGAAACCTT CGGCCACCAC
ATAAAGGGAG GGGACAAGAT GCTTGCCCAG TTTATACAAG CCAATAGGGA CCGGCTGGGC
AACCTCGGAG CCTTGTACGA GAAGGGCTAC AAAGGCGAGG TGGAGATTGT GGAGAGGACC
TCGTGGAGTT GCCCCCACGG CCTTGGGAGA TGGAGCTACG ACTGCGGATG CGACGGCCCG
GCTCCTTGGA GGGAGCCTCT GAGGAAGCTC ATAGACTGGG TAGGCGAGGT TGTGGATAAG
GCATTTGTGG AGAGGCTGGG CGATAGGGGG TGGGCGCTCC TCAGGGAATA CATAGCAGTG
GTGCTGGGCG GCAGCAACGA TGGGTACACC GCCGAGGAGC TCAAGTTGCT GGAGGCGCAG
CGGGCTAAGC TGGCGGCTAA TACAAGCGAT GCGTGGTTCT TCGCCCGGGT CGGCATTGAG
TTCGGCATAG CTGTTAAGTG GGCGCTCAGG TCGCTAGAGC TAATAGAAGA TAAAGCCGTG
TTAGGGGAGT TCTTCAACCG GCTCAGGCAG ATAGCTGTAG ACGGGAAGAC CGCTATGGCC
TTCTGCCCAG GCGTCAGAGG GCCGTTGCTA GCCGCCGCCA TGTACCTAGC CCTATCTACT
GCCGGGGCTC CGCAAGAGCG GATTGGGCCG TACATAGTAA GACCTATCAA CGACGAGTTT
GAGATAGTGG ATAGCAGGAT TAGAGAAGTG TATAGATTCA GACACGACTT ATTATGGGGA
AGAACTGAAA GTTTATAA
 
Protein sequence
MIVLSFIITR RAVGVIVFHL HLYQPQREDP WLEIILPEPS ASPYRHWNER VSRECYEPNA 
ELGNYQWVSF DVGPTLMSWL RANKPLVYKA LFEADKAGLE RWGHGNALAH PYYHVILPLV
SRRDRDILVY WGVEYFRRVF KRSPEGMWLP EMAVDLETLE VLADNGVTYT VLTQSQVKGR
LAGGPYKVVL PSGRSIAVFV RDEALSNALA FSGFERFGEM LRGVSGDVVV ALDGETFGHH
IKGGDKMLAQ FIQANRDRLG NLGALYEKGY KGEVEIVERT SWSCPHGLGR WSYDCGCDGP
APWREPLRKL IDWVGEVVDK AFVERLGDRG WALLREYIAV VLGGSNDGYT AEELKLLEAQ
RAKLAANTSD AWFFARVGIE FGIAVKWALR SLELIEDKAV LGEFFNRLRQ IAVDGKTAMA
FCPGVRGPLL AAAMYLALST AGAPQERIGP YIVRPINDEF EIVDSRIREV YRFRHDLLWG
RTESL