Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0199 |
Symbol | |
ID | 5054521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 178344 |
End bp | 179801 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640467778 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001152466 |
Protein GI | 145590464 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1449] Alpha-amylase/alpha-mannosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.400833 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATAGTGC TCAGTTTTAT AATTACACGT CGGGCTGTGG GTGTGATAGT GTTTCATCTT CACCTGTATC AGCCGCAGAG GGAGGATCCT TGGCTGGAGA TTATACTACC GGAGCCCTCT GCCTCTCCGT ATAGGCACTG GAACGAGAGG GTCTCACGGG AGTGTTACGA GCCTAACGCG GAGCTGGGCA ACTACCAGTG GGTGAGCTTC GACGTAGGAC CTACGCTGAT GAGCTGGCTG AGGGCAAACA AGCCTCTGGT CTACAAGGCG CTTTTCGAGG CAGATAAGGC GGGGCTGGAG AGGTGGGGCC ACGGAAACGC ACTTGCTCAT CCATACTACC ACGTAATCCT CCCGTTGGTC TCCCGGCGGG ATCGGGACAT CCTCGTCTAC TGGGGGGTGG AGTACTTCAG GAGGGTGTTT AAGAGGAGCC CCGAGGGGAT GTGGCTCCCC GAGATGGCGG TGGATCTCGA GACCTTGGAG GTGTTGGCGG ACAACGGAGT TACGTACACA GTCCTCACCC AGAGCCAAGT AAAGGGGCGC CTCGCAGGCG GGCCCTACAA GGTGGTTCTG CCTAGCGGGA GGAGCATTGC GGTTTTCGTC CGCGACGAGG CGTTGTCCAA TGCCCTCGCC TTCTCGGGCT TTGAGAGATT TGGGGAGATG CTGAGAGGCG TCTCGGGCGA CGTCGTCGTT GCCCTCGACG GTGAAACCTT CGGCCACCAC ATAAAGGGAG GGGACAAGAT GCTTGCCCAG TTTATACAAG CCAATAGGGA CCGGCTGGGC AACCTCGGAG CCTTGTACGA GAAGGGCTAC AAAGGCGAGG TGGAGATTGT GGAGAGGACC TCGTGGAGTT GCCCCCACGG CCTTGGGAGA TGGAGCTACG ACTGCGGATG CGACGGCCCG GCTCCTTGGA GGGAGCCTCT GAGGAAGCTC ATAGACTGGG TAGGCGAGGT TGTGGATAAG GCATTTGTGG AGAGGCTGGG CGATAGGGGG TGGGCGCTCC TCAGGGAATA CATAGCAGTG GTGCTGGGCG GCAGCAACGA TGGGTACACC GCCGAGGAGC TCAAGTTGCT GGAGGCGCAG CGGGCTAAGC TGGCGGCTAA TACAAGCGAT GCGTGGTTCT TCGCCCGGGT CGGCATTGAG TTCGGCATAG CTGTTAAGTG GGCGCTCAGG TCGCTAGAGC TAATAGAAGA TAAAGCCGTG TTAGGGGAGT TCTTCAACCG GCTCAGGCAG ATAGCTGTAG ACGGGAAGAC CGCTATGGCC TTCTGCCCAG GCGTCAGAGG GCCGTTGCTA GCCGCCGCCA TGTACCTAGC CCTATCTACT GCCGGGGCTC CGCAAGAGCG GATTGGGCCG TACATAGTAA GACCTATCAA CGACGAGTTT GAGATAGTGG ATAGCAGGAT TAGAGAAGTG TATAGATTCA GACACGACTT ATTATGGGGA AGAACTGAAA GTTTATAA
|
Protein sequence | MIVLSFIITR RAVGVIVFHL HLYQPQREDP WLEIILPEPS ASPYRHWNER VSRECYEPNA ELGNYQWVSF DVGPTLMSWL RANKPLVYKA LFEADKAGLE RWGHGNALAH PYYHVILPLV SRRDRDILVY WGVEYFRRVF KRSPEGMWLP EMAVDLETLE VLADNGVTYT VLTQSQVKGR LAGGPYKVVL PSGRSIAVFV RDEALSNALA FSGFERFGEM LRGVSGDVVV ALDGETFGHH IKGGDKMLAQ FIQANRDRLG NLGALYEKGY KGEVEIVERT SWSCPHGLGR WSYDCGCDGP APWREPLRKL IDWVGEVVDK AFVERLGDRG WALLREYIAV VLGGSNDGYT AEELKLLEAQ RAKLAANTSD AWFFARVGIE FGIAVKWALR SLELIEDKAV LGEFFNRLRQ IAVDGKTAMA FCPGVRGPLL AAAMYLALST AGAPQERIGP YIVRPINDEF EIVDSRIREV YRFRHDLLWG RTESL
|
| |