Gene Information |
Locus tag | Pars_0609 |
Symbol | |
ID | 5054185 |
Type | CDS |
Spliced gene | No |
Pseudogene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 544700 |
End bp | 545650 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640468167 |
Product | peptidase M42 family protein |
Protein accession | YP_001152852 |
Protein GI | 145590850 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
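The length fields above are consistent with the coordinates. Assuming the start and end positions are 1-based and inclusive (an assumption about this record's convention, though it agrees with the listed values), gene length is end - start + 1, and an unspliced CDS whose last codon is a stop encodes length / 3 - 1 residues. A minimal sketch of that check follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(544700, 545650)
print(length)                  # 951
print(protein_length(length))  # 316
```

Strand does not enter this calculation: a gene on the minus strand spans the same number of bases as one on the plus strand.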
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.467193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTAG AGTCTTTGAC AAGGGCTCTG GGGGTGTCGG GCTTCGAGGA GGAGGTCAGG CGCCTCATCC TCGGAGCAGT TAAGGACGGA GAAGTGGACG AGTTTGGGAA CGTATTGACA AAAACCAGCA AGTCGAAGGT GGCGTTCGTC GCCCACATGG ACGAGGTTGG GCTTCTCGTC ACAAGCATAG AAGAGGACGG CAGGCTCAAA TTCCGCAAAG TCGGCGGCGT GGACGACAGG ATCCTGCCAG GATCCTCTGT GGTGCTCTAC GGCGACGGGT TTAAGGTCGA GGGGGTCATC GGGGTTGCCC CGCCCCACTT CCAGCAACAG CAACAACAGG TGTCGTGGCA GGACCTCTAC ATAGACATCG GCGCGGCGAG CAGGCAAGAG GCCGAGTCCA TGGGCATAGG GCCGATGACC CCCGCGGCGT TTTCCAGGCG GTACTCCAAC ATGGGGAAAT TCGTCTCCGC CACGGCGCTG GACGACAGGG CCGGTTGCTG GGTGTTGCTC GAGGCGTACA GGAGAGCCGC CGCGACCTAC GTCTGGACCG TGCAGGAGGA GGTGGGACTC ATGGGAGCCC GCGCGCTGTC AAGACGCCTC GACGTCAGTT ACGTGGTGGT TGTTGACACG ATGGCCTGTT GCCACCCCAA CTGGACCGGC GGGGTGAAGC CGGGAAACGG TCCAGTCCTC CGCCTATTCG ACAACTACGG CGCCTACAAC AACAAGTTGG CCAAGAAGGT GCTTGAGGTC GCGAAGAGGC GGGGCATACC CATACAGATA GGCTCAGGGG GCGGCGGCAC AGACGCAGGT GCCTTCTTCG CCGCCGGGAT CCCAGCCGTG GCCATCGGCA TATTGACCAA GTACTCCCAC TCGCCGGTGG AGATGGCGCA CAAAGACGAC CTAAAACACG CCGTGGAGCT AGTCGTGGCA CTTGCCGAGG AGCTCGCCTA A
|
Protein sequence | MELESLTRAL GVSGFEEEVR RLILGAVKDG EVDEFGNVLT KTSKSKVAFV AHMDEVGLLV TSIEEDGRLK FRKVGGVDDR ILPGSSVVLY GDGFKVEGVI GVAPPHFQQQ QQQVSWQDLY IDIGAASRQE AESMGIGPMT PAAFSRRYSN MGKFVSATAL DDRAGCWVLL EAYRRAAATY VWTVQEEVGL MGARALSRRL DVSYVVVVDT MACCHPNWTG GVKPGNGPVL RLFDNYGAYN NKLAKKVLEV AKRRGIPIQI GSGGGGTDAG AFFAAGIPAV AIGILTKYSH SPVEMAHKDD LKHAVELVVA LAEELA
|
| |
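The sequences above are printed in whitespace-separated blocks of ten characters. To work with them, the blocks first need to be joined. The sketch below does that and computes a GC percentage that can be compared against the GC content field in the record; the helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Join the whitespace-separated blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGGAGTTAG AGTCTTTGAC AAGGGCTCTG")
print(len(fragment))              # 30
print(fragment.startswith("ATG"))  # True
print(round(gc_percent(fragment), 1))
```

Passing the full gene sequence string through `clean_sequence` should yield a length equal to the Gene Length field, which is a quick way to confirm that no blocks were lost when copying.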