Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0452 |
Symbol | |
ID | 5056101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 394949 |
End bp | 396037 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468017 |
Product | cellulase |
Protein accession | YP_001152702 |
Protein GI | 145590700 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.547156 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGACT TCGTACAGCT TTTGAAAAAG CTCTCGGAGG CGAGGGGGCC GTCGGGCTTT GAGGACGAGG TTAGAGAGGT TGTAATAAGG GAAATGGAGC CTTATGTGGA CGAGGTAGTT GTGGATAAGT GGGGCAACGT CATCGGGGTG AAGAGGGGTT CCTCAGACTA CAGGGCCATG GTGGCGGCGC ACATCGACGA GATCGGACTC GTCGTGGACC ACATAGAGAA GGAGGGCTAC CTGAGATTCA GACCAATTGG AGGGTGGAAT GAGGTTACTC TCGTCAGCCA GCGGGTCTGG GTGAGGACTT CAGATGGCCG GTGGATAAGA GGGGTTGTGG GGTCTCTGCC GCCGCATGTA ACGCCGAGCG GGAGGGAGCG CGAGGCGCCT GAGATTAAGG ACTTGTATAT AGACATCGGC GTGTATAGCA GAGAGGAGGC GGAGAAGCTA GGCGTCACCG TCGGATCTGT GGTTGTGCTG GATAGGGAAT TCGCCGTGTT GAACGGAAAG GTGGTTACCG GGAAGGCGTT TGACGACAGG GTGGGAGTAG CCGTGATGCT CTACGCCTTG AGGATGCTGG AAAAACTACC TGTCACCCTC TACGCCGTGG CGACCGTCCA GGAGGAGGTT GGACTCCGCG GGGCAAGCGT CGCTGCAGAG CGAATTAATC CCCACTACGC TCTTGCCTTG GACACCACAA TTGCCGCCGA CGTGCCGGGC GTGGGGGAGA GGCTCCACGT AACTAAGCTG GGCAAGGGCC CGGCCATAAA GGTGCTGGAC GGCGGTAGGG GCGGCCTATT CATAGCCCAC CCAGGTCTGA GGGATCACAT CGTGAAGTTG GCGAGGGAGC TCGGTATTCC CTACCAGATG GAGGTTCTTT ACGGCGGTAC CACCGACGCC ATGGCCATAG CCTTTAGGAG GGAGGGCGTC CCCGCCGCTG TTATCTCGGT GCCTACGCGG TATATCCACT CCCCGGTAGA GGTGCTCGAC GTGGAGGACG CTGTAAACGC GGCTAAGTTG CTTAAGGCAA CGCTGGAAAG GACTACGCCG GAGATCGTGG AGAAGTTTCT TGACAAGAGA GTTAAGTAG
|
Protein sequence | MEDFVQLLKK LSEARGPSGF EDEVREVVIR EMEPYVDEVV VDKWGNVIGV KRGSSDYRAM VAAHIDEIGL VVDHIEKEGY LRFRPIGGWN EVTLVSQRVW VRTSDGRWIR GVVGSLPPHV TPSGREREAP EIKDLYIDIG VYSREEAEKL GVTVGSVVVL DREFAVLNGK VVTGKAFDDR VGVAVMLYAL RMLEKLPVTL YAVATVQEEV GLRGASVAAE RINPHYALAL DTTIAADVPG VGERLHVTKL GKGPAIKVLD GGRGGLFIAH PGLRDHIVKL ARELGIPYQM EVLYGGTTDA MAIAFRREGV PAAVISVPTR YIHSPVEVLD VEDAVNAAKL LKATLERTTP EIVEKFLDKR VK
|
| |