Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_2266 |
Symbol | |
ID | 4080570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | - |
Start bp | 2384752 |
End bp | 2385753 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638010645 |
Product | HhH-GPD |
Protein accession | YP_617308 |
Protein GI | 103487747 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTTT CTGCGCGGCT GCTCGGCTGG TACGATCGGT CGGCGCGCGT GCTGCCGTGG CGGATCGCGC CGGGGCGCGC GGAGGTGCCC GACCCCTATC GCGTCTGGCT GGCCGAAGTC ATGCTCCAGC AGACGACGGT CGCTGCGGTG GCGGGTTATT TCGCGCACTT CACGGAGCGT TGGCCGACGG TCGCCGATCT GGCCGCGGCC GGCGATGCCG AGGTCATGGC GGCGTGGGCA GGGCTTGGCT ATTACGCCCG TGCGCGCAAC CTGCTGGCCT GCGCGCGCGC CGTCGTCGCC GAGCATGGCG GATGCTTTCC GGACAGTGAG GCGGGGCTGC GCGCGCTGCC GGGGATCGGC GCCTATACCG CCGCGGCGGT GGCGGCGATC GCCTTTGGCC GCCCGGCGGT CGTCGTCGAC GCCAATATCG AGCGGGTGAT CGCGCGCCAC CGGTGCATCG AAACGCCGCT CCCCGCCGCG AAGCGCGCGA TTCGCGACGC GCTGGCGCCG CTGGTTCCGG GGGATCGGCC GGGCGATTTC GCGCAGGCGC TGATGGACCT CGGCGCGACC CTTTGCACGC CGCGCGCGCC CGTGTGCGCG CGCTGCCCGA TCGCCGCCGA CTGCCGCGCG CGCGGGCGCG CCGACATCGA GCGGCTGCCG GTCAAGCCGC CGAAGAAGGC CAGGCCGCGC CGCCACGGCG TTGCCCACTG GATCGAGCGA GACGGCGCGA TCTGGCTGGT GCAGCGGCCG GGCAAGGGGA TGCTCGGCGG GATGCGCGCG CTGCCCGGCG GCGAATGGTC GGACGAGCCG CCCGGCGAAT CGGGAATCGT CCGCGTCGAC CATGGTTTCA CCCATTTCGA CCTGACGCTG GTTCTCGTCC GCCGCGAAAC GGCCGATGCC GCAGCGGAAG GCATCTGGTG GCCGATCTCG GACCTTGACG CCGCGGGGCT GCCGACGCTC TATCGCAAGC TGGTGGTCAA GATGCTGGAG AGAGACGCAT GA
|
Protein sequence | MSFSARLLGW YDRSARVLPW RIAPGRAEVP DPYRVWLAEV MLQQTTVAAV AGYFAHFTER WPTVADLAAA GDAEVMAAWA GLGYYARARN LLACARAVVA EHGGCFPDSE AGLRALPGIG AYTAAAVAAI AFGRPAVVVD ANIERVIARH RCIETPLPAA KRAIRDALAP LVPGDRPGDF AQALMDLGAT LCTPRAPVCA RCPIAADCRA RGRADIERLP VKPPKKARPR RHGVAHWIER DGAIWLVQRP GKGMLGGMRA LPGGEWSDEP PGESGIVRVD HGFTHFDLTL VLVRRETADA AAEGIWWPIS DLDAAGLPTL YRKLVVKMLE RDA
|
| |