Gene Sala_2266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2266 
Symbol 
ID4080570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2384752 
End bp2385753 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content72% 
IMG OID638010645 
ProductHhH-GPD 
Protein accessionYP_617308 
Protein GI103487747 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTT CTGCGCGGCT GCTCGGCTGG TACGATCGGT CGGCGCGCGT GCTGCCGTGG 
CGGATCGCGC CGGGGCGCGC GGAGGTGCCC GACCCCTATC GCGTCTGGCT GGCCGAAGTC
ATGCTCCAGC AGACGACGGT CGCTGCGGTG GCGGGTTATT TCGCGCACTT CACGGAGCGT
TGGCCGACGG TCGCCGATCT GGCCGCGGCC GGCGATGCCG AGGTCATGGC GGCGTGGGCA
GGGCTTGGCT ATTACGCCCG TGCGCGCAAC CTGCTGGCCT GCGCGCGCGC CGTCGTCGCC
GAGCATGGCG GATGCTTTCC GGACAGTGAG GCGGGGCTGC GCGCGCTGCC GGGGATCGGC
GCCTATACCG CCGCGGCGGT GGCGGCGATC GCCTTTGGCC GCCCGGCGGT CGTCGTCGAC
GCCAATATCG AGCGGGTGAT CGCGCGCCAC CGGTGCATCG AAACGCCGCT CCCCGCCGCG
AAGCGCGCGA TTCGCGACGC GCTGGCGCCG CTGGTTCCGG GGGATCGGCC GGGCGATTTC
GCGCAGGCGC TGATGGACCT CGGCGCGACC CTTTGCACGC CGCGCGCGCC CGTGTGCGCG
CGCTGCCCGA TCGCCGCCGA CTGCCGCGCG CGCGGGCGCG CCGACATCGA GCGGCTGCCG
GTCAAGCCGC CGAAGAAGGC CAGGCCGCGC CGCCACGGCG TTGCCCACTG GATCGAGCGA
GACGGCGCGA TCTGGCTGGT GCAGCGGCCG GGCAAGGGGA TGCTCGGCGG GATGCGCGCG
CTGCCCGGCG GCGAATGGTC GGACGAGCCG CCCGGCGAAT CGGGAATCGT CCGCGTCGAC
CATGGTTTCA CCCATTTCGA CCTGACGCTG GTTCTCGTCC GCCGCGAAAC GGCCGATGCC
GCAGCGGAAG GCATCTGGTG GCCGATCTCG GACCTTGACG CCGCGGGGCT GCCGACGCTC
TATCGCAAGC TGGTGGTCAA GATGCTGGAG AGAGACGCAT GA
 
Protein sequence
MSFSARLLGW YDRSARVLPW RIAPGRAEVP DPYRVWLAEV MLQQTTVAAV AGYFAHFTER 
WPTVADLAAA GDAEVMAAWA GLGYYARARN LLACARAVVA EHGGCFPDSE AGLRALPGIG
AYTAAAVAAI AFGRPAVVVD ANIERVIARH RCIETPLPAA KRAIRDALAP LVPGDRPGDF
AQALMDLGAT LCTPRAPVCA RCPIAADCRA RGRADIERLP VKPPKKARPR RHGVAHWIER
DGAIWLVQRP GKGMLGGMRA LPGGEWSDEP PGESGIVRVD HGFTHFDLTL VLVRRETADA
AAEGIWWPIS DLDAAGLPTL YRKLVVKMLE RDA