Gene Sala_2875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2875 
Symbol 
ID4080668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3026366 
End bp3027400 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content72% 
IMG OID638011259 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_617913 
Protein GI103488352 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.640549 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGA TCCTCGGCCT CGAATCGAGC TGCGACGAAA CGGCAGCGGC GCTTGTCACC 
GGCGACCGGC GCGTCCTCGC GCACCGCGTT GCGGGACAGG AGGCCGAACA CCGGCCCTAT
GGCGGCGTGG TACCCGAAAT CGCCGCGCGC GCGCATGTCG ACCGGCTCGC GCCGATCGTC
GAAGGCGTGC TCGATGACGC GGGCGTGACG CTCGCCGACG TCGATGCGAT CGCAGCGACC
GCCGGGCCGG GGCTGATCGG CGGGGTGATG GTCGGCCTCG TCACCGGCAA GGCGCTGGCG
CACGCCGCGA ACAAGCCGCT GATCGCGGTC AACCATCTCG AGGGCCATGC GCTCAGCCCG
CGGCTCGCCG ATCCGACCCT CGACTTTCCC TATCTGCTGC TGCTCGTCTC GGGCGGGCAT
TGCCAGTTGC TGCTCGTAAA GGGCGTCGGC GATTATCGCC GTCTCGCCAC CACGATCGAC
GATGCCGCGG GCGAGGCGTT CGACAAGACC GCCAAGCTGC TCGGCCTCGG CTATCCGGGT
GGTCCCGCGG TCGAACGCAT CGCGGCCGAA GGCGACCCGC ACGCCGTGCC GCTGCCGCGC
CCGCTCGTCG GCAGCGCCGA GCCGCATTTC TCCTTTGCCG GGCTGAAAAG CGCGGTCGCG
CGCGCCGCGG CGAGCGGAAC CCATGACGTT GCCGATCTCG CTGCCTCGTT CCAGCAGGCC
GTCGTCGACT GCCTCGTCGA TCGCAGCCGC GGCGCGCTCG CGGCGTGCCC CGATGCCAGG
GCCTTCGTCG TCGCGGGCGG CGTCGCGGCC AATGGCGCGA TCCGCACCGC GCTCACCGAC
CTCGCCGCGC GCTTCGACAA GCCCTTCGTC GCGCCGCCGC TGTGGCTCTG CACCGACAAT
GGCGCGATGA TCGCCTGGGC GGGCGCCGAA CGCTTTGCCG CGGGGCTGAC CGACCCGCTC
GATACTGCGG CGCGCCCGCG CTGGCCGCTC GACCCCGCAG CCGAAGCAGT GCGCGGCGCG
GGAGTGAAAG CATGA
 
Protein sequence
MTLILGLESS CDETAAALVT GDRRVLAHRV AGQEAEHRPY GGVVPEIAAR AHVDRLAPIV 
EGVLDDAGVT LADVDAIAAT AGPGLIGGVM VGLVTGKALA HAANKPLIAV NHLEGHALSP
RLADPTLDFP YLLLLVSGGH CQLLLVKGVG DYRRLATTID DAAGEAFDKT AKLLGLGYPG
GPAVERIAAE GDPHAVPLPR PLVGSAEPHF SFAGLKSAVA RAAASGTHDV ADLAASFQQA
VVDCLVDRSR GALAACPDAR AFVVAGGVAA NGAIRTALTD LAARFDKPFV APPLWLCTDN
GAMIAWAGAE RFAAGLTDPL DTAARPRWPL DPAAEAVRGA GVKA