Gene Sala_3175 details


Gene Information

Locus tag: Sala_3175
Symbol:
ID: 4082511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 3331863
End bp: 3333008
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 69%
IMG OID: 638011560
Product: threonine aldolase family protein
Protein accession: YP_618211
Protein GI: 103488650
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3616] Predicted amino acid aldolase or racemase
TIGRFAM ID:
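
The start and end coordinates in this record are consistent with the reported gene and protein lengths. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The record does not state either assumption, but both match the numbers shown. Variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3331863
end_bp = 3333008

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; assumption: the last codon is a stop codon
# and is therefore not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1146 381
```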


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.534304
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGCG ACCGCCTCCT GCACGACGCG CTGATCGGTC GGCAGGGGAG CCGCGCCGAC 
CTCAACACCC CGGTGCTGAT TCTCGACCGC GAGGCGCTCG ACCGCAACAT CGCCCGCATG
GCGGCGCTCA CCAAGGCGGC GGGCGTCGCG CTGCGCCCCC ATGCCAAAAC GCATAAAAGC
GTCGATATCG CGCGCCGCCA GCTCGATGCC GGGGCGGTCG GCGTCTGCTG CGCAAAGATC
GGCGAGGCCG AAATACTGGC GGATGGCGGC ATCACCGGCA TCCTGATCAC CTCGCCCGTC
GCCGCCCCCG CCGCGATCGC GCGTCTTGCG AAACTGGCGG CGCGCGCCGA GGGGCTGATG
GCGGTCGTCG ATCATCCCGC GGTCGCCGTG CGCGTAGACG CGGCGCTTGC GGTCGAGGGT
GCGCGGCTCG ACGTCATCAT CGACATTGAC CCCGGCATCG CACGCACTGG CGTCGCGTCG
GCCGAAGCGG CGGTCGCGCT CGCGCAGACG ATCGACGCCT TGCCCGGCCT CGCCTGGCGC
GGCGTGCAAT ATTATTGCGG GTCACAGCAG CATATCGAAA GCTATGCCGA ACGCCGCGCC
GCGATCATCG AGCGCACGGA CTATTTGCAG GGGGTCATCG CCGCGCTGGC GGACGCCGGT
TTCGCGCCAC CGATCGTCAC CGGGTCGGGC ACCGGCACGC ACCGCATCGA CCTCGACCTC
GGTGTCTTTA CCGAGCTTCA GGCGGGCAGC TATGTCTTCA TGGACAAGCA ATATCTCGAC
TGCGAGCTGG CGGAGGGCGA AGCCGCGCCG TTCGAGGTCG CGCTCGCGGT CGATGCGCGC
GTCGTCAGCG CCAACCACAG CGGCCTCGTC ACGATCGACG CGGGTTTCAA GTCGCTCTCG
ACCGACGGCG GGGTGGCGGC GGTCCGGCGC GGCGCGCCCG AAACCGCCTT TTTCGCCTTC
ATGGGCGACG AACATGCCGC GCTCATCGCC CCCGGCATCG GCGACATGCT GCATCCCGGC
GATCCCGTCA GCCTGACCGT GCCGCACTGC GACCCGACGG TGAACCTCTA TGACCATTAT
CATGTCGTCG ACGGCGACAC GCTGATCGAC ATCTGGCCGG TCAGCGCCCG CGGCTGTGCG
CGATGA
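
The GC content field reported for this gene can be checked against the gene sequence above. The sketch below is one way to do that, not the database's own method. The function name `gc_content` is hypothetical. It assumes the sequence is pasted as plain text in the 10-base blocks shown, so spaces and line breaks are stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Toy input: the first 20 bases of the gene sequence above.
print(gc_content("ATGACGAGCG ACCGCCTCCT"))  # 0.65
```

Passing the full gene sequence to the same function should give a value close to the reported 69%, if the record's figure was computed over this sequence.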
 
Protein sequence
MTSDRLLHDA LIGRQGSRAD LNTPVLILDR EALDRNIARM AALTKAAGVA LRPHAKTHKS 
VDIARRQLDA GAVGVCCAKI GEAEILADGG ITGILITSPV AAPAAIARLA KLAARAEGLM
AVVDHPAVAV RVDAALAVEG ARLDVIIDID PGIARTGVAS AEAAVALAQT IDALPGLAWR
GVQYYCGSQQ HIESYAERRA AIIERTDYLQ GVIAALADAG FAPPIVTGSG TGTHRIDLDL
GVFTELQAGS YVFMDKQYLD CELAEGEAAP FEVALAVDAR VVSANHSGLV TIDAGFKSLS
TDGGVAAVRR GAPETAFFAF MGDEHAALIA PGIGDMLHPG DPVSLTVPHC DPTVNLYDHY
HVVDGDTLID IWPVSARGCA R