Gene Sala_3033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3033 
Symbol 
ID4083041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3180084 
End bp3181139 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content66% 
IMG OID638011419 
ProductLacI family transcription regulator 
Protein accessionYP_618070 
Protein GI103488509 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCAC AGACCAGGGG GCCGGGGGGC AGGCAGCCGA CAATCAACGA CGTCGCCGCG 
CTTGCGGGTG TGTCCAAGAA AACGGTAAGC CGGGTCATCA ACCGGTCGGA GTTCCTGACC
GAAAAGACGC GCGCCGCGGT GGAAAAGGCG ATCGAGCAAC TGGGGTTCGT CCCCAATCCG
CAAGCGCGTG CGCTCGCCTT TCGCCGCAAC TTCCTGATCG CGCTGCTCCA CGACAATCCG
AACGCACAGA CGGTGCTCAA TTTCCAGCGC GGCGTGCTCG ACGCGATCAA GGACAGCGAT
CTGGCGCTGC TCGTCCGCCC GGTCGATCGC GGGTCGGACA AACTGCTCGA CGATGTGCGC
ACCTTCCTCG AAAAACAGCG TCCGATTGGC GCGATGCTGT TGCCGCCGAT TTCGGAGAAT
GACGAACTCG CGGCGCTTTG TGAGGATCTG GGCGTGCGTT ATGTGCGTAT CGGCTCGGCG
CCGCTCGACG ATGCCAAACA TTGCATCTCG TCGAATGATC GTGAAGTGGT GGCGGCGGCG
GTGCGCGGGC TGATTGCGCT GGGGCACCGC CGCATCGGCT TCGTGCGCGG CCCGGCCGGT
TTCCGCTCCG CCGCCGAGCG CGAGAAGGGC TTTTTGGAGG CGCTCGCCGA AGCGGGGCTC
ACGCTTCCGC CCGAGCTCAA TGCGCCGGGT AACTACCGCT ATGCCGCCGG AATCGAGGCG
GGCGAGGCGC TGCTCGCGCG CGCCGATCCG CCGACGGCGA TCTTCTGCTC GAACGACGAA
ATGGCGGCGG GGGTGCTGAG CGTCGCGCAT GGCAAGGGAA TCAAGGTGCC CGCCGAACTG
TCGATCATCG GCTTCGACGA CAGCCCGACC GCAACGCATA TCTGGCCCGC GCTCAGCACG
GTGCGCTGGC CGATCCGCGA AATGGGCGCG CGCGCCGCGC AGATCCTCGT TCCCGATTTT
CTCGGCCCCG GCGCGAAGGT CGATGACGAA GACAATGTGC TGCCCTCAAC ATTGGTCGAG
CGGCAGTCGG TCGCGCCCCC GCCCGACAGG CTCTGA
 
Protein sequence
MAAQTRGPGG RQPTINDVAA LAGVSKKTVS RVINRSEFLT EKTRAAVEKA IEQLGFVPNP 
QARALAFRRN FLIALLHDNP NAQTVLNFQR GVLDAIKDSD LALLVRPVDR GSDKLLDDVR
TFLEKQRPIG AMLLPPISEN DELAALCEDL GVRYVRIGSA PLDDAKHCIS SNDREVVAAA
VRGLIALGHR RIGFVRGPAG FRSAAEREKG FLEALAEAGL TLPPELNAPG NYRYAAGIEA
GEALLARADP PTAIFCSNDE MAAGVLSVAH GKGIKVPAEL SIIGFDDSPT ATHIWPALST
VRWPIREMGA RAAQILVPDF LGPGAKVDDE DNVLPSTLVE RQSVAPPPDR L