Gene Sala_0920 details


Gene Information

Locus tag: Sala_0920
Symbol:
ID: 4083130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 932756
End bp: 933811
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 63%
IMG OID: 638009281
Product: LacI family transcription regulator
Protein accession: YP_615971
Protein GI: 103486410
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
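
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; neither is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(932756, 933811)
print(length, "bp ->", protein_length(length), "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.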


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.830745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAGAC GCCGCCAGGC GGTGACGATC AAGCATGTGG CGGCCGATGC CGGTGTGTCG 
CTGCAAACGG TCAGCCGTGT GATCAACGAC GAACCCAATG TGCGCTCGGC AATGAAGGCG
CGCGTCCAGG CGTCCATCGA CAAGCTCGGC TATGTGCCGT CGATTGCCGC CCGCCGGATG
AGCGGGTCGC GCTCTTATCT GATTCTGGCG ATCAACGATC GCGACCGGAC GATCGCGGAC
TGGACGGCGC GGCAGGGCAC CGATTGGGTC GACCAGATGC TGCTGGGCGG CATGCTCAAG
TGCGCCGAAT ATGGCTATCG GCTTATTTTT GAGCTTGTCG ACACGCACAG CGACCATGTC
GAACGCGAAC TGCGCGCAAC CATCGCGGCG CTTCAGCCCG ACGGCGTGAT TCTGACGCCC
CCCCATTCCG ACAATCCGCT GATCGTGCGA TTGCTTGAAC GGCAGCGAAT ACCCTTTGCG
CGCATCGGAT CGCGCGGCGG AGGGGCGGGG ATTGCGCTGG TGATGGATGA CGAGAGCATG
GCGCGCCACG CGACGCGTCA CCTCATCGAC CTTGGCCATC GGCGCATTGC TTTCATTGCA
GGTTCAAGCG AATATCCGCT GAGCCAATGG CGCGTCGATG GTTGGGAAAG CGAAATGCGT
GCCGCGGGAT TGCCGACCGC CGGACTCGTG GCGAGAGGCG ACTTCACTTA CGAATCGGGC
GCGGCCGCCA CGCGGCAGCT TCTTGGTCAT CCGGATCGCC CTTCGGCGAT CATCGCCAGC
AATGACCAGA TGGCGCTCGC CGCGCTCGAA GTCGCGCGCG AACTGGGGAT CGAGATTCCG
TCACAGCTTT CGCTCGTAAG TTTCGACAAT ACGCCGATCG TGCGTTTTAC CCAGCCGACG
CTTACCGCCG TTGATCAGCC GGTCGCCGAA ACCGTGTCGC GCGCCGTCGA AATGATCATC
AAGGCGCAGC GGGGGGAAAA GTTGCCGCCA CAACCCGTGA TTGTTGCCGG GGGCTTCGTC
GAACGCGAAT CGACTTCTGC GCCCGCGCAT GGATGA
 
Protein sequence
MARRRQAVTI KHVAADAGVS LQTVSRVIND EPNVRSAMKA RVQASIDKLG YVPSIAARRM 
SGSRSYLILA INDRDRTIAD WTARQGTDWV DQMLLGGMLK CAEYGYRLIF ELVDTHSDHV
ERELRATIAA LQPDGVILTP PHSDNPLIVR LLERQRIPFA RIGSRGGGAG IALVMDDESM
ARHATRHLID LGHRRIAFIA GSSEYPLSQW RVDGWESEMR AAGLPTAGLV ARGDFTYESG
AAATRQLLGH PDRPSAIIAS NDQMALAALE VARELGIEIP SQLSLVSFDN TPIVRFTQPT
LTAVDQPVAE TVSRAVEMII KAQRGEKLPP QPVIVAGGFV ERESTSAPAH G
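
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned, its GC fraction computed, and its translation checked against the protein sequence follows. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the start codons it permits), and all function names are hypothetical helpers, not part of any database tool. Only the first line of the gene sequence is used as sample input, so the GC fraction it yields describes that line alone, not the whole gene.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCGAGAC GCCGCCAGGC GGTGACGATC AAGCATGTGG CGGCCGATGC CGGTGTGTCG"
print(translate(first_line))
print(gc_fraction(first_line))
```

Passing the full gene sequence to the same functions would allow the result to be compared with the GC content field and the protein sequence listed in this record.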