Gene Sala_0034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0034 
Symbol 
ID4082221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp32993 
End bp33979 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content66% 
IMG OID638008394 
ProductArsR family transcriptional regulator 
Protein accessionYP_615093 
Protein GI103485532 
COG category[H] Coenzyme transport and metabolism
[K] Transcription 
COG ID[COG0640] Predicted transcriptional regulators
[COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0629975 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC TGATCGACAT TTGCCGGGCC TTGGCCGATC CGACCCGCTT GCGAGTCGTG 
GCCTTGCTGC GCGAGATGGA ACTGGCGATC GGCGAGTTGG CGGTGGTTCT GGACCAGAGC
CAGCCGCGTG TTTCCCGCCA CGTCCGCATC CTGGTCGAAG CGGGGATCGT CGAGCGGCGC
CGTGAGGGAA GCTGGGTATT CCTGCGCATC GTCGCGGATG GGCCGATCGC CGCAATCATC
GCGCAAGCGG ACAAATGGCC TTTTTCTGCG CGCGAAATGC GCGTGATCGC GCATGATGCG
CGTCGCCTTG CGGCGGTGCG CGCCGAACGC GCGGCAGCGG CCGCGCGATA TTTCGCCGAA
CATGCTGCCG AATGGGATGC CATTCGCTCA CGCCATGTCG CCGAAAGCGA GGTCGAAGCG
GCGATGCTGG CGATGATGCA CAACCGCCGC CTTGGCCACC TTCTCGACAT CGGGACGGGA
ACCGGGCGGA TGGCAGAGAT TTTTGCTCCG ACCGCGCGCC GCATCACCGC CCTCGACCGC
AGCCCCGAAA TGCTGCGGAT CGCCCGCGCC AAGCTCGAAA GACAGCCGGT GCCCGTCGAC
CTGATCCAGG GCGATTTTCT GGAGTTGCCG GTGGGGGACG CGAGCGTCGA CAGCATCGTC
ATTCATCAGG CGCTGCATTT TGCGCACGAA CCCGATCGCG TGATCGCGGA AGCGAGCCGG
GTGCTGCGCG GCGGCGGCCA CCTGCTGATC GTCGATTTCG CGCCGCACGA GGATGAGGAA
TTGCGCACGC TTGCCGCGCA CGCCCGCCTC GGCTTTTCGG ACGCGCAGAT CCGCGGCTGG
TTCGCCTCGG CGGGCCTGCT GCTCGAAACC ACACAGACGC TCGAAGGCGG GAAGCTGGCC
GTCAAGCTCT GGCTCGGACG TCGCCGGAGC GACCAGGATC AACCCCCCGT CAGCGACGGC
GGACCGACGA AAAGGCTTGC TGCATGA
 
Protein sequence
MSELIDICRA LADPTRLRVV ALLREMELAI GELAVVLDQS QPRVSRHVRI LVEAGIVERR 
REGSWVFLRI VADGPIAAII AQADKWPFSA REMRVIAHDA RRLAAVRAER AAAAARYFAE
HAAEWDAIRS RHVAESEVEA AMLAMMHNRR LGHLLDIGTG TGRMAEIFAP TARRITALDR
SPEMLRIARA KLERQPVPVD LIQGDFLELP VGDASVDSIV IHQALHFAHE PDRVIAEASR
VLRGGGHLLI VDFAPHEDEE LRTLAAHARL GFSDAQIRGW FASAGLLLET TQTLEGGKLA
VKLWLGRRRS DQDQPPVSDG GPTKRLAA