Gene Sala_0565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0565 
Symbol 
ID4081455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp578579 
End bp579637 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content62% 
IMG OID638008923 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_615619 
Protein GI103486058 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.548762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.672721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTCA ACATCCGGAA CTGGCAGGAA TTGAAGAAGC CCAGCAACCT GGAAATCAAG 
ACCGGCGGCG ACGGCAAGCG CAAGGCGACC TTCGTTGCCG AACCGCTCGA GCGCGGTTTC
GGCCTGACGC TGGGTAACGC GCTGCGCCGC GTGCTGCTCT CGTCGCTCCA GGGCGCGGCG
ATCACCTCGA TCAAGATCGA AAATGTGCTG CACGAGTTTT CGAGCCTTGC CGGTGTGCGC
GAAGACGTCA CCGACATCGT CCTGAACGTC AAGCAGATCG CGCTCAAGAT GGAGGGCGAA
GGGCCGAAGC GGCTCCAGCT TTCGGCGACG GGCCCGGCGA CGGTCAAGGC GGGCGACATC
ATGGTGTCGG GCGACATCAA GGTGATGAAC CCGAACCATG TCATCTGCCA TCTCGACGAA
GGCGCGACGC TCAACATGGA GCTCGTCGCC GACACCGGCA AAGGCTATGT CCCCGCGACC
GCCAACCGCC CGGCCGATGC GCCGATCGGC CTGATCCCGG TCGACTCGCT CTATTCGCCG
GTGCGCCAGG TCGCCTACAA GGTCGACAAT GCCCGCATCG GGCAGGAACT GGACTATGAC
AAGCTGAACC TGACCGTCGA AACCGACGGC ACGGTGACGC CCGAAGATGC TGTCGCTTAC
GCCGCACGCA TCCTGCAGGA TCAGCTCCAG GTCTTCGTCC ACTTCGAAGA AGCGATGAGC
GACAGCGGCC TGATCGGCAT GGCGGCCCCC TCGGCCGCCA GCGACGAGTC CGACGTCAAC
CAGCTCAACC GCTTCCTGCT CAAGAAGGTC GACGAGCTTG AACTGTCGGT CCGCTCGGCC
AACTGCCTGA AGAACGACAA CATCATCTAT ATCGGTGACC TCGTTCAAAA GACCGAAGCC
GAAATGCTCC GCACGCCGAA CTTCGGCCGC AAGTCCTTGA ACGAAATCAA GGAAGTGCTG
TCGTCGATGG GCCTGCGTCT GGGCATGGAC ATCCCCGGCT GGCCACCCGA GAATATCGAG
GAAATGGCCA AGAAGCTGGA GCAGGAGCTG CTGGGTTAA
 
Protein sequence
MTVNIRNWQE LKKPSNLEIK TGGDGKRKAT FVAEPLERGF GLTLGNALRR VLLSSLQGAA 
ITSIKIENVL HEFSSLAGVR EDVTDIVLNV KQIALKMEGE GPKRLQLSAT GPATVKAGDI
MVSGDIKVMN PNHVICHLDE GATLNMELVA DTGKGYVPAT ANRPADAPIG LIPVDSLYSP
VRQVAYKVDN ARIGQELDYD KLNLTVETDG TVTPEDAVAY AARILQDQLQ VFVHFEEAMS
DSGLIGMAAP SAASDESDVN QLNRFLLKKV DELELSVRSA NCLKNDNIIY IGDLVQKTEA
EMLRTPNFGR KSLNEIKEVL SSMGLRLGMD IPGWPPENIE EMAKKLEQEL LG