Gene Sala_0979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0979 
Symbol 
ID4079978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1003132 
End bp1005141 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content61% 
IMG OID638009339 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_616029 
Protein GI103486468 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0235491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACGA AGAACGAAGC CGATACCGAC GCCCCGCTGA TCGACCTGAA CGAGGCCGAC 
GTCAAAAAAC TGATCGCGCG CGGCAAGAAG CGCGGTTACC TGACCTATGA CGAGCTCAAT
GCGGCGCTGC CGCAGGACGA AATGTCGTCC GAGCAGATCG AGGATATCAT GTCGGCCATC
TCCGACATGG GCATCAACAT CGTCGAAAGC GACGAGGATG TGCAGGAAGA GGCCGAGCAG
GAGGTCGACG ACGAGGTCGA TGTTTCGGCA GGCACCGGGT CGGTTTCGAA CCCCGCGATC
GAAAAGAAGA AGGAAACGGT CGATCGCACC GACGATCCCG TGCGCATGTA TCTGCGCGAA
ATGGGTGCGG TCGAATTGCT GTCCCGCGAG GGTGAAATCG CGATCGCGAA GCGCATCGAG
GCGGGCCGCG ACACGATGAT CCTCGGGCTT TGCGAAAGCC CGCTGACCTT CAACGCGATC
ATCGAATGGT CGAACGCGCT CAACAATGGC GACATGCAGC TGCGCGAGAT CGTCGACCTC
GAAGCGATGC TGTCGAAAGA TCCGGCGCCT GAAAATCTCG ACGAGGAAGG CGCCGAGGAC
GGCGAGATCA GCGAAAAGAC CGCCGGCGTC TCGTTCAAGG ACGAGGATGA GGTCGAGGAA
GAGCCTGCCG CCGACGGCGA CGACGAGGAT GGCGAAGGCA CATCGGGCAA GCGTGAAAGC
TTCGACGACG ATGACGAGGA CAATACGCTG AGCCTCGCCG CGATGGAGGA ATTGCTCAAG
CCCGACGCGC TTGAGAAGTT CGCGAACATT ACCAAAAGCT TCAAGGCGTT CCAGAAGCTT
CAGGAAGCCC GGCTCGAAGC GCTGTCGAGT GGCGAGGAGT TTCCGGCGGC GTCGGAAAAG
AAATATCACA AGCTGCGCGA GGAACTCACC GCACAGGTCG AGAGCGTGCA GTTCCATGGC
ACCAAGATCG AATATCTGGT CGACCAGCTC TACAGCTACA ACCGCCGCCT GACCGCGCTC
GGCGGCCAGA TGCTGCGCCT TGCCGAGCGT CACAAGGTCC CGCGCAAGTC GTTCCTCGAC
CATTATGTCG GCCGCGAGCT CGAAGAAAAC TGGCTTGAGG AAGTCGCCGG CATCGACAAG
AAATGGGCGG CGTTCGCCGA GAATGAGGCC GCCGCGGTCG ATCGCATCCG CGTCGAGATC
AGCGAGATCG CGCAGGCCGC GGGCATGAGC CTGACCGAGT TCCGCCGCGT CGTGAACATG
GTGCAGAAGG GCGAGCGCGA GGCGCGCATC GCCAAGAAGG AAATGGTCGA GGCCAACCTG
CGCCTCGTCA TTTCGATCGC CAAGAAGTAC ACGAACCGCG GGCTGCAGTT CCTCGACCTC
ATTCAGGAAG GGAACATCGG CCTGATGAAG GCGGTCGACA AGTTCGAATA TCGCCGCGGC
TACAAGTTCA GCACCTATGC GACCTGGTGG ATCCGCCAGG CGATCACCCG CTCGATCGCC
GATCAGGCGC GTACGATCCG TATCCCCGTC CATATGATCG AGACGATCAA CAAGCTGGTG
CGCTGCAGCC GCCAGTTCCT CCACGAAAGC GGCCGCGAGC CGACCCCGGA GGAAATGGCC
GAGCGGCTGT CGATGCCGCT CGAAAAGGTC CGCAAGGTGA TGAAGATCGC CAAGGAGCCG
ATCAGCCTCG AAACGCCGAT CGGCGACGAG GAAGACAGCC ACCTCGGCGA TTTCATCGAG
GACAAGAATG CGGTGATACC GGTCGATGCC GCGGTGCAGT CGAACCTCAA GGAAACCGTC
ACCCGCGTCC TTGCATCGCT CACCCCGCGC GAGGAACGCG TGCTGCGTAT GCGCTTCGGC
ATCGGCATGA ACACCGACCA TACGCTCGAA GAAGTGGGTC AGCAGTTCAG CGTGACCCGC
GAACGCATCC GCCAGATCGA GGCAAAAGCC CTCCGCAAGC TCAAGCACCC GTCGCGGTCG
CGCAAGATGC GGTCGTTCCT TGACCAGTAG
 
Protein sequence
MATKNEADTD APLIDLNEAD VKKLIARGKK RGYLTYDELN AALPQDEMSS EQIEDIMSAI 
SDMGINIVES DEDVQEEAEQ EVDDEVDVSA GTGSVSNPAI EKKKETVDRT DDPVRMYLRE
MGAVELLSRE GEIAIAKRIE AGRDTMILGL CESPLTFNAI IEWSNALNNG DMQLREIVDL
EAMLSKDPAP ENLDEEGAED GEISEKTAGV SFKDEDEVEE EPAADGDDED GEGTSGKRES
FDDDDEDNTL SLAAMEELLK PDALEKFANI TKSFKAFQKL QEARLEALSS GEEFPAASEK
KYHKLREELT AQVESVQFHG TKIEYLVDQL YSYNRRLTAL GGQMLRLAER HKVPRKSFLD
HYVGRELEEN WLEEVAGIDK KWAAFAENEA AAVDRIRVEI SEIAQAAGMS LTEFRRVVNM
VQKGEREARI AKKEMVEANL RLVISIAKKY TNRGLQFLDL IQEGNIGLMK AVDKFEYRRG
YKFSTYATWW IRQAITRSIA DQARTIRIPV HMIETINKLV RCSRQFLHES GREPTPEEMA
ERLSMPLEKV RKVMKIAKEP ISLETPIGDE EDSHLGDFIE DKNAVIPVDA AVQSNLKETV
TRVLASLTPR EERVLRMRFG IGMNTDHTLE EVGQQFSVTR ERIRQIEAKA LRKLKHPSRS
RKMRSFLDQ