Gene Saro_3045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3045 
Symbol 
ID3916657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3260186 
End bp3261085 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content63% 
IMG OID640445825 
ProductRNA polymerase factor sigma-32 
Protein accessionYP_498314 
Protein GI87201057 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTGAGG AAAAGACCGC GCTGACCGTC CCCGCGCTGG GTGGGGAAGC CAGCCTCAAC 
CGCTATCTCT CGGAAATCCG CAAGTTTCCC GTGCTGACGG CAGAGCAGGA ATACATGCTC
GCCAAGCGCT TTCAGGAACA TCAGGACCCC GAGGCCGCCG CACAACTGGT GACGAGCCAC
CTGCGGCTCG TGGCGAAGAT CGCCATGGGC TATCGCGGCT ATGGCCTGCC GGTGAGCGAG
CTGATCAGCG AGGGCAACAT CGGCCTGATG CAGGGCGTCA AGAAGTTCGA GCCGGACCGG
GGTTTCCGTC TGGCGACTTA CGCGATGTGG TGGATCAAGG CCTCGATGCA GGAATTCATC
CTGCGCAGCT GGTCGCTCGT GAAGATGGGC ACCACCGCCG CGCAGAAGAA GCTGTTCTTC
AACCTGCGGC GAATGAAGAA GAACCTCGAG GCTTTCGAGG ATTCCGACCT TCATCCCGAC
GACGTGAGGA AGATCGCGAC CGACCTCGGC GTACCCGAGC AGGAAGTGGT CAACATGAAC
CGGCGCATGA TGATGGGCGG CGATGCGTCG CTCAACGTCT CGATGCGCGA GGACGGCGAA
GGATCGTGGC AGGACTGGTT GACGGACGAC CGTCCGCTCC AGGATGAAAC CGTGGCCGAC
GCCGAGGAAG CGCAGTATCG CCACGAACTG CTGGTCGAGG CGATGGAAAG CCTCAACGAG
CGCGAGCGCC ACATCCTGAC CGACCGCAGG CTGATCGACG ATCCCAAGAC GCTCGAGGAA
CTGAGCCAGG TCTACAACGT CAGCCGCGAA CGCGTGCGTC AGATCGAGGT GCGCGCCTTC
GAGAAGCTGC AGAAGGCGAT CCAGCGCATC GCGGTGGAGC GCAAGCTCCT GCCGGCATAA
 
Protein sequence
MSEEKTALTV PALGGEASLN RYLSEIRKFP VLTAEQEYML AKRFQEHQDP EAAAQLVTSH 
LRLVAKIAMG YRGYGLPVSE LISEGNIGLM QGVKKFEPDR GFRLATYAMW WIKASMQEFI
LRSWSLVKMG TTAAQKKLFF NLRRMKKNLE AFEDSDLHPD DVRKIATDLG VPEQEVVNMN
RRMMMGGDAS LNVSMREDGE GSWQDWLTDD RPLQDETVAD AEEAQYRHEL LVEAMESLNE
RERHILTDRR LIDDPKTLEE LSQVYNVSRE RVRQIEVRAF EKLQKAIQRI AVERKLLPA