Gene Saro_0297 details

Gene Information

Locus tag: Saro_0297
Symbol:
ID: 3916234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 319545
End bp: 321041
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 68%
IMG OID: 640443026
Product: RNA polymerase factor sigma-54
Protein accession: YP_495579
Protein GI: 87198322
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
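
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 319545
END_BP = 321041


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1497, matches "Gene Length"
print(protein_length(length_bp))  # 498, matches "Protein Length"
```

Under these assumptions, both derived values agree with the listed gene length and protein length.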


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCTCG GGCCGAGGCT CGATATCCGG CAGACCCAGT CGCTGGTGAT GACGCCGCAG 
CTCCAGCAGG CGATCAAGCT GCTGGCGTTG TCGAACCTCG AGGTCGAGGC CTTCATCGGC
GAGGCGCTGG AGGCCAATCC GCTGCTCGAA ATCGGTGAGA CGGCACCCGC CGAGGCGGTC
GACGCCGGCC CCGAAGACCT GCGCCGCACG CATCTCGAAT CCTCGCCGGT AGACCAGCTC
GTTGCCGAGG GGCGGGTGGA GGAGGATCGC CCGCTCGACA TCGATGTCAC TGCGCTCGAC
CGCGATCGGG ACACCGGGGA TGGCGATTTC GGGGGCGGCA CGCTCGAGTT GTCCTCGACC
CGCGAAAGTG GCGGCGGCGA GGGGCCGGAC ATCGACGAAC GGGGGCGCAT CGAGGAAACG
CTTGCCGAAC ATCTCCACGC CCAGATCGGC GCGACGACCT CGGACGCGCA GCTCCTCTTC
GTCGCCCGCT GGCTGATCGA CCAGCTCGAC GAGGCAGGAT ATCTCGCCAT GCCGATTGGC
GAAGTGGCCG AGGCGCTGGG CCTTTCGCCG CTGGTGGTCG AGCGGGCGCT GGCTCTCGTC
CAGTCGCTCG ACCCGACCGG GGTCGGCGCA CGCAACCTGG CGGAATGCAT CGCGCTCCAG
GCCCGGGAGG CGGACCGCTA TGATCCGTGC ATGGCGCGGC TGATCGACAA TCTGGAACTT
GTTGCGCGCG GCGAGATCGC GCGGCTGAAA CGCCTGTGCC AGGTCGACGA CGAGGACTTT
GCCGACATGC TGGCCGAGCT GCGCGGCTAC GACCCGCGCC CCGGCCTGCG CTTCGGCGGG
GGCGCGGCGG AGCCGGTCGT GCCCGATATC CTGGTGCGCG CGGCAAAGGG CGGCTGGGAC
ATCGCGCTCA ATCAGGCGAC CCTGCCGCGC CTCGTCGTCA ATCGCAGCTA CTACGTGGAG
ATGCGCGGGG CCTGTGTCGG CAAGGAGGCC AAGGCCTGGC TGGGGGAGAA GCTGGCCGAC
GCGAACTGGC TGCTGAAGGC GCTCGACCAG CGGCAGAAGA CCATCCTCAA GGTCGCGGCC
GAGATCGTGA AGCAGCAGGA CGGCTTCTTC CGGCACGGCG TCGCGCACTT GCGCCCGTTG
ACGCTGAAGA CCGTGGCCGA AGCGATATCG ATGCATGAAT CGACCGTCAG CCGCGTGACT
TCGAACAAGT ACCTCCATTG CGACCGGGGT ACCTTCGAGC TGAAGTATTT CTTCACTTCG
GGCGTCGGCT CTTCCGACGG TGAGGGCGCT TCGGCCGCGG CGGTGAAGGC TGCGATCCGC
CAGCTCATCG ATGCCGAGGA CCCCAAGGCA ATCCTTTCGG ACGATGCCCT GGTCGATCTG
CTCAAGGCGC GGGGCTTCGA CCTTGCCCGG CGCACGGTCG CCAAGTACCG CGAGGCGATC
GGGCTCGGAA GTTCGGTCCA GCGCCGCCGC CAGAAAACAC TCGCCGGGGT GCGCTGA
 
Protein sequence
MALGPRLDIR QTQSLVMTPQ LQQAIKLLAL SNLEVEAFIG EALEANPLLE IGETAPAEAV 
DAGPEDLRRT HLESSPVDQL VAEGRVEEDR PLDIDVTALD RDRDTGDGDF GGGTLELSST
RESGGGEGPD IDERGRIEET LAEHLHAQIG ATTSDAQLLF VARWLIDQLD EAGYLAMPIG
EVAEALGLSP LVVERALALV QSLDPTGVGA RNLAECIALQ AREADRYDPC MARLIDNLEL
VARGEIARLK RLCQVDDEDF ADMLAELRGY DPRPGLRFGG GAAEPVVPDI LVRAAKGGWD
IALNQATLPR LVVNRSYYVE MRGACVGKEA KAWLGEKLAD ANWLLKALDQ RQKTILKVAA
EIVKQQDGFF RHGVAHLRPL TLKTVAEAIS MHESTVSRVT SNKYLHCDRG TFELKYFFTS
GVGSSDGEGA SAAAVKAAIR QLIDAEDPKA ILSDDALVDL LKARGFDLAR RTVAKYREAI
GLGSSVQRRR QKTLAGVR
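
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is only an illustration, under these assumptions: the codon-to-amino-acid assignments of translation table 11 are taken to be the same as the standard code (table 11 differs mainly in which codons may act as starts, which does not matter here because this gene begins with ATG), and the sequence blocks are read as plain text with spaces and line breaks between groups of ten characters. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
FIRST_LINE = "ATGGCGCTCG GGCCGAGGCT CGATATCCGG CAGACCCAGT CGCTGGTGAT GACGCCGCAG"
print(translate(FIRST_LINE))  # MALGPRLDIRQTQSLVMTPQ
```

The printed residues correspond to the first two groups of the protein sequence (MALGPRLDIR QTQSLVMTPQ). Passing the full gene sequence block to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.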