Gene Saro_0023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0023 
Symbol 
ID3916026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp19422 
End bp23705 
Gene Length4284 bp 
Protein Length1427 aa 
Translation table11 
GC content64% 
IMG OID640442748 
ProductDNA-directed RNA polymerase subunit beta' 
Protein accessionYP_495306 
Protein GI87198049 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.269767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACC TGACCAAATT CACCAACCAG ATCGCCAAGC CCGAGACCTT CGACCAGATC 
CAGATCGGTC TGGCGAGCCC CGAGCGCATC CGCAGCTGGT CCTTCGGCGA GATCAAGAAG
CCGGAAACGA TCAACTACCG CACGTTCAAG CCCGAGCGCG ACGGCCTGTT CTGCGCGCGC
ATCTTCGGTC CCGTGAAGGA CTACGAGTGC CTGTGCGGCA AGTACAAGCG CATGAAGTAC
AAGGGCGTCG TCTGCGAAAA GTGCGGCGTC GAAGTCACCG TCACCAAGGT GCGCCGCGAG
CGCATGGGCC ACATCGAGCT GGCCGCGCCG GTCGCCCACA TCTGGTTCCT CAAGTCGCTG
CCTTCGCGCA TCGGCCTGCT GCTCGACATG CAGCTCAAGC AGCTTGAGCG CATTCTCTAC
TTCGAAAGCT ACGTGGTCAT CGAGCCGGGC CTGACCCCGC TCGAAAAGTA CCAGCTCCTG
ACCGAGGACG AACTGCTCGA CGCGCAGGAC GAGTACGGCG AGGACGCCTT CTCGGCCGGC
ATCGGTGCCG AGGCCGTCAA GCACATGCTG ATGAACCTCG ACCTCGTGCA GGAAAAGGAA
GACCTGCTCC AGGAGCTGGC GACGACCAAG TCGGAGCTAA AGCCCAAGAA GATCATCAAG
CGCCTCAAGG TCGTGGAATC GTTCATCGAT TCGGGCAACC GCCCGGAATG GATGATCCTC
GACGTCGTGC CGGTCATTCC GCCAGAACTG CGCCCGCTGG TGCCGCTCGA CGGTGGCCGC
TTCGCGACCT CCGACCTGAA CGACCTCTAT CGCCGCGTCA TCAACCGCAA CAACCGCCTC
AAGCGCCTGA TCGAGCTGCG CGCGCCGGAC ATCATCGTCC GCAACGAAAA ACGCATGCTG
CAGGAGGCGG TTGACGCGCT GTTCGACAAC GGCCGTCGCG GCCGCGTGAT CACCGGCGCC
AACAAGCGTC CGCTCAAGTC GCTGTCCGAC ATGCTCAAGG GCAAGCAGGG CCGCTTCCGC
CAGAACCTGC TCGGCAAGCG CGTCGACTAT TCGGGCCGTT CGGTCATCGT GACCGGTCCC
GAACTCAAGC TGCACCAGTG CGGCCTGCCC AAGAAGATGG CGCTCGAGCT GTTCAAGCCG
TTCATCTACG CCCGCCTCGA TGCCAAGGGG CTGTCGATGA CCCTCAAGCA GGCCAAGAAG
TGGGTCGAGA AGGAGCGCAA GGAAGTCTGG GACATCCTTG ACGAGGTGAT CCGCGAGCAC
CCGGTCATGC TGAACCGTGC GCCCACGCTG CACCGTCTCG GCATCCAGGC CTTCGAACCG
GTCCTGATCG AAGGCAAGGC GATCCAGCTG CACCCGCTCG TCTGCTCGGC CTTCAACGCC
GACTTCGACG GTGACCAGAT GGCCGTCCAC GTGCCGCTTT CGCTGGAAGC CCAGCTCGAA
GCGCGCGTGC TGATGATGTC GACCAACAAC ATCCTCTCGC CCGCCAACGG CAAGCCGATC
ATCGTGCCTT CGCAGGACAT GGTTCTCGGC ATCTATTACC TGTCGATGGA CCGCGCTGGC
GAGCCGGGCG AAGGCATGAT GCTGGCCGAC ATGGCCGAAG TGCACCAGGC GCTGGAGGCC
AAGGCCGTCA CGCTGCACTC GAAGATCGTG GCCCGCGTGC CCCAGACCGA CGAGGACGGC
AACCAGTACC TCAAGCGGTT CGAGACCACG CCGGGCCGCA TGCTCATCGG CGAATGCCTG
CCCAAGAGCC ACAAGGTGCC GTTCGAGATC GTCAACCGCC TTCTCACCAA GAAGGAAATC
GGCGACGTCA TCGACCAGGT CTATCGCCAC ACCGGCCAGA AGGACACCGT GCTGTTCGCC
GACGCCATCA TGGCGCTGGG CTTCCGCCAC GCTTTCAAGG CCGGCATCTC GTTCGGCAAG
GATGACATGA TCATCCCGGA CAGCAAGGAC GCGCTGGTTG CCGAGACCAA GGAACTCGTT
GCCGACTACG AGCAGCAGTA CCAGGACGGT CTCATCACCC AGCAGGAAAA GTACAACAAG
GTGATCGACG CCTGGAGCCG CTGCGGCGAC CAGGTGGCGA ACGCCATGAT GGAAGAGCTT
AAGTCCTCGC CCATCGATCC GGAGACCGGT CGCCAGAAGC CGATCAACGC GATCTACATG
ATGTCGCACT CGGGCGCTCG TGGCTCCCCG GCGCAGATGA AGCAGCTCGC GGGCATGCGT
GGCCTCATGG CCAAGCCTTC GGGCGAGATC ATCGAGACCC CGATCATCTC GAACTTCAAG
GAAGGCCTGA CCGTCCTTGA ATACTTCAAC TCGACCCACG GCGCCCGAAA GGGCCTCGCG
GACACCGCGC TCAAGACCGC GAACTCGGGT TACCTGACCC GCCGTCTGGT CGACGTGTCG
CAGGACTGCG TCGTGATCGA GGAAGACTGC GGCACCGAAC GTGCGCTGGA AATGCGCGCG
ATCGTGCAGG GCGGCTCGAC CATCGCTTCG CTTGGCGAAC GCATCCTTGG CCGTACCCTT
GCCGAAGACC TGATCGAGGC GAAGTCGGGC GAAGTGATCG CGCAGAAGGG CGAACTGCTC
GACGAAGCCG CGATCGCGAA GATCGAGGCT GCCGGCGTGC AGTCGGCGCG CATCCGCTCG
CCGCTGGTTT GCGAAGCGAC CCAGGGCGTC TGCGGCAAGT GCTATGGCCG CGACCTCGCT
CGCGGTACGC CGGTTAACAT CGGTGAAGCT GTCGGCGTCA TCGCCGCGCA GTCGATCGGT
GAACCGGGCA CCCAGCTGAC CATGCGTACC TTCCACATCG GTGGCGCGGC GCAGGTCAAC
GAGCAGTCGC ACCTCGAGGC TATCAGCGAC GGTACCGTGC AGTACCGCGA CATCCCGACG
ATCACCGACA AGCGCGGCCG TCGTCTGTCG CTCGCCCGCA ACGGCGAGAT CGTCGTGATC
GATACCGAGG GTCGTGAGCG CGCGATCCAC CGCGTGCCCT ACGGTACGCA CCTGCTGCAC
GAGAACGGTG CGATCATCTC GCAGGGCGAC CGCCTGGCCG AGTGGGACCC GTTCACCACG
CCGGTGATCA CCGAAAAGCC CGGTATCGTC CGCTACCAGG ATCTTGTCGA CGGCAAGACC
CTGACCGAGC AGACCGACGA AGCCACCGGC ATGTCGAGCC GCGTGGTGAC CGAGAACCGC
GCTGCGGGTC GTGGCAAGAA GGAAGACCTC CGTCCGCGCC TGACCCTGCT CGACGAGAAC
TCGGGCGAGG CCGCACGCTA CATGATGGCG CCGGGCACGA CGCTGTCGGT CGAGGACGGC
CAGCAGGTGG AAGCGGGCGA CATCCTTGCC CGTGCCTCGC GCGAAGCCGC CAAGACCCGC
GACATCACCG GCGGTCTGCC GCGCGTTGCA GAGCTGTTCG AGGCCCGCAA GCCGAAGGAC
AACTCGATCA TCGCCAAGAT TGCGGGCCGC ATCGAGTTCG TGCGTGACTA CAAGGCCAAG
CGCAAGATCG CGATCATCCC GGAAGAGGGT GAACCGGTCG AGTACCTGGT GCCGAAGAGC
CGCGTGATCG ACGTGCAGGA AGGCGACTAC GTCAAGAAGG GCGACAACCT GATCTCGGGC
TCGCCCGATC CGCACGACAT CCTGGAAGTC ATGGGCGTCG AGGCTCTGGC CGAGTACCTC
GTCGCGGAAA TCCAGGAAGT CTACCGCTTG CAGGGCGTGA AGATCAACGA CAAGCACATC
GAGGTGATCG TTCGCCAGAT GCTGCAGAAG GTCGAGATCA CCAACGGTGG CGACACCACC
CTGCTGCCGG GCGAACAGGT CGACCTCGAG GAGATGCTCG AAACCAACGG CAAGCTCGAG
GAAGGCCAGC AGCCTGCCGA AGGCAAGCCA GTGCTGCTCG GCATCACCAA GGCCTCGTTG
CAGACGCGTT CGTTCATCTC GGCGGCGTCG TTCCAGGAGA CCACCCGCGT GCTCACGCAG
GCGGCCGTGG AAGGCAAGAA GGACTCGCTG ATCGGCCTCA AGGAGAACGT GATCGTCGGC
CGTCTCATCC CCGCCGGTAC CGGCGCGGGC ATGAACCGCA TGCGCGTCGC AGCCTCCAGC
CGCGATGCTG CCCTGCGCGC CTCCTACCGC AAGCTCCAGG AATCGCTGAT CGCTCCGGCG
ACGGCGGCTG AAGAGCACGC GGCGGAACTC GCCCAGGGTC CCGAAGCAGC GATCGGCGAC
GATCCGCTGG CGACGGTCGA GGGCGAGACC CACGGCACCG ATGCCGACGC GGGCGACTAC
CTGATCGAGG GTGACGAAGC CTGA
 
Protein sequence
MNDLTKFTNQ IAKPETFDQI QIGLASPERI RSWSFGEIKK PETINYRTFK PERDGLFCAR 
IFGPVKDYEC LCGKYKRMKY KGVVCEKCGV EVTVTKVRRE RMGHIELAAP VAHIWFLKSL
PSRIGLLLDM QLKQLERILY FESYVVIEPG LTPLEKYQLL TEDELLDAQD EYGEDAFSAG
IGAEAVKHML MNLDLVQEKE DLLQELATTK SELKPKKIIK RLKVVESFID SGNRPEWMIL
DVVPVIPPEL RPLVPLDGGR FATSDLNDLY RRVINRNNRL KRLIELRAPD IIVRNEKRML
QEAVDALFDN GRRGRVITGA NKRPLKSLSD MLKGKQGRFR QNLLGKRVDY SGRSVIVTGP
ELKLHQCGLP KKMALELFKP FIYARLDAKG LSMTLKQAKK WVEKERKEVW DILDEVIREH
PVMLNRAPTL HRLGIQAFEP VLIEGKAIQL HPLVCSAFNA DFDGDQMAVH VPLSLEAQLE
ARVLMMSTNN ILSPANGKPI IVPSQDMVLG IYYLSMDRAG EPGEGMMLAD MAEVHQALEA
KAVTLHSKIV ARVPQTDEDG NQYLKRFETT PGRMLIGECL PKSHKVPFEI VNRLLTKKEI
GDVIDQVYRH TGQKDTVLFA DAIMALGFRH AFKAGISFGK DDMIIPDSKD ALVAETKELV
ADYEQQYQDG LITQQEKYNK VIDAWSRCGD QVANAMMEEL KSSPIDPETG RQKPINAIYM
MSHSGARGSP AQMKQLAGMR GLMAKPSGEI IETPIISNFK EGLTVLEYFN STHGARKGLA
DTALKTANSG YLTRRLVDVS QDCVVIEEDC GTERALEMRA IVQGGSTIAS LGERILGRTL
AEDLIEAKSG EVIAQKGELL DEAAIAKIEA AGVQSARIRS PLVCEATQGV CGKCYGRDLA
RGTPVNIGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQVN EQSHLEAISD GTVQYRDIPT
ITDKRGRRLS LARNGEIVVI DTEGRERAIH RVPYGTHLLH ENGAIISQGD RLAEWDPFTT
PVITEKPGIV RYQDLVDGKT LTEQTDEATG MSSRVVTENR AAGRGKKEDL RPRLTLLDEN
SGEAARYMMA PGTTLSVEDG QQVEAGDILA RASREAAKTR DITGGLPRVA ELFEARKPKD
NSIIAKIAGR IEFVRDYKAK RKIAIIPEEG EPVEYLVPKS RVIDVQEGDY VKKGDNLISG
SPDPHDILEV MGVEALAEYL VAEIQEVYRL QGVKINDKHI EVIVRQMLQK VEITNGGDTT
LLPGEQVDLE EMLETNGKLE EGQQPAEGKP VLLGITKASL QTRSFISAAS FQETTRVLTQ
AAVEGKKDSL IGLKENVIVG RLIPAGTGAG MNRMRVAASS RDAALRASYR KLQESLIAPA
TAAEEHAAEL AQGPEAAIGD DPLATVEGET HGTDADAGDY LIEGDEA