Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0023 |
Symbol | |
ID | 3916026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 19422 |
End bp | 23705 |
Gene Length | 4284 bp |
Protein Length | 1427 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640442748 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_495306 |
Protein GI | 87198049 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.269767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACC TGACCAAATT CACCAACCAG ATCGCCAAGC CCGAGACCTT CGACCAGATC CAGATCGGTC TGGCGAGCCC CGAGCGCATC CGCAGCTGGT CCTTCGGCGA GATCAAGAAG CCGGAAACGA TCAACTACCG CACGTTCAAG CCCGAGCGCG ACGGCCTGTT CTGCGCGCGC ATCTTCGGTC CCGTGAAGGA CTACGAGTGC CTGTGCGGCA AGTACAAGCG CATGAAGTAC AAGGGCGTCG TCTGCGAAAA GTGCGGCGTC GAAGTCACCG TCACCAAGGT GCGCCGCGAG CGCATGGGCC ACATCGAGCT GGCCGCGCCG GTCGCCCACA TCTGGTTCCT CAAGTCGCTG CCTTCGCGCA TCGGCCTGCT GCTCGACATG CAGCTCAAGC AGCTTGAGCG CATTCTCTAC TTCGAAAGCT ACGTGGTCAT CGAGCCGGGC CTGACCCCGC TCGAAAAGTA CCAGCTCCTG ACCGAGGACG AACTGCTCGA CGCGCAGGAC GAGTACGGCG AGGACGCCTT CTCGGCCGGC ATCGGTGCCG AGGCCGTCAA GCACATGCTG ATGAACCTCG ACCTCGTGCA GGAAAAGGAA GACCTGCTCC AGGAGCTGGC GACGACCAAG TCGGAGCTAA AGCCCAAGAA GATCATCAAG CGCCTCAAGG TCGTGGAATC GTTCATCGAT TCGGGCAACC GCCCGGAATG GATGATCCTC GACGTCGTGC CGGTCATTCC GCCAGAACTG CGCCCGCTGG TGCCGCTCGA CGGTGGCCGC TTCGCGACCT CCGACCTGAA CGACCTCTAT CGCCGCGTCA TCAACCGCAA CAACCGCCTC AAGCGCCTGA TCGAGCTGCG CGCGCCGGAC ATCATCGTCC GCAACGAAAA ACGCATGCTG CAGGAGGCGG TTGACGCGCT GTTCGACAAC GGCCGTCGCG GCCGCGTGAT CACCGGCGCC AACAAGCGTC CGCTCAAGTC GCTGTCCGAC ATGCTCAAGG GCAAGCAGGG CCGCTTCCGC CAGAACCTGC TCGGCAAGCG CGTCGACTAT TCGGGCCGTT CGGTCATCGT GACCGGTCCC GAACTCAAGC TGCACCAGTG CGGCCTGCCC AAGAAGATGG CGCTCGAGCT GTTCAAGCCG TTCATCTACG CCCGCCTCGA TGCCAAGGGG CTGTCGATGA CCCTCAAGCA GGCCAAGAAG TGGGTCGAGA AGGAGCGCAA GGAAGTCTGG GACATCCTTG ACGAGGTGAT CCGCGAGCAC CCGGTCATGC TGAACCGTGC GCCCACGCTG CACCGTCTCG GCATCCAGGC CTTCGAACCG GTCCTGATCG AAGGCAAGGC GATCCAGCTG CACCCGCTCG TCTGCTCGGC CTTCAACGCC GACTTCGACG GTGACCAGAT GGCCGTCCAC GTGCCGCTTT CGCTGGAAGC CCAGCTCGAA GCGCGCGTGC TGATGATGTC GACCAACAAC ATCCTCTCGC CCGCCAACGG CAAGCCGATC ATCGTGCCTT CGCAGGACAT GGTTCTCGGC ATCTATTACC TGTCGATGGA CCGCGCTGGC GAGCCGGGCG AAGGCATGAT GCTGGCCGAC ATGGCCGAAG TGCACCAGGC GCTGGAGGCC AAGGCCGTCA CGCTGCACTC GAAGATCGTG GCCCGCGTGC CCCAGACCGA CGAGGACGGC AACCAGTACC TCAAGCGGTT CGAGACCACG CCGGGCCGCA TGCTCATCGG CGAATGCCTG CCCAAGAGCC ACAAGGTGCC GTTCGAGATC GTCAACCGCC TTCTCACCAA GAAGGAAATC GGCGACGTCA TCGACCAGGT CTATCGCCAC ACCGGCCAGA AGGACACCGT GCTGTTCGCC GACGCCATCA TGGCGCTGGG CTTCCGCCAC GCTTTCAAGG CCGGCATCTC GTTCGGCAAG GATGACATGA TCATCCCGGA CAGCAAGGAC GCGCTGGTTG CCGAGACCAA GGAACTCGTT GCCGACTACG AGCAGCAGTA CCAGGACGGT CTCATCACCC AGCAGGAAAA GTACAACAAG GTGATCGACG CCTGGAGCCG CTGCGGCGAC CAGGTGGCGA ACGCCATGAT GGAAGAGCTT AAGTCCTCGC CCATCGATCC GGAGACCGGT CGCCAGAAGC CGATCAACGC GATCTACATG ATGTCGCACT CGGGCGCTCG TGGCTCCCCG GCGCAGATGA AGCAGCTCGC GGGCATGCGT GGCCTCATGG CCAAGCCTTC GGGCGAGATC ATCGAGACCC CGATCATCTC GAACTTCAAG GAAGGCCTGA CCGTCCTTGA ATACTTCAAC TCGACCCACG GCGCCCGAAA GGGCCTCGCG GACACCGCGC TCAAGACCGC GAACTCGGGT TACCTGACCC GCCGTCTGGT CGACGTGTCG CAGGACTGCG TCGTGATCGA GGAAGACTGC GGCACCGAAC GTGCGCTGGA AATGCGCGCG ATCGTGCAGG GCGGCTCGAC CATCGCTTCG CTTGGCGAAC GCATCCTTGG CCGTACCCTT GCCGAAGACC TGATCGAGGC GAAGTCGGGC GAAGTGATCG CGCAGAAGGG CGAACTGCTC GACGAAGCCG CGATCGCGAA GATCGAGGCT GCCGGCGTGC AGTCGGCGCG CATCCGCTCG CCGCTGGTTT GCGAAGCGAC CCAGGGCGTC TGCGGCAAGT GCTATGGCCG CGACCTCGCT CGCGGTACGC CGGTTAACAT CGGTGAAGCT GTCGGCGTCA TCGCCGCGCA GTCGATCGGT GAACCGGGCA CCCAGCTGAC CATGCGTACC TTCCACATCG GTGGCGCGGC GCAGGTCAAC GAGCAGTCGC ACCTCGAGGC TATCAGCGAC GGTACCGTGC AGTACCGCGA CATCCCGACG ATCACCGACA AGCGCGGCCG TCGTCTGTCG CTCGCCCGCA ACGGCGAGAT CGTCGTGATC GATACCGAGG GTCGTGAGCG CGCGATCCAC CGCGTGCCCT ACGGTACGCA CCTGCTGCAC GAGAACGGTG CGATCATCTC GCAGGGCGAC CGCCTGGCCG AGTGGGACCC GTTCACCACG CCGGTGATCA CCGAAAAGCC CGGTATCGTC CGCTACCAGG ATCTTGTCGA CGGCAAGACC CTGACCGAGC AGACCGACGA AGCCACCGGC ATGTCGAGCC GCGTGGTGAC CGAGAACCGC GCTGCGGGTC GTGGCAAGAA GGAAGACCTC CGTCCGCGCC TGACCCTGCT CGACGAGAAC TCGGGCGAGG CCGCACGCTA CATGATGGCG CCGGGCACGA CGCTGTCGGT CGAGGACGGC CAGCAGGTGG AAGCGGGCGA CATCCTTGCC CGTGCCTCGC GCGAAGCCGC CAAGACCCGC GACATCACCG GCGGTCTGCC GCGCGTTGCA GAGCTGTTCG AGGCCCGCAA GCCGAAGGAC AACTCGATCA TCGCCAAGAT TGCGGGCCGC ATCGAGTTCG TGCGTGACTA CAAGGCCAAG CGCAAGATCG CGATCATCCC GGAAGAGGGT GAACCGGTCG AGTACCTGGT GCCGAAGAGC CGCGTGATCG ACGTGCAGGA AGGCGACTAC GTCAAGAAGG GCGACAACCT GATCTCGGGC TCGCCCGATC CGCACGACAT CCTGGAAGTC ATGGGCGTCG AGGCTCTGGC CGAGTACCTC GTCGCGGAAA TCCAGGAAGT CTACCGCTTG CAGGGCGTGA AGATCAACGA CAAGCACATC GAGGTGATCG TTCGCCAGAT GCTGCAGAAG GTCGAGATCA CCAACGGTGG CGACACCACC CTGCTGCCGG GCGAACAGGT CGACCTCGAG GAGATGCTCG AAACCAACGG CAAGCTCGAG GAAGGCCAGC AGCCTGCCGA AGGCAAGCCA GTGCTGCTCG GCATCACCAA GGCCTCGTTG CAGACGCGTT CGTTCATCTC GGCGGCGTCG TTCCAGGAGA CCACCCGCGT GCTCACGCAG GCGGCCGTGG AAGGCAAGAA GGACTCGCTG ATCGGCCTCA AGGAGAACGT GATCGTCGGC CGTCTCATCC CCGCCGGTAC CGGCGCGGGC ATGAACCGCA TGCGCGTCGC AGCCTCCAGC CGCGATGCTG CCCTGCGCGC CTCCTACCGC AAGCTCCAGG AATCGCTGAT CGCTCCGGCG ACGGCGGCTG AAGAGCACGC GGCGGAACTC GCCCAGGGTC CCGAAGCAGC GATCGGCGAC GATCCGCTGG CGACGGTCGA GGGCGAGACC CACGGCACCG ATGCCGACGC GGGCGACTAC CTGATCGAGG GTGACGAAGC CTGA
|
Protein sequence | MNDLTKFTNQ IAKPETFDQI QIGLASPERI RSWSFGEIKK PETINYRTFK PERDGLFCAR IFGPVKDYEC LCGKYKRMKY KGVVCEKCGV EVTVTKVRRE RMGHIELAAP VAHIWFLKSL PSRIGLLLDM QLKQLERILY FESYVVIEPG LTPLEKYQLL TEDELLDAQD EYGEDAFSAG IGAEAVKHML MNLDLVQEKE DLLQELATTK SELKPKKIIK RLKVVESFID SGNRPEWMIL DVVPVIPPEL RPLVPLDGGR FATSDLNDLY RRVINRNNRL KRLIELRAPD IIVRNEKRML QEAVDALFDN GRRGRVITGA NKRPLKSLSD MLKGKQGRFR QNLLGKRVDY SGRSVIVTGP ELKLHQCGLP KKMALELFKP FIYARLDAKG LSMTLKQAKK WVEKERKEVW DILDEVIREH PVMLNRAPTL HRLGIQAFEP VLIEGKAIQL HPLVCSAFNA DFDGDQMAVH VPLSLEAQLE ARVLMMSTNN ILSPANGKPI IVPSQDMVLG IYYLSMDRAG EPGEGMMLAD MAEVHQALEA KAVTLHSKIV ARVPQTDEDG NQYLKRFETT PGRMLIGECL PKSHKVPFEI VNRLLTKKEI GDVIDQVYRH TGQKDTVLFA DAIMALGFRH AFKAGISFGK DDMIIPDSKD ALVAETKELV ADYEQQYQDG LITQQEKYNK VIDAWSRCGD QVANAMMEEL KSSPIDPETG RQKPINAIYM MSHSGARGSP AQMKQLAGMR GLMAKPSGEI IETPIISNFK EGLTVLEYFN STHGARKGLA DTALKTANSG YLTRRLVDVS QDCVVIEEDC GTERALEMRA IVQGGSTIAS LGERILGRTL AEDLIEAKSG EVIAQKGELL DEAAIAKIEA AGVQSARIRS PLVCEATQGV CGKCYGRDLA RGTPVNIGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQVN EQSHLEAISD GTVQYRDIPT ITDKRGRRLS LARNGEIVVI DTEGRERAIH RVPYGTHLLH ENGAIISQGD RLAEWDPFTT PVITEKPGIV RYQDLVDGKT LTEQTDEATG MSSRVVTENR AAGRGKKEDL RPRLTLLDEN SGEAARYMMA PGTTLSVEDG QQVEAGDILA RASREAAKTR DITGGLPRVA ELFEARKPKD NSIIAKIAGR IEFVRDYKAK RKIAIIPEEG EPVEYLVPKS RVIDVQEGDY VKKGDNLISG SPDPHDILEV MGVEALAEYL VAEIQEVYRL QGVKINDKHI EVIVRQMLQK VEITNGGDTT LLPGEQVDLE EMLETNGKLE EGQQPAEGKP VLLGITKASL QTRSFISAAS FQETTRVLTQ AAVEGKKDSL IGLKENVIVG RLIPAGTGAG MNRMRVAASS RDAALRASYR KLQESLIAPA TAAEEHAAEL AQGPEAAIGD DPLATVEGET HGTDADAGDY LIEGDEA
|
| |