Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2527 |
Symbol | |
ID | 3916848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2730746 |
End bp | 2731807 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640445284 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_497797 |
Protein GI | 87200540 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.717834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTCA ACATCAAGAA CTGGCAGGAA CTGAAGAAGC CCAACAATCT CGAGATCAAG CCGGGCAACG ATTCCAAGCG CCGTGCGACT TTCGTCGCCG AGCCGCTTGA GCGTGGTTTT GGTCTCACGC TCGGCAACGC GCTGCGTCGC GTGCTGCTCT CCTCGCTTCA GGGCGCGGCG GTCACCTCGA TCAAGATCGA GAACGTTCTG CACGAATTCT CGTCGCTCGC CGGCGTGCGC GAGGACGTCA CCGACATCGT GCTCAACGTC AAGCAGATCG CGCTCAAGAT GCAGGGCGAA GGCCCCAAGC GTCTCCAGCT TTCGGCAACC GGTCCTGGCG AAGTCAAGGC CGGCGACATT GCCGTGACGG GCGACATCGA GGTCATGAAC AAGGACCTCG TGATCTGCCA TCTCGACGAA GGCGCGACCT TCAACATGGA ACTGACCGTC GATACCGGCA AGGGCTACGT CCCCGCCGTG TCGAACCGTC CGGTAGACGC TCCGATCGGT CTGATCCCGG TTGACTCGCT CTATTCGCCG GTCCGTCAGG TCTCCTACAA GGTCGAGAAC GCCCGCATCG GTCAGGAGCT TGACTACGAC AAGCTCAACC TGACGGTCGA AACCGACGGC ACCGTCACTC CGGAAGACGC CGTGGCCTAT GCCGCGCGCA TCCTTCAGGA CCAGCTTGCG CTGTTCGTCC ACTTCGACGA CCAGGTTCCG GTCGGCCACG TTCCGATGGT CGCGGGCGTT CCTGCCCACG CGCCGGAGGA AAGCGACGCG AACCAGCTCA ACCGCTACCT TCTCAAGAAG GTGGACGAGC TGGAACTGTC GGTTCGCTCG GCCAACTGCC TCAAGAACGA CAACATCATC TACATCGGCG ACCTGGTCCA GAAGACCGAA GCCGAGATGC TGCGCACGCC GAACTTCGGC CGCAAGTCGC TCAACGAGAT CAAGGAAGTT CTCTCCTCGA TGGGTCTGCG CCTCGGCATG GACATCCCCG GCTGGCCGCC GGAGAACATC GAAGAGATGG CCAAGAAGCT CGAACAAGAG CTTCTGGGCT AA
|
Protein sequence | MTVNIKNWQE LKKPNNLEIK PGNDSKRRAT FVAEPLERGF GLTLGNALRR VLLSSLQGAA VTSIKIENVL HEFSSLAGVR EDVTDIVLNV KQIALKMQGE GPKRLQLSAT GPGEVKAGDI AVTGDIEVMN KDLVICHLDE GATFNMELTV DTGKGYVPAV SNRPVDAPIG LIPVDSLYSP VRQVSYKVEN ARIGQELDYD KLNLTVETDG TVTPEDAVAY AARILQDQLA LFVHFDDQVP VGHVPMVAGV PAHAPEESDA NQLNRYLLKK VDELELSVRS ANCLKNDNII YIGDLVQKTE AEMLRTPNFG RKSLNEIKEV LSSMGLRLGM DIPGWPPENI EEMAKKLEQE LLG
|
| |