Gene Information |
Locus tag | Haur_0242 |
Symbol | |
ID | 5732137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 276873 |
End bp | 281504 |
Gene Length | 4632 bp |
Protein Length | 1543 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277366 |
Product | DNA-directed RNA polymerase, beta' subunit |
Protein accession | YP_001543022 |
Protein GI | 159896775 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
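The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon. Both readings are assumptions here, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length_bp = gene_length(276873, 281504)
print(length_bp)                  # 4632
print(protein_length(length_bp))  # 1543
```

Because the gene is on the minus strand, the coding sequence would be the reverse complement of this interval on the replicon; the length calculation is unaffected.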
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTCGAAA TCAATGAATT CAATGCAATT CGTATCAGTC TCGCTTCACC AGAAGATATC CTGAGCTGGT CGCACGGTGA AGTAACCAAA CCAGAGACGA TTAATTATAG AACTCTGCGT CCTGAACGCG ATGGCTTGTT CTGTGAAAAA ATCTTCGGAC CTACCCGCGA TTGGGAATGT TACTGTGGGA AGTACAAACG TGTGCGTTAT AAGGGCATTG TATGCGATAA ATGTGGCGTA GAAGTCACCC GATCCAAAGT GCGACGCGAT CGCATGGGTC ATATCAAACT TGCCTCGCCT GTATCGCATA TTTGGTTTGT TAAAGGCACA CCATCGCGTT TGGGTTTACT TTTAGATATC TCGCCACGTA ATCTTGAGCG AGTGCTTTAT TTTGCCTCGT ATATTATCAC CGATGTCGAT GATCTGGCAT TGGGCAATGT ACGTGAGCAA ATGAAGACCG ACTTTGGCGT GCGGCGCAAA GATCTCGAAG AAAAAATTAT CGAACAACGT GGCGAAAAGG CAACCCGTTT AAGCAAAGAC CTTGCGGCGA TGGATAATGC CATGGAAGGC ACGTTGGAGC GAACGCATGA ACAATTTGCC CGCCAACGCC AAGAAATCGA AGATGAAGCC AATGCATTGC GCGAACACCT CGAAGAACTG ATCGGCATCG ACGCACTAGC CGATGAAGAT ATTGTTTATC GTGGCACGGT GCTGCTCGAA GAAAACGAGC CAGTGCGCGA ACGCAGCTTG GAACAGCTCG AACAACTGAT CGATCAAGAG CGCGAAAAGC TTGAACAACG CCGCGAATAC GAGATTGAAA ACGTGCGCTT GTTGGCCGAC GGCGAGCGTG ATCAACGCCA ATCGGTGGCT GATGCTGAAC AAGAACGGCT AACCACGGCC TTGATCAAAC AGCTTGAAGA TCTCAACAAA GAAGAAAAAG ACAAGCTTGA TCGCTTGGAT GATATTCAGT TGCACCGGAT TATTTCGGAA AACGAATATC GGATTCTGCG TGATTTAGCG CCCTACACCT TCAAGGCTGA TATGGGTGCT GGCGCGGTGC GCGACATCGT ATCGATGGTT GATCTCGACG AATTGTCGAA CCAAATGCAA GCCGAAGTCC AAAGCTCATC GGGTCAACGC CGCAAGAAAG CCACCAAACG GCTGCGCGTG GTTGAAGCAT TCCGCAAGAG CGGCAATCGT CCTGAATGGA TGATTATGAC CGTCTTGCCA GTGATTCCAC CAGATTTGCG CCCGATGGTC CAGCTTGATG GTGGTCGGTT TGCGACCTCT GACTTGAACG ACCTGTATCG ACGGGTGATC AACCGCAACA ACCGGCTCAA GCGCTTGATG GAGTTGAACG CTCCTGAAAT CATCGTGCGC AACGAAAAAC GTATGTTGCA AGAAGCGGTT GATGCCTTGA TCGACAACGG TCGCCGTGGC CGCCCAGTCA GTGGCAAGGG CAAGCATCGC CTTAAGAGCC TGAGCGATAT GCTCAAAGGC AAGCAAGGCC GCTTCCGCCA AAACCTCTTG GGTAAGCGGG TTGACTACTC AGGCCGTTCG GTGATTGTGG TCGGACCAAC CCTGCAATTG CACCAATGTG GTTTGCCCAA GAAAATGGCC TTGGAATTGT TCAAGCCATT CGTCATGCGC CGCTTGGTCG ATAAAGGCTT TGCCCACAAC ATCAAATCAG CCAAGCGCTT CGTTGAGCGG GTTCGCCCCG AAGTGTGGGA TGTGCTCGAA GAAGTCATCA AAGACTACTT GGTATTGCTG AACCGTGCGC CTTCGCTGCA CCGTTTGTCG 
ATTCAAGCCT TTGAAGCCAA GCTGATCGAA GGCTCGGCGA TTCAATTGCA CCCCTTGGTG TGTGCCGCAT TCAACGCCGA CTTTGACGGC GACCAAATGG CTGTGCACGT ACCATTGTCG CGCAAAGCCC AAGAAGAAGC CCGTCGTCGT ATGATTTCAA CCTACAACCT GTTGTCACCG GCAACTGGCG ACCCAATTAT TACGCCATCG CAAGACATCG TGTTGGGTTG TTTCTACCTG ACCCAAGTTC GACCTGGGGC TAAGGGTGGC GGCAAGCGCT TTGGGTCAAT TGACGAAGCT TTGTTGGCCT ACACCAATGG TGTTGTACAT ATTCAAGCAC CTGTTTGGAT TGTGATCGAA GATTACATTC TGCCAGGCAG CGATTTGCGT GAAAAGGAAT TGCCATCGTT GGATGGCGTT ACGCCACGCG TTTTGATCGA AACCAGCGTT GGGCGGATCA TCTTCAACAA TGCGTTGCGC TACCAAGGCG AGCCAAAAGC TGGCGAAAAC GGCTATCGCT CACCATTGCA CTATCGCAAC TTCTTGGTTG GCAAATCGGG ATTGAAAGCC TTGATCGCCG ACTGCTATCG CTTCCACTCA CAGCGCGAGA ACATCATCGC CGACGTGTAT CAAGAATTAA TTGAACGTTT CGGCCCTGAA ACCTCGGAAG AATCGCTGTT GCGTTTCTAT GCCTCAGAAC GGACGGCGCG TTTGGCCGAC CGGATCAAGG CCTTGGGCTT CAAGCACGCG ACCTTGGGCG GGATGACCTT CTCGGCTTCG GACGTAGAAG TGCCTGATAC CAAAGATGCA ATTGTGCAAG AAACCTACAA GAAAGTGCAA GACATCGAAA AGATGCAACG TCGTGGTTTG ATTACCGACG ACGAGCGCTA TCGTGAAGTG GTTACGGCGT GGCTCGATGC TACCAACCAA ATCAAGGTTG AAGTTCAACG CTCATTGAAT CCATTCGGGC CAGTTTCGAT GATGTCAACC TCGGGTGCGC GTGGTAACGT CGAGCAAATT CGCCAGATGG CGGGGATGCG GGGCTTGACC ACTGACCCAA CCGGACGGAT TATCGAATTG CCGATTACCG CCAACTTCCG CGAAGGCTTG AGCGTTATCG AGTACTTCAT CTCAACCCAC GGTGGTCGGA AAGGTTTGGC TGATACCGCC TTGCGGACAG CCGACGCTGG TTACTTGACC CGCCGTTTGG TGGACGTGGC CCAAGATGTG ATCGTGACGA TCGAAGATTG TGGCACAACC GAAGGTATGT GGATTCGGGT CAGTCGCGAT ATTTTGGCCT CGGTCGAAGA TCGGATCGTT GGCCGCGTGA CGGTGGCTCC GGTCACTAAT CCCGTGACTG GTGAAGTCAT TTTCGATACC GATAGCGAAA TCTTAGAAGA CGATGGCAAG CACATTGCGA ATGTGCTGAA ATCGTTGGAC AAAGAGGCTC AAGCTGAATT TGGCATTTAT GTCCGTTCGG TACTGACCTG TAACGCCGAT TATGGCATTT GTCGCAAGTG CTATGGCCGC AACCTTGCCA CTGGCAAGAT GGTCGAAATT GGCGAAGCTG TCGGGATTAT CGCGGCGCAA TCGATTGGTG AGCCAGGGAC GCAGCTGACC CTGCGGACGT TCCACACTGG TGGTGTGGCA ACTGATACCG ACATCACCCA AGGTTTGCCA CGGGTGCAAG AAATCTTCGA AGCCCGGATT CCGAAAGGGA AAGCGGTGCT GGCTCAGATT GCTGGCCGTG TCCAGATTGT GCGCGAAGAA GAAGGCATTC GGCGCTTGCG GATCGTCTCT GAGGAAGTCT 
ATACCGACGA GCAAATGTTG CCCAAAGATT ACCGCGTGGT GGTCAAAAAC GGCGATGCTG TCGAAATTGG CAACCTCTTG GCCGAAAGCA ATGTCGATGG CGATGGTCGT GCACCATTGG TCGCAGGCCT AGCTGGCAAC GTCTATGTTG ACGATGATCG CTTGGTAATT CAAGCCAAAG ACATCGAAGA ACATGAAGAA GTGATTGCCC ACGCTGCTCG TTTGCGGGTC AAAGATGGCG ATTTGGTGCA AGTTGGCCAA CAAATGACCG AAGGCTCATC TGACCCTCAA GAAATGCTGC AATTGCGTGG TCGCGAAGCA GTTCAAGAAT ACCTGACCAA CGAAGCCCAA AAGGTCTATC GTTCGCAAGG TGTGGGGATC AACGATAAGC ACATTGAAGT GATTGTGCGC CAAATGTTGC GACGGGTACG CATCGAAGAG CCAGGCGATA CTGAACTCTT GCCAGGCGAA TTGGTCGAAC TCCACGAACT CAATCGGATC AATGCTTCGA TTGTGAGCCA AGGTGGTGAC CCAGCATTGG CAGTGCCAGT TTTGTTGGGG ATTACCAAGG CCTCGTTGTC AACCGACTCA TTCTTGTCGG CGGCCTCGTT CCAAGAAACC ACGCGCGTGC TGACTGAAGC TGCCGTCAAT GGCAAGATCG ATTACCTGCG TGGCTTGAAA GAAAACGTGG TTATCGGCAA GCTGATTCCA GCTGGTACGG GGATGGAGCA ACGGCGCAAA TTGGCTGAAG AAGCCGCCTT GCGGGTTGCC CAAATCACCA GCACTCCCGC TGATCGCGAA GTTCCAGCAG CAACGCCTGC GCCAGCAGTG ATGAGCGAAC CCAAGCCCGC GCCACCACGC TCATTCGACG AAGCCTTGAA CGCTGTCACC AACATCGACA GTGGCAATGG ACCAAAGGAT GACTTGTTTG CCCAAGCCAT GGCTCGTTTG CAAGCTGAAG AAGGTCGTAA ACCAACCCTG AGCGAACTCT TAGGCACCGA CGAAGACGAG GAAAACGTCT AA
Protein sequence | MLEINEFNAI RISLASPEDI LSWSHGEVTK PETINYRTLR PERDGLFCEK IFGPTRDWEC YCGKYKRVRY KGIVCDKCGV EVTRSKVRRD RMGHIKLASP VSHIWFVKGT PSRLGLLLDI SPRNLERVLY FASYIITDVD DLALGNVREQ MKTDFGVRRK DLEEKIIEQR GEKATRLSKD LAAMDNAMEG TLERTHEQFA RQRQEIEDEA NALREHLEEL IGIDALADED IVYRGTVLLE ENEPVRERSL EQLEQLIDQE REKLEQRREY EIENVRLLAD GERDQRQSVA DAEQERLTTA LIKQLEDLNK EEKDKLDRLD DIQLHRIISE NEYRILRDLA PYTFKADMGA GAVRDIVSMV DLDELSNQMQ AEVQSSSGQR RKKATKRLRV VEAFRKSGNR PEWMIMTVLP VIPPDLRPMV QLDGGRFATS DLNDLYRRVI NRNNRLKRLM ELNAPEIIVR NEKRMLQEAV DALIDNGRRG RPVSGKGKHR LKSLSDMLKG KQGRFRQNLL GKRVDYSGRS VIVVGPTLQL HQCGLPKKMA LELFKPFVMR RLVDKGFAHN IKSAKRFVER VRPEVWDVLE EVIKDYLVLL NRAPSLHRLS IQAFEAKLIE GSAIQLHPLV CAAFNADFDG DQMAVHVPLS RKAQEEARRR MISTYNLLSP ATGDPIITPS QDIVLGCFYL TQVRPGAKGG GKRFGSIDEA LLAYTNGVVH IQAPVWIVIE DYILPGSDLR EKELPSLDGV TPRVLIETSV GRIIFNNALR YQGEPKAGEN GYRSPLHYRN FLVGKSGLKA LIADCYRFHS QRENIIADVY QELIERFGPE TSEESLLRFY ASERTARLAD RIKALGFKHA TLGGMTFSAS DVEVPDTKDA IVQETYKKVQ DIEKMQRRGL ITDDERYREV VTAWLDATNQ IKVEVQRSLN PFGPVSMMST SGARGNVEQI RQMAGMRGLT TDPTGRIIEL PITANFREGL SVIEYFISTH GGRKGLADTA LRTADAGYLT RRLVDVAQDV IVTIEDCGTT EGMWIRVSRD ILASVEDRIV GRVTVAPVTN PVTGEVIFDT DSEILEDDGK HIANVLKSLD KEAQAEFGIY VRSVLTCNAD YGICRKCYGR NLATGKMVEI GEAVGIIAAQ SIGEPGTQLT LRTFHTGGVA TDTDITQGLP RVQEIFEARI PKGKAVLAQI AGRVQIVREE EGIRRLRIVS EEVYTDEQML PKDYRVVVKN GDAVEIGNLL AESNVDGDGR APLVAGLAGN VYVDDDRLVI QAKDIEEHEE VIAHAARLRV KDGDLVQVGQ QMTEGSSDPQ EMLQLRGREA VQEYLTNEAQ KVYRSQGVGI NDKHIEVIVR QMLRRVRIEE PGDTELLPGE LVELHELNRI NASIVSQGGD PALAVPVLLG ITKASLSTDS FLSAASFQET TRVLTEAAVN GKIDYLRGLK ENVVIGKLIP AGTGMEQRRK LAEEAALRVA QITSTPADRE VPAATPAPAV MSEPKPAPPR SFDEALNAVT NIDSGNGPKD DLFAQAMARL QAEEGRKPTL SELLGTDEDE ENV
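The sequences above are printed in space-separated blocks of ten characters, so they need whitespace stripped before any computation. The sketch below shows one way to do that, compute a GC fraction, and translate codons. The helper names are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for internal codons; table 11 additionally permits alternative start codons, which do not matter here because this gene begins with ATG.

```python
# Hypothetical helpers for illustration; not part of any database API.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Build the 64-entry codon table from the conventional TCAG ordering.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
head = "ATGCTCGAAA TCAATGAATT CAATGCAATT"
tail = "GAAAACGTCT AA"
print(translate(head))  # MLEINEFNAI
print(translate(tail))  # ENV
```

Running `gc_fraction` on the full gene sequence should give a value comparable to the reported GC content field, provided the record computes that figure over the same interval.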