Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0084 |
Symbol | rpoA |
ID | 4239593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 78204 |
End bp | 79193 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638103616 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_718291 |
Protein GI | 113460233 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000268564 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGGTT CTGTTACAGA ATTTTTAAAG CCACGTTTAG TTGATATTGA GCAAATTAGT TCTACTCATG CTAAAGTGAT CTTAGAACCG TTAGAGCGTG GCTTTGGTCA TACTCTAGGT AATGCATTGC GTCGAATTCT TCTATCTTCT ATGCCGGGCT ATGCAGTTAC CGAGGTGGAA ATAGATGGCG TATTGCATGA ATATAGTAGT AAGGAAGGTG TTCAGGAGGA TATTATTGAG GTACTTCTGA ACTTAAAAGG TCTAGCGGTT AAAGTACAGA ATAAGGATAA TGTTTTTCTG ACACTAAGCA AATCTGGAAT TGGCCCTGTT GTTGCAGCTG ACATTACTCA TGATGGTGAT GTTGAAATCG TAAATCCTGA TCATGTTATC TGTCACTTAA CAGACGAAAA TGCGTCTATT AATATGCGTA TTCGTGTACA ACGTGGTAGA GGGTATGTGC CGGCATCATC TAGAGTTCAT TCTTTAGATG AAGAACGTCC GATTGGTCGT TTGTTAGTTG ATGCTTGTTA TAGTCCAGTT GATCGTATTG CTTATAATGT ACAAGCAGCA CGTGTTGAAC AACGAACTGA CTTAGATAAG TTGGTTATTG AACTGGAAAC AAATGGCACA ATCGAGCCAG AGGAAGCGAT CCGTCGTGCT GCAACGATTT TAGCAGAGCA ACTAGATGCC TTTGTTGATT TGCGAGATGT TCGTCAACCA GAAGTTAAGG AAGAAAAACC GGAGTTTGAT CCGATTTTAT TGCGTCCTGT TGATGACTTA GAGTTGACAG TTCGTTCTGC TAACTGTTTG AAAGCAGAAA CAATTCATTA TATCGGTGAT TTGGTACAGC GTACTGAAGT TGAGTTACTG AAAACACCTA ATTTAGGTAA AAAATCTCTT ACAGAGATTA AAGATGTGCT TGTTTCACGC GGTCTATCAC TTGGTATGCG CCTCGAAAAT TGGCCACCAG CAAGTATTGC TGAAGACTAG
|
Protein sequence | MQGSVTEFLK PRLVDIEQIS STHAKVILEP LERGFGHTLG NALRRILLSS MPGYAVTEVE IDGVLHEYSS KEGVQEDIIE VLLNLKGLAV KVQNKDNVFL TLSKSGIGPV VAADITHDGD VEIVNPDHVI CHLTDENASI NMRIRVQRGR GYVPASSRVH SLDEERPIGR LLVDACYSPV DRIAYNVQAA RVEQRTDLDK LVIELETNGT IEPEEAIRRA ATILAEQLDA FVDLRDVRQP EVKEEKPEFD PILLRPVDDL ELTVRSANCL KAETIHYIGD LVQRTEVELL KTPNLGKKSL TEIKDVLVSR GLSLGMRLEN WPPASIAED
|
| |