Gene Information |
Locus tag | RoseRS_1158 |
Symbol | |
ID | 5208109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1440837 |
End bp | 1441808 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640594775 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001275515 |
Protein GI | 148655310 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
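The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the values in this record) and that the CDS ends in a single stop codon; the function names are hypothetical, chosen for illustration:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1440837, 1441808)
print(length, protein_length_aa(length))  # 972 323
```

The strand does not affect the length calculation; it only determines which end of the interval holds the start codon.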
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00804357 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0422731 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATA TCGCCATGCC AAAGATTGAA GTCGTTACCG CTGCTGAAAA TTACGGACGG TTCAAAATCG AGCCGCTCGA TCCAGGGTAC GGACATACGC TTGGAAATGC GCTGCGCCGC GTTCTGCTCT CTTCCATCCC TGGTGCGGCG ATTACGAAGA TCAAGATCGA GGGAGTATTT CATGAGTTCT CGACTATTCC GGGGGTCAAA GAAGACGTCA CTGAGATTGT CTTGAACATC AAAGGTATTC GTTTGCGTTC CTATGCCGAA CGTCCTGTGA AAATATCGCT GTCGAAACGC GGCGCCGGCG TTGTTCGCGC TGCGGATATT GATGCGCCAA GCAATGTCGA GATCGTTAAT CCCAACCACT ATATCTGTAC GATCGATCGC GACGATGCCG CCATTGATAT GGAAATGACG GTCGAACGCG GGCGCGGCTA CCTGCCGGCG GATCAGCGCG ATGCGCTGCC GATCGGCGAA ATCCCGATTG ATGCGATCTT TACTCCTGTC CCCAAGGTCA ATTATGTGGT TGAACATATT CGCGTGGGGC AGGCGACCGA TATCGACAGC CTGTTGATCG AAATCTGGAC TGATGGAACG ATCAAGCCGG GGGATGCGCT CAGCCACGCG GCGCAGGTGC TGGTTCAGTA TTCGCAGACG ATTGCCGACT TCAATCGCCT CTCGACAGAA GCGGAACCGA CTACAGCGCC CAACGGACTG GCTATCCCGG CGGATATTTA TGATACGCCG ATCGAGGAGC TCGATCTCTC AACACGGACC TACAACTGTC TCAAGCGCGC CGATATTACC AAAGTCGGTC AGGTGCTCGA AATGGACGAA AAGGCGCTGC TGTCGGTGCG GAATCTGGGA CAAAAATCGA TGGAAGAAAT CCGCGATAAA TTGATCGAAC GCGGCTATAT TCCACGGATC GGTCAAACAT CGCACGCGGC TCGTGCAGAG ATCGAGGGTT GA
|
Protein sequence | MLDIAMPKIE VVTAAENYGR FKIEPLDPGY GHTLGNALRR VLLSSIPGAA ITKIKIEGVF HEFSTIPGVK EDVTEIVLNI KGIRLRSYAE RPVKISLSKR GAGVVRAADI DAPSNVEIVN PNHYICTIDR DDAAIDMEMT VERGRGYLPA DQRDALPIGE IPIDAIFTPV PKVNYVVEHI RVGQATDIDS LLIEIWTDGT IKPGDALSHA AQVLVQYSQT IADFNRLSTE AEPTTAPNGL AIPADIYDTP IEELDLSTRT YNCLKRADIT KVGQVLEMDE KALLSVRNLG QKSMEEIRDK LIERGYIPRI GQTSHAARAE IEG
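The gene and protein sequences above can be cross-checked with a short script. This is a sketch only, not the method the database uses: it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces between the 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three and last two blocks of the gene sequence in this record.
print(translate("ATGCTGGATA TCGCCATGCC AAAGATTGAA"))  # MLDIAMPKIE
print(translate("ATCGAGGGTT GA"))                     # IEG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) checks the whole record, and `gc_fraction` can be compared against the GC content field. Table 11 also allows alternative start codons such as GTG and TTG, which are read as methionine at the initiation position; this sketch does not handle that case, which does not arise here because the gene starts with ATG.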