Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0446 |
Symbol | |
ID | 4027020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 491627 |
End bp | 492628 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637965604 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_572507 |
Protein GI | 92112579 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.384145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGTT CAGTGACAGA GTTTCTCCGT CCTCGCGACA TCAAGGTCGA AGAGATCAAC GCGAATCATG CGAAGATCGT GCTCGAGCCG TTCGAGCGTG GTTTCGGCCA TACCCTGGGG AATGCTCTGC GTCGCATCCT GCTGTCTTCC ATGCCCGGTT GCGCCGTTGT GGAAGCGGAG ATTGAGGGCG TTCTGCACGA GTACAGCGCC ATCGAGGGCG TCCAAGAGGA CGTCATCGAG ATTCTCCTGA ACCTCAAGGA CGTTGCCGTC AAGATGCACG GTAACCGTGA CGAGGTGGTT CTGGCGCTGA GCAAGCAGGG GCCGAGCGTG GTCACCGCTG GCGATATCGC CGTCGATCAT GACGTCGAAA TCGTCAACCC GGATCACGTC ATCGCGCACC TCAACGACAG CGGCGAGCTG AAAATGCAGC TCAAGGTGGT TCGCGGTCGT GGCTACGAGC CGGCGGATAC CCGTGCTTCC GAGGAAGACG AATCGCGTGC GATCGGCCGC CTCCAGTTGG ATGCGACCTT CAGCCCGGTA CGTCGTGTGT CCTACTCCGT GGAAGCCGCG CGTGTCGAGC AGCGTACCGA CCTCGATAAG CTGATTATCG ACTTGGAAAC CGACGGCACC CTGGACCCGG AAGAAGCGAT TCGCCGCAGT GCGACCATCC TCCAAGAGCA GCTGGCCGCG TTCGTCGACC TCGAAGCCGA TAAGGAACAG GAAGTCGAAG AAGAAGAGGA TCAGATCGAT CCGATTCTGC TGCGCCCCGT AGACGATCTC GAGTTGACCG TCCGCAGCGC CAACTGCCTG AAGGCCGAGA ATATCTATTA TATCGGTGAT CTGATTCAGC GTACCGAAGT GGAGCTGTTG AAGACCCCGA ACCTCGGCAA GAAATCCTTG AATGAAATCA AGGACGTTCT GGCAGCGCGC GGTCTTTCCC TCGGCATGCG GCTGGAAAAT TGGCCGCCGG CGAGCCTGAA GGACGACAAG GCCTCTGCGT GA
|
Protein sequence | MQRSVTEFLR PRDIKVEEIN ANHAKIVLEP FERGFGHTLG NALRRILLSS MPGCAVVEAE IEGVLHEYSA IEGVQEDVIE ILLNLKDVAV KMHGNRDEVV LALSKQGPSV VTAGDIAVDH DVEIVNPDHV IAHLNDSGEL KMQLKVVRGR GYEPADTRAS EEDESRAIGR LQLDATFSPV RRVSYSVEAA RVEQRTDLDK LIIDLETDGT LDPEEAIRRS ATILQEQLAA FVDLEADKEQ EVEEEEDQID PILLRPVDDL ELTVRSANCL KAENIYYIGD LIQRTEVELL KTPNLGKKSL NEIKDVLAAR GLSLGMRLEN WPPASLKDDK ASA
|
| |