Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_2039 |
Symbol | |
ID | 8419884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 2339706 |
End bp | 2340749 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 645038627 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_003198901 |
Protein GI | 258406159 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0253649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000243746 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGTCA AAGACGGTGA CAAACTTGTC AATTGCCGTA ATTGGTCGAC GCTGGTTCAT CCAGAGACTC TTGAGCGGGA TGAAGACAGC ACGGAGAACT ATGGCAAGTT TTCCTGCGAA CCTTTGGAGC GGGGTTTTGG AACGACCTTA GGCAATGCCC TGCGCCGGGT GCTGCTGTCC TCTCTGCAAG GGGCGGCCAT TGTAGCGGTG CGCATCAAGG GTATCCAGCA CGAGTTCACG ACGATCCCCG GGGTAATGGA GGACATTACT GATATAGTCC TCAATTTGAA GCAGTTGCGT TTGCGGATGA ATACGGACGA GCCTCAGCGT ATTGAACTCA ATGTCAATAC CAAGGGTGCC GTGACGGCTT CCGCGTTTCA GACAACGCAA AACCTCGAGA TCCTGAATCC GGACTTACAT ATCGCCACCT TGTCGGAGGA TATCGAATTC GGCATTGAGG CTGAAGTGCG GATGGGAAAA GGGTATGTTC CGGCGGAGAT GCATGAAGGA TTGGAAGAGG AAATCGGGCT GATTTCCATG GATGCCAGTT TCTCTCCGAT CCGCAAAGTC GCGTATCGTG TGGAACAGGC CCGCGTGGGC CAGATGACCA ACTATGACAA ACTGGTCATG GAAGTCTGGA CTGACGGGTC CGTGTTGCCC GAGGACGCTG TGGCTTATAG CGCCAAAATC CTGAAAGAAC AGCTGGCCGT GTTCATTAAT TTCAATGAAG ACAGTGCCAA TGTCTGTGAA TCCAAGGGTG CCGGCACCGA GTCATTGAAC TCGAACCTCT TCAAACATAT TGATGACCTC GAACTCCCCG TCCGGGCCAG CAATTGTCTG AAAAGCGCCA ACATCAATCT TGTTGGTGAA CTGGTCCAGA AGACCGAGGG CGAGATGTTG AAGACCAAGA ATTTTGGTCG CAAATCGCTT GAGGATATCC GCAAGGTGAT CCATGAACTC GGTCTCGACT TCGGAATGAA GCTGGACGGC TTCGAGGAAC AATACAAGAA ATGGCGAGAG AGGAACCAGC AAGATGAGGC ATAA
|
Protein sequence | MIVKDGDKLV NCRNWSTLVH PETLERDEDS TENYGKFSCE PLERGFGTTL GNALRRVLLS SLQGAAIVAV RIKGIQHEFT TIPGVMEDIT DIVLNLKQLR LRMNTDEPQR IELNVNTKGA VTASAFQTTQ NLEILNPDLH IATLSEDIEF GIEAEVRMGK GYVPAEMHEG LEEEIGLISM DASFSPIRKV AYRVEQARVG QMTNYDKLVM EVWTDGSVLP EDAVAYSAKI LKEQLAVFIN FNEDSANVCE SKGAGTESLN SNLFKHIDDL ELPVRASNCL KSANINLVGE LVQKTEGEML KTKNFGRKSL EDIRKVIHEL GLDFGMKLDG FEEQYKKWRE RNQQDEA
|
| |