Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0279 |
Symbol | |
ID | 4077414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 284956 |
End bp | 285972 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005573 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_612274 |
Protein GI | 99080120 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.464101 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCATA AGAATTGGGC AGAATTGATC AAGCCCACGC AGCTTGAGGT GAAACCGGGC AATGATCCGG CACGTCAGGC AACGCTCGTT GCGGAACCGC TGGAGCGTGG CTTTGGTCTG ACGCTCGGCA ACGCGCTGCG CCGCATCCTG ATGAGCTCGC TGCAAGGCGC GGCCATCACA TCCGTCCAGA TCGACAACGT GCTGCACGAG TTTTCCTCCG TGGCCGGTGT TCGTGAAGAC GTCACAGACA TCATCCTGAA CCTCAAGCAG GTCTCCCTGC GCATGGAAGT CGAAGGGCCC AAGCGCCTGT CGATCAATGC CAAAGGTCCG GCCGTCGTCA CCGCAGGCGA CATTGCCGAA ACCGCTGGCA TCGAAGTTCT GAACCGCGAG CACGTCATCT GCCACCTCGA CGATGGTGCG GATCTGTTCA TGGAACTCAC TGTCAACACC GGCAAAGGCT ATGTCTCTGC CGAGAAGAAC AAGCCCGAGG ACGCACCGAT TGGTCTTATT CCGATCGACG CGATCTATTC CCCGGTCAAG AAGGTCTCTT ACGACGTTCA GCCGACCCGC GAAGGTCAGG TTCTGGACTA TGACAAGCTG ACCCTCAAAG TTGACACCGA CGGCTCCATC ACCCCCGAAG ACGCGCTGGC TTTTGCGGCC CGCATCCTTC AGGACCAGCT GTCGATCTTC GTGAACTTCG ACGAGCCGGA ATCCGCAGGT CGTCAGGACG AGGACGATGG TCTCGAGTTC AACCCGCTTC TCCTCAAGAA AGTGGACGAG CTGGAACTGT CCGTGCGTTC GGCAAACTGC CTCAAGAACG ACAACATCGT CTATATCGGC GATCTGATCC AGAAAACCGA AGCCGAGATG CTCCGCACCC CGAACTTCGG CCGCAAGTCC TTGAACGAAA TCAAGGAAGT GCTGTCTGGC ATGGGTCTGC ACCTCGGTAT GGACGTCGAG GACTGGCCGC CGGACAACAT CGAAGAGCTG GCCAAGAAAT TCGAAGACAG CTTCTAA
|
Protein sequence | MIHKNWAELI KPTQLEVKPG NDPARQATLV AEPLERGFGL TLGNALRRIL MSSLQGAAIT SVQIDNVLHE FSSVAGVRED VTDIILNLKQ VSLRMEVEGP KRLSINAKGP AVVTAGDIAE TAGIEVLNRE HVICHLDDGA DLFMELTVNT GKGYVSAEKN KPEDAPIGLI PIDAIYSPVK KVSYDVQPTR EGQVLDYDKL TLKVDTDGSI TPEDALAFAA RILQDQLSIF VNFDEPESAG RQDEDDGLEF NPLLLKKVDE LELSVRSANC LKNDNIVYIG DLIQKTEAEM LRTPNFGRKS LNEIKEVLSG MGLHLGMDVE DWPPDNIEEL AKKFEDSF
|
| |