Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3856 |
Symbol | |
ID | 4074919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008042 |
Strand | + |
Start bp | 107815 |
End bp | 109062 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638004513 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_611248 |
Protein GI | 99077989 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACATG CCGTCCCCAG TGCCATCGCC GTGCAGCAGG TCATCGATGA TGTGATGCGC CGCGACAGGG GGCGGCTGCT GGCGGGCCTG AGCGTCCGCC TCGGGGATAT CCAGCTGGCA GAAGACGCTC TGCAGGAGGC GGTGATTGCG GCGCTCAAAC ACTGGGGCCG CTCCGGCGTG CCTCATGCGC CGCTGGCCTG GCTGATGCGG GCAGGGTTCA ACAAGGGGAT CGACCAGCTG CGCAGCCGCC AGCGCGAGGG GCGCAAGGCA GAGGATCTGG GGCTGATTGC CCCCGACCAA GACGCGGCAG AGCAGGCCGA GACCATCCCA GATGCGCGCC TGCGGCTGAT CTTCACCTGC TGCCATCCGG CGCTGGAGGA AAAATCCCGC GTCGCCCTGA CCCTGCGCAC GGTCTGCAGC CTGAGCACGC GCGACATTGC CGCGGCCTTT CTCGACAGCG AGCAGACCAT GGGCCAGCGC CTGTCCCGCG CCAAGGCCAA GATCCGTGCC AAAGGCATCG GGTTTCAGGT GCCAGAGCCC GACCAGTGGT CCGAACGGCT CGGCACGGTG CTCTCCACGC TCTATCTGAT CTTCACAACC GGCTATGTGC AGGAGGAGGC AGGCCCGCGC GATTTCTGTC GCGAGGGGAT CTATCTTGCG CGGCTTTTGT GCGCGCTGCG CCCGGATGAT CCCGAAATCG AGGGGGCGCT GGCCCTGATG CTGCTGACCG AAGCACGCAG CGCCGCCCGC ATTGGCCCGG ATGGGGCGAT GCGCCCGATC GAGGACCAGG ACAGCAGCCT CTGGCACCAT GAGACCATCA CCGAGGCGCA GGCTCTGCTG GCACAGGCGG TTCTGCGCCG CCAGCCCGGC GCGTTTCAGA TCAAGGCCGC CCTTGCGGAT TGCCACATGA TGCGCCCAAA GCCTGATTGG GCGCAGATGG CGCTGCTCTA CCAGGCGCTC TGGCGGTTTG AGCCAACCCC GGTGGTGGCA CTCAACCAGG CGGTGGTGAT GGCAGAGCTG GGGCAAGGCG CTCAGGCCCT GAACCACTTG CGTGCCCTAG AGGATGACCT GGGACAGTTT CAACCATGGC ATGCGGCAAT GGCACATGTG CTGGCACAGG AAGGCCATAT CGGGGACGCC CGCTGTGCCT ATGAGCAGGC CATAAAAACC GCTCCCCATG ACGCTGCGCG CCGGTTTCTG GAAACCAGAG TGCAGAGACT GCCCCCCTGC GCTTTCAGCG GATCTTAA
|
Protein sequence | MTHAVPSAIA VQQVIDDVMR RDRGRLLAGL SVRLGDIQLA EDALQEAVIA ALKHWGRSGV PHAPLAWLMR AGFNKGIDQL RSRQREGRKA EDLGLIAPDQ DAAEQAETIP DARLRLIFTC CHPALEEKSR VALTLRTVCS LSTRDIAAAF LDSEQTMGQR LSRAKAKIRA KGIGFQVPEP DQWSERLGTV LSTLYLIFTT GYVQEEAGPR DFCREGIYLA RLLCALRPDD PEIEGALALM LLTEARSAAR IGPDGAMRPI EDQDSSLWHH ETITEAQALL AQAVLRRQPG AFQIKAALAD CHMMRPKPDW AQMALLYQAL WRFEPTPVVA LNQAVVMAEL GQGAQALNHL RALEDDLGQF QPWHAAMAHV LAQEGHIGDA RCAYEQAIKT APHDAARRFL ETRVQRLPPC AFSGS
|
| |