Gene Information |
Locus tag | TM1040_0235 |
Symbol | |
ID | 4076268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 251905 |
End bp | 256137 |
Gene Length | 4233 bp |
Protein Length | 1410 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005529 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_612230 |
Protein GI | 99080076 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
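
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(251905, 256137)
print(length)                  # 4233
print(protein_length(length))  # 1410
```

Both values match the Gene Length and Protein Length fields of this record.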
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0585906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGG AAATCACCAA CAACCCGTTC AACCCGCTGA CTCCGCCCAA GGTCTTTGAC GAAATCAAAG TCTCGCTGGC CAGCCCCGAA CGGATCCTGT CTTGGTCCTA CGGCGAGATT AAAAAGCCGG AAACCATCAA CTACCGTACG TTCAAGCCCG AGCGTGACGG CCTGTTCTGC GCGCGTATCT TTGGCCCGAT CAAAGACTAC GAATGTCTCT GCGGCAAATA TAAGCGCATG AAGTATCGCG GCGTTGTCTG CGAGAAATGT GGTGTTGAAG TTACCCTTCA AAAAGTGCGC CGCGAGCGCA TGGGCCACAT CGAACTGGCG GCACCCGTTG CGCATATCTG GTTCCTGAAG TCGCTTCCTT CCCGGATCGG CCTCATGCTG GACATGACCC TGCGTGATCT GGAACGTGTT CTGTACTTTG AAAACTACGT TGTCATCGAG CCGGGTCTGA CTGACCTGCA ATACGGCCAG ATGATGACCG AAGAAGAGTA CATGGACGCG CAAGACGCCT ATGGCATGGA CGCTTTCACC GCCAATATCG GTGCGGAAGC GATCCGCGAA ATGCTGGCGG CGATCGATCT TGAGGCAGAA GCCGAGCACC TGCGCGCCGA ACTGGCCGAG GCCACCGGCG AACTGAAGCC CAAGAAGATC ATCAAACGTC TGAAAGTCGT TGAATCCTTC CTCGAGTCGG GCAACCGTCC TGAGTGGATG ATCATGACCG TGATCCCGGT TATCCCGCCG GAACTGCGTC CGCTGGTTCC GCTGGATGGG GGCCGTTTTG CGACCTCCGA CCTCAACGAC CTCTATCGTC GTGTGATCAA CCGGAACAAT CGTCTGAAGC GACTCATCGA GCTGCGCGCG CCTGACATCA TCGTCCGCAA CGAAAAGCGG ATGCTGCAGG AATCCGTTGA TGCTCTGTTC GACAACGGCC GTCGTGGTCG CGTCATCACC GGTGCCAACA AGCGTCCGCT GAAGTCGCTC TCCGACATGC TGAAGGGTAA GCAGGGTCGC TTCCGTCAGA ACCTTCTCGG TAAACGCGTC GACTTCTCCG GTCGTTCGGT TATTGTGACC GGCCCCGAGC TGAAGCTGCA CCAGTGTGGC CTGCCCAAGA AGATGGCGCT CGAACTCTTC AAGCCCTTCA TCTACTCGCG TCTGGAGGCC AAAGGTCTGT CCTCCACCGT GAAGCAGGCG AAGAAGCTCG TTGAAAAAGA ACGTCCCGAA GTTTGGGATA TCCTCGACGA GGTGATCCGC GAGCACCCGG TGATGCTGAA CCGCGCACCG ACGCTGCACC GTCTTGGTAT TCAGGCGTTC GAACCCACGC TGATCGAAGG TAAAGCGATC CAGCTGCACC CGCTGGTCTG TTCGGCGTTC AACGCGGACT TCGACGGTGA CCAGATGGCC GTGCACGTCC CGCTGAGCCT TGAGGCCCAG CTGGAAGCGC GCGTCCTGAT GATGTCGACG AACAACGTTC TGTCGCCCGC AAACGGTGCG CCGATCATCG TTCCCTCGCA GGATATGATC CTTGGTCTCT ACTACCTCAC GCTGGAGCGT GAAGGCATGA AGGGCGAAGG CAAGATCTTT GGCAACCTCG ACGAGGTCCA GCACGCGCTG GACGCGGGCG AGGTGCATCT GCACTCCAAA GTGACCGTGC GGGTTCCGCA GATCGACGAA GAGGGCAACG AAGTCTTCCA ACGCTTCGAG ACCACGCCGG GCCGTGCCCG TCTGGGCTCC TTGCTGCCGA AGAACGCCAA GGCACCGTTT GAACTGGTGA ACCGTCTGCT GCGCAAACGC 
GAAGTTCAGC AGGTCATCGA CACCGTCTAC CGTTACTGCG GTCAGAAAGA GTCGGTGATC TTCTGTGACC AGATCATGAC CATGGGCTTC CGCGAAGCGT TCAAGGCGGG CATCTCGTTC GGCAAGGACG ACATGGTGAT CCCCGACAAC AAGTGGACCA TCGTCGACGA CACCCGCGAT CAGGTGAAAG ACTTTGAACA GCAGTACATG GACGGCCTGA TCACTCAGGG CGAAAAGTAC AACAAGGTTG TCGATGCCTG GTCGAAGTGT AACGACAAGC TCACCGAAGC CATGATGTCC ACCATCTCGG CGGTCAAAAA GGCTGAAGAT GGCTCCGACA TGGAACCGAA CTCGGTCTAT ATGATGGCGC ACTCCGGTGC GCGTGGCTCC GTGACGCAGA TGCGTCAGCT GGGCGGGATG CGCGGCCTGA TGGCAAAGCC GAACGGCGAC ATCATCGAGA CCCCGGTTAT CTCGAACTTT AAAGAAGGCC TCACCGTTCT GGAGTACTTC AACTCCACCC ACGGTGCGCG TAAGGGTCTG TCGGACACCG CTCTGAAAAC GGCGAACTCC GGTTACCTTA CACGTCGTCT TGTGGACGTG GCGCAGGACT GCATCGTGCG CGAGCACGAC TGTGGCACCG AGCGTGCGAT CACGGCTGAA ACCGCGGTCA ACGACGGTGA AGTTGTGGCG TCTCTTGGTG AGCGTATCCT GGGTCGTGTG GCGGCAGATG ACGTCAAGCG TCCAGGCACC GAAGAAGTCC TGTTGGCTGC AGGTCAGCTC ATTGACGAAC GTATGGCCGA CACCATCGAG GAAGCTGGCG TTCAGTCCAT GCGCATCCGC AGCCCGCTGA CCTGTGAAAG CGAAGAGGGC GTCTGCGCCA TGTGCTATGG CCGCGACCTT GCACGCGGCA CCATGGTCAA CTCCGGTGAG GCCGTCGGCA TCATCGCGGC GCAGTCCATC GGTGAACCAG GTACACAGCT GACGATGCGG ACCTTCCACA TCGGCGGCGT GGCCCAGGGT GGTCAGCAGT CCTTCCTCGA GGCGTCCCAG GAGGGCAAGA TCGTGTTCGA GAACACGAAC ACGCTCACCA ACGCCAACGG CGAAGTTCTG ACCATCGGCC GGAACATGAA GCTGATCATT CAGGACGAGC ACGGTGAAGA GCGCTCCAGC CACAAGCTGG GCTACGGTAC CAAGCTCTTT GTGAAAGAGG GCCAAGAGGT CAAACGCGGC GATAAACTGT TCGAGTGGGA CCCCTATACC CTGCCGATCA TCGCCGAGAA GCCGGGTACC GCGAAGTACG TGGACCTCGT GTCCGGTCTG GCCGTGCGCG AAGAAACCGA CGATGCCACC GGCATGACCC AGAAGATCGT GATCGACTGG CGTGCGGCAC CGAAGGGGTC TGACCTGAAG CCGGAGATCA TCCTTGTGGG TGACGATGGC GAACCAGTGC GCAACTCCCT TGGAAACCCG CTCACCTATC CGATGTCCGT GGATGCGATC CTGTCCTGTG AAGATGGCCA GAAGATCGAA GCCGGTGACG TTGTTGCGCG TATCCCGCGT GAAGGCGCGA AGACGAAGGA CATTACCGGT GGTCTGCCGC GTGTGGCCGA ACTCTTCGAG GCGCGTCGTC CGAAGGATCA CGCGATCATC GCGGAAATCG ACGGCTATGT GCGCTTTGGT CGTGACTACA AGAACAAGCG TCGTATCTCG ATCGAGCCGG CAGATGAGTC CATGGAGGCC GTGGAATACA TGGTGCCCAA GGGCAAGCAC ATCCCGGTGG TCGAAGGCGA CTTCGTTCAG AAGGGCGACT 
ACATCATGGA CGGCAACCCG GCGCCGCATG ACATCCTCTC CATCATGGGT GTCGAGGCTC TGGCGAACTA CATGATCGAC GAGGTGCAGG ACGTCTATCG CCTGCAGGGT GTGAAGATCA ACGACAAGCA CATCGAGGTG ATCGTTCGCC AGATGCTGCA GAAGTGGGAG ATCTCCGACT CTGGCGAGAC CACGCTCCTC AAGGGCGAAC ACGTCGACAA GCAGGAGTTC GACGCTGCAA ACGAGAAGGC GCTGGCCCGT GGCAAGCGTC CTGCTCAGGG CGAGCCGATC CTTCTTGGTA TCACCAAGGC GTCGCTGCAG ACCCGCTCCT TCATCTCCGC GGCCTCCTTC CAGGAGACCA CACGGGTGCT CACCGAAGCC TCCGTACAGG GCAAGAAGGA CAAGCTGGTC GGCCTGAAGG AAAACGTCAT CGTGGGTCGT CTGATCCCGG CGGGTACCGG TGGTGCCACC CAGCAGGTGC GTCACATCGC GGCCTCGCGC GACAATGTGG TTCTTGAGGC CCGTCGTGAA GAAGCCGAAG CGGCGGCAGC CCTTGCTGCG CCGATGGCGG ATGATGCGGC GGATGCGGAC TTCCTCGTGG AAACCCCGGA AAGCCGCGAC TGA
|
Protein sequence | MNQEITNNPF NPLTPPKVFD EIKVSLASPE RILSWSYGEI KKPETINYRT FKPERDGLFC ARIFGPIKDY ECLCGKYKRM KYRGVVCEKC GVEVTLQKVR RERMGHIELA APVAHIWFLK SLPSRIGLML DMTLRDLERV LYFENYVVIE PGLTDLQYGQ MMTEEEYMDA QDAYGMDAFT ANIGAEAIRE MLAAIDLEAE AEHLRAELAE ATGELKPKKI IKRLKVVESF LESGNRPEWM IMTVIPVIPP ELRPLVPLDG GRFATSDLND LYRRVINRNN RLKRLIELRA PDIIVRNEKR MLQESVDALF DNGRRGRVIT GANKRPLKSL SDMLKGKQGR FRQNLLGKRV DFSGRSVIVT GPELKLHQCG LPKKMALELF KPFIYSRLEA KGLSSTVKQA KKLVEKERPE VWDILDEVIR EHPVMLNRAP TLHRLGIQAF EPTLIEGKAI QLHPLVCSAF NADFDGDQMA VHVPLSLEAQ LEARVLMMST NNVLSPANGA PIIVPSQDMI LGLYYLTLER EGMKGEGKIF GNLDEVQHAL DAGEVHLHSK VTVRVPQIDE EGNEVFQRFE TTPGRARLGS LLPKNAKAPF ELVNRLLRKR EVQQVIDTVY RYCGQKESVI FCDQIMTMGF REAFKAGISF GKDDMVIPDN KWTIVDDTRD QVKDFEQQYM DGLITQGEKY NKVVDAWSKC NDKLTEAMMS TISAVKKAED GSDMEPNSVY MMAHSGARGS VTQMRQLGGM RGLMAKPNGD IIETPVISNF KEGLTVLEYF NSTHGARKGL SDTALKTANS GYLTRRLVDV AQDCIVREHD CGTERAITAE TAVNDGEVVA SLGERILGRV AADDVKRPGT EEVLLAAGQL IDERMADTIE EAGVQSMRIR SPLTCESEEG VCAMCYGRDL ARGTMVNSGE AVGIIAAQSI GEPGTQLTMR TFHIGGVAQG GQQSFLEASQ EGKIVFENTN TLTNANGEVL TIGRNMKLII QDEHGEERSS HKLGYGTKLF VKEGQEVKRG DKLFEWDPYT LPIIAEKPGT AKYVDLVSGL AVREETDDAT GMTQKIVIDW RAAPKGSDLK PEIILVGDDG EPVRNSLGNP LTYPMSVDAI LSCEDGQKIE AGDVVARIPR EGAKTKDITG GLPRVAELFE ARRPKDHAII AEIDGYVRFG RDYKNKRRIS IEPADESMEA VEYMVPKGKH IPVVEGDFVQ KGDYIMDGNP APHDILSIMG VEALANYMID EVQDVYRLQG VKINDKHIEV IVRQMLQKWE ISDSGETTLL KGEHVDKQEF DAANEKALAR GKRPAQGEPI LLGITKASLQ TRSFISAASF QETTRVLTEA SVQGKKDKLV GLKENVIVGR LIPAGTGGAT QQVRHIAASR DNVVLEARRE EAEAAAALAA PMADDAADAD FLVETPESRD
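
The relationship between the gene and protein sequences can be illustrated by translating the first few codons of the gene sequence. The sketch below is a minimal, assumed implementation rather than the method used to produce this record. It builds the standard codon-to-amino-acid mapping, which translation table 11 shares for all sense and stop codons (table 11 differs in the set of permitted start codons). It strips the spaces that group the sequence into blocks of ten. It also shows how a GC fraction such as the one in the GC content field could be computed. All helper names are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_blocks = "ATGAACCAGG AAATCACCAA CAACCCGTTC"
print(translate(first_blocks))  # MNQEITNNPF
```

The output matches the first block of the protein sequence above. Applied to the full gene sequence, the same function should reproduce the whole protein, with the terminal TGA read as the stop codon.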