Gene Information |
Locus tag | Mext_0813 |
Symbol | |
ID | 5832321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 883759 |
End bp | 886074 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641366595 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001638289 |
Protein GI | 163850246 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
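
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the Gene Information table.
print(cds_lengths(883759, 886074))  # (2316, 771)
```

The result matches the Gene Length (2316 bp) and Protein Length (771 aa) fields.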
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.179863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.675147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAAGAGCG TGAACCTGCT GATCGCCGAG GAACTCGGTG CGCGCGAGGG GCAGGTGGCG GCGGCCGTGG ACCTGCTCGA CGGCGGCTAC ACCGTCCCGT TCATCGCCCG CTACCGCAAG GAGGCAACTG GCTCGCTGGA CGACGCACAG CTCCGCACCC TTGAGGAGCG GCTGGGGTAT CTGCGCGAGC TGCGCGACCG GCGCACCAGC GTCACCGAGA GCATCCGCGC CCAGGGCAAG CTGACGCCGG AACTCGCCGC CGCCATCGCC GCCGCCGACA CCAAGGCGCG GCTGGAAGAC ATCTACCTGC CGTTCCGGCC CAAGCGCCGC AGCAAGGCGC AGACCGCCCG CGAGGCCGGG CTCGCGCCGT TGGCCGAGAC CCTGCTCGCG CGGCCGGAGA CGGCGCCGGG GCGGGCAGCG CAGGGCTTCG TCGACGCGGC CAAGGGCATC GAAACCGCCG AGGCGGCCCT GGAGGGCGCC CGGGCGATCC TGATCGAGCG GTTTGCCGAG GATGCCGACC TGATCGGCCG CCTGCGCGAG GACTTTTGGC GCGGCGGCGA GGCCGTGGCG AAGGTGCGCA AGGGCCAGGA GACTGCGGGC CAGAAATTCT CCGACTATTT CGACTGGCGC GAGCGCCTGG AGCGGATGCC CTCGCACCGG GTGCTGGCGG TGTTCCGCGG CGAAAAGGAG GAGGTGCTCG ACCTCGCCTT CGCCGCGGAG GGCGAGGATT CCGCGCCCGG CGTGCCGGGG CCGTTCGAGC TCGCCGTCTG CCGCCGGTTC GGCGTCTCCG CGCGGGGCCG GCCGGCGGAT GCGTGGCTGC TCGACACCGT CCGCACCGCG TGGCGCACCA AGATCCGCAC CGGCATCAAG GCCGATCTGC GGGCACGCCT GTTCGAGCGG GCGGAGGAGG CGGCGGTGAA GGTGTTCGCC GGCAACCTCA AGGATCTGCT GCTCGCCGCC CCGGCGGGCG GGCGGGCGAC GCTCGGGCTC GATCCGGGCT ACCGCAACGG CGTGAAGGCG GCGGTGGTCG ACCGCACCGG CAAGGTCGTC GCGGTCGAGA CCACCTATCC GCACGAGCCG CAGCGGCGCT GGAAGGAGGC GGTGGCCTCG CTCTCCCGGC TCTGCCGCCA GCACGGCGTC GAGTTGATCG CCATCGGCAA CGGCACCGCC TCGCGCGAGA CCGACCGGCT CGCCACCGAG ATCCTGGCCG CAAACCCTGA TCTCAAGATG GCCAAGGTCA CGGTGTCGGA GGCCGGCGCC TCGGTGTACT CGGCCTCGGC CATCGCCACC CGTGAGTTGC CCGACCTCGA CGTGTCGCAT CGCGGCGCCG TCTCCATCGC CCGGCGCCTG CAGGACCCGC TGGCGGAACT GGTGAAAATT GACCCGAAAT CCATCGGCGT CGGCCAGTAC CAGCACGACG TCACCGAGCA GAAGCTGTCG CGCTCGCTTC AAGCGGTGGT CGAGGATGCG GTGAACGCGG TCGGCGTCGA TGTGAACACC GCCTCCGGCC CGCTGCTGGC CCAGGTCTCG GGCCTCGGCG CGTCGGTGGC GGACAAGATC GTCACCCACC GCGACGCCAA CGGCCCGTTC CGTACCCGCG CCGGACTGAA GAAGGTGCCG GGCCTCGGCG CCAAGACCTT CGAGTTGGCG GCGGGCTTCC TGCGCATCCC CGATGGCGAG GACCCGCTCG ACCGCTCCGG CGTCCACCCG GAAGCCTATC CGGTGGTGCG CCGCATCCTG GAGGCGACGA AGAGCGACAT CCGCGTGCTG ATCGGCAATG CCGCCGCCCT GCGTCCGCTC 
TCGCCCGCCG CCTTCGCCGA CGAACGCTTC GGCGTGCCGA CCGTGCGCGA CATCATCGCC GAGTTGGAAA AGCCCGGCCG CGACCCGCGC CCGGCCTTCA AGACGGCGAG CTTCCAGGAA GGCGTCGAGA AAATCGGCGA CCTCAGGCCA GGGATGCAGT TGGAGGGCGT CGTCACCAAC GTCGCGGCCT TCGGCGCCTT CGTCGATATC GGCGTGCATC AGGACGGACT CGTCCACATC TCGGCCATGG CCCGCAAGCG GATCGCCTCG CCTTCCGAAG TGGTGAAGAC CGGCGACGTG GTGCGTGTGC TGGTGTTGTC GGTCGATGTG CCGCGCAAGC GCATCGCGCT GTCGATGCGG CTCGACGACC CCCTTGAGGG CGCAACGGCG CCGCGTGGAA ACGTCCCCCG CCCCGAGGCG CAGCCCCGGC GCCCGGCGCC CGCAGCCCCG CCGCAGGATG GGGCGCTGGC CGACGCGCTC CGGCGCGCCG GAGTCTCGTC GTCTAAGCGT TCTTGA
Protein sequence | MKSVNLLIAE ELGAREGQVA AAVDLLDGGY TVPFIARYRK EATGSLDDAQ LRTLEERLGY LRELRDRRTS VTESIRAQGK LTPELAAAIA AADTKARLED IYLPFRPKRR SKAQTAREAG LAPLAETLLA RPETAPGRAA QGFVDAAKGI ETAEAALEGA RAILIERFAE DADLIGRLRE DFWRGGEAVA KVRKGQETAG QKFSDYFDWR ERLERMPSHR VLAVFRGEKE EVLDLAFAAE GEDSAPGVPG PFELAVCRRF GVSARGRPAD AWLLDTVRTA WRTKIRTGIK ADLRARLFER AEEAAVKVFA GNLKDLLLAA PAGGRATLGL DPGYRNGVKA AVVDRTGKVV AVETTYPHEP QRRWKEAVAS LSRLCRQHGV ELIAIGNGTA SRETDRLATE ILAANPDLKM AKVTVSEAGA SVYSASAIAT RELPDLDVSH RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVTEQKLS RSLQAVVEDA VNAVGVDVNT ASGPLLAQVS GLGASVADKI VTHRDANGPF RTRAGLKKVP GLGAKTFELA AGFLRIPDGE DPLDRSGVHP EAYPVVRRIL EATKSDIRVL IGNAAALRPL SPAAFADERF GVPTVRDIIA ELEKPGRDPR PAFKTASFQE GVEKIGDLRP GMQLEGVVTN VAAFGAFVDI GVHQDGLVHI SAMARKRIAS PSEVVKTGDV VRVLVLSVDV PRKRIALSMR LDDPLEGATA PRGNVPRPEA QPRRPAPAAP PQDGALADAL RRAGVSSSKR S
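
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard NCBI layout of translation table 11, in which GTG is one of the permitted start codons and is read as M when it initiates the CDS. The helper names `clean`, `translate` and `gc_fraction` are hypothetical and not part of the record. Whitespace in the input (the 10-base grouping used above) is ignored.

```python
BASES = "TCAG"
# Amino acids for translation table 11, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Start codons permitted by table 11 (assumed from the NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS on the + strand, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # An alternative start codon in first position is read as methionine.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)
```

For example, `translate("GTGAAGAGCG TGAACCTGCT GATCGCCGAG")` gives `MKSVNLLIAE`, the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence is one way to compare against the GC content field.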