Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4253 |
Symbol | |
ID | 5834831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4729702 |
End bp | 4730790 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641370044 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001641693 |
Protein GI | 163853650 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.249278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCTT CTGTCATAGC AGGACTGCCC GGACCCGCAG TCCGGCCGGT GGGAACCGGC AGCCCCACCG CGAACGGGAA GCCGCTCCCT GTTGCCTTGA GCCGTCCTGG GGTTGGTCCG GCGGTTTACG CCACCCTTAT CGAACTCGGC GCAGATCCGG ACGGCCTCCT TGCCGAGTTG GGGTTTAATC CCGGGCTCTT CGACGGCGGC AAGCTAGTCC CCTATGCCGC CCTCGGCCGC CTGATCACCC TCGGGGCCGA GCGGACGAAT TGCCCTCACC TTGGGCTCCT CGTCGGCCAG CGCGCCACCC TGGCGTCGCT GGGGCCGATC AGCGTGTTGA TGCGCCACTC GGACAGCGTC GGCGATGCCT TGCGGGCTCT CGTCGCGCAC TCGGGCGCGC AGAACTGGGG CGCGGTGTTC GGTCTAGGCA TCGACAGCGG CGTCGCCGTC CTCAGCCACG CCCCTTATGG TCCGGAAGCC GAGTGCACGG GCATCCAGTC GGAGCGAGCC CTCGCCACAT TGACCAACGT CCTTCGGGCG CTGTGCGGGG CTGATGGGGC GCTGCAGGAG GTTCTGCTGC CGCGCTCCAA GCTACGCGAC ACCCATCTCT ATGACCAGTT CTTCCAGGCG CCCGTCCGGT TCGATCAGGA AGTGGCCGCC TTGGTGTTCT CGGCCGAAGT TCTTAAGCAG CCCATCGCCG GTGCAGACCC TATTGTCCGC CAGAGGGCAG AGGAACGCCT TCGCCGACTT GAAGCTGAAC AGTCATCCAA TTTGACGGAG GAACTGCGTC GATACCTCCG GATCCAAATG ACCCGGCAGC TCTGCACGGC GGAGCACGTG GCGCGTACGC GGCGGGTCAA CCGCCGCACC TTGAGCCGAC ATTTGCGGGC CGAGGGCACG ACCTTCAGGC GGCTTGCCAA CGAGGCACTG TTCCAGGTGG CCAAGCAACT GTTGGCCGAC ACGTGCATGA GCCTGACGGA AATCTCGGCC ACCCTGAACT TCTCCGAGCC GGCCGCTTTC ACGCATGCCT TCCGGCGCTG GTCGGGCACG ACGCCGAGTG CGTGGCGGCT GGAGAGCCGA GCGGCTTAA
|
Protein sequence | MSSSVIAGLP GPAVRPVGTG SPTANGKPLP VALSRPGVGP AVYATLIELG ADPDGLLAEL GFNPGLFDGG KLVPYAALGR LITLGAERTN CPHLGLLVGQ RATLASLGPI SVLMRHSDSV GDALRALVAH SGAQNWGAVF GLGIDSGVAV LSHAPYGPEA ECTGIQSERA LATLTNVLRA LCGADGALQE VLLPRSKLRD THLYDQFFQA PVRFDQEVAA LVFSAEVLKQ PIAGADPIVR QRAEERLRRL EAEQSSNLTE ELRRYLRIQM TRQLCTAEHV ARTRRVNRRT LSRHLRAEGT TFRRLANEAL FQVAKQLLAD TCMSLTEISA TLNFSEPAAF THAFRRWSGT TPSAWRLESR AA
|
| |