Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0034 |
Symbol | |
ID | 5832931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 36123 |
End bp | 37223 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641365818 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001637533 |
Protein GI | 163849490 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.131953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTCA CAGCCAACCT CACCGCGACG AAGCAGGAGG TGCGTTCGCG CGCGGCGGCG CCGTCCATGC CCCCGAGCGC CACGGGCGCG CTAGCCCGAC TCGCGGTGGC TCGCGCGGCT GCGTCAGGGA TTGATGTCCA GCCGTGCCTG TCCAGGGCCG GTCTGAGCGA GCGCCAGATC AAGAACCGGC ACGCTCGCAT CGGTGCGACA AACCAAGCCG CGCTCGTCGG TCTGCTGGCT GAGGCGCTCG AGGACGACCT CTTCGGTTTT CATCTCGGCC AAAGCTTCGA GCTCGGTGAG ATTGGGCTGC TTTATTACGT GATGGCCTCC GCCCCAACGC TGCGTGACGC ACTCTGTCGG GCGGAGCGCT ACGCCGCGAT CACGAACGAG GGTATCGCCC CGATCTATAG CCAGAGCGGT GAGGTCCGCG TCTCGTATGT CGGGCTGGCT CGGCACGCTG CACGGCATCA GGTCGAGTTC TGGATGACGG GTCTCGTCCG GGTCGCCCAG CAATTGACCA GCCTGCGACT ATCGCCAATC CACCTGACCC TGTGCCACCC ACGTCACGCG GGAGCCCGCG AGATCGAGGC CTTCCTCGGC TGCGCCATTG CGTTCGATGC CCTGGTGGAC GAGGTTCAAT TTCCCTCTGC CGCGGGGAAC GCGGTCCTGA CCGGCGCCGA CCCCTACCTG CACGATCTCC TGCTCGGGTA CAGCGAAGAA GCGCTCGCTC ATCGTGTCCG TTTGGCGGAA AGCCTACGTA CGCGGGTGGA GAACGCGGTG ATGCCGCTCC TGCCGCATGG TCGGCCGCGC ATCTCCGAGA TTGCACGGGC ACTGGGCACA AGCCAGCGAA CCCTGTCCCG CCGCTTGACC GAAGAGGGGC TCAGCTTCGA GAGCGTGTTG GAAGAGATGC GGCGGGACCT TGCCCTGCGC TACCTTCGGG ACACGCGTCT CTCGATCTCG CGCATCGCTT GGTTGCTGGG GTTCCGGGAG GCCACCGCCT TCACCCACGC CTTCCGGCGC TGGACGGGCC GATCGCCGAC GGAAGCGCGG GTAGAGCGGG ACCGGGCCCT GAACCCGTCG CCTCCGCAGC GGTCCAGTTA A
|
Protein sequence | MPFTANLTAT KQEVRSRAAA PSMPPSATGA LARLAVARAA ASGIDVQPCL SRAGLSERQI KNRHARIGAT NQAALVGLLA EALEDDLFGF HLGQSFELGE IGLLYYVMAS APTLRDALCR AERYAAITNE GIAPIYSQSG EVRVSYVGLA RHAARHQVEF WMTGLVRVAQ QLTSLRLSPI HLTLCHPRHA GAREIEAFLG CAIAFDALVD EVQFPSAAGN AVLTGADPYL HDLLLGYSEE ALAHRVRLAE SLRTRVENAV MPLLPHGRPR ISEIARALGT SQRTLSRRLT EEGLSFESVL EEMRRDLALR YLRDTRLSIS RIAWLLGFRE ATAFTHAFRR WTGRSPTEAR VERDRALNPS PPQRSS
|
| |