Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0043 |
Symbol | |
ID | 5835743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 50049 |
End bp | 51131 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641365827 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001637542 |
Protein GI | 163849499 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCTCTT GTGCCGCTTT GGCAACCGGA TACCCCAGCC CACCTGAGAA CCTGCTTCCT GTCGGCTTGG CGCACCTCAG GGTCGCTCAA GCGATCTACG CCGTCCTCGT CGAACTCGAA GCGGATCCCG ACACCCTCAT GGTCGAGGCA GGGCTCGATC CGAGGCTCTT CGAGTGTCAT GGCAAGCTCG TCCCGTACAG CTCCCTCGGT CGCCTGATTG CCATAGCGGT TGAGCGCACC CGCTGCCCTC ATTTGGGGCT CCTCATCGGG CAAAGGACCA CCATCACCTC CCTCGGGCTT CTCGGCCTGC TGTTGAACAA CTCGGACACC ATCGGGGATG CCTTGCGGGC TCTCGAAGCG TATCTGGGTA TGCTGAATCG GGGTGCGGTG GTCGGCCTCG GCATCGACAA CGACGTGGCC GTCCTCACCT ACTGCCCGTA TGAGCCGGGA GCCGAGGGTG CTGTCCACCA TTCGGATCGG GCGCTGGCCA CAGCGACAAA CATCCTTCGA GCGGTGTGCG GGTCTGATTG GGCCCCGTTG GAAGTTCTGC TGCCGCGCTC TGCTCCGAGC AGCACGACGC CTTATAGCCA GTTCTTTCGA GCACCCGTCC GGTTCGATCA GGAGACGGCC GCCTTGGTCT TCCCGGCCAG CCTCCTCAAG CGGCGCATCG CCGGAGCCGA TCCGGTCCTG CGCCGCAGAG TGGAGGATCG TATCCGCCGG CTTGAATCGG CACGACCCTC CACGCTTAAG GACGGGCTTC GTCAGCACCT GCGCGCCGAG GTGATCCGGA AGCGTTGCGA CGTGATGAAG ACGGCGCTTC GCCTGTCGCT CGGTCGCCGC ACCTTGAGCC GCCGCTTGAG AGCCGAAGGG ACAAGCTTCA AGCAGCTTGC TAATGAGGCG CAATTCCGGG TGGCCAAGCG CCTCTTGGCC GATACCAAAA TGAGCATGAC GGAGATCTCG GCCGTCCTGG ACTTCTCGGA GCCCGCCGCC TTCACGCACG CCTTCCGACG CTGGGCAGGC ATGACGCCCA GTTCATGGAG GCAGGGAAAC CATTCCGAGA CGAAGCGCGA GGAGCATCAA TAA
|
Protein sequence | MFSCAALATG YPSPPENLLP VGLAHLRVAQ AIYAVLVELE ADPDTLMVEA GLDPRLFECH GKLVPYSSLG RLIAIAVERT RCPHLGLLIG QRTTITSLGL LGLLLNNSDT IGDALRALEA YLGMLNRGAV VGLGIDNDVA VLTYCPYEPG AEGAVHHSDR ALATATNILR AVCGSDWAPL EVLLPRSAPS STTPYSQFFR APVRFDQETA ALVFPASLLK RRIAGADPVL RRRVEDRIRR LESARPSTLK DGLRQHLRAE VIRKRCDVMK TALRLSLGRR TLSRRLRAEG TSFKQLANEA QFRVAKRLLA DTKMSMTEIS AVLDFSEPAA FTHAFRRWAG MTPSSWRQGN HSETKREEHQ
|
| |