Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_5539 |
Symbol | |
ID | 6130122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 6076192 |
End bp | 6077286 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641645671 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001772286 |
Protein GI | 170743631 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGAGG GGACGGTGTC TAAGAGGCTG CCGCTGCCAA GCGAGAGGAT CTACGCTCCG TACAAAATAG CCGCGCTGGT TGAAATCTTG GGCGAGCAGG GCATCCCGCC CGAGGAGGCT CTGCGCGATA CGGGGGTTGA GGCAAGCAAA ATCTATGATG CCTCGGCTCT TACGTCGGTG CGGCAGTACG TGGCGGTTTG CAGAAATGCA CTCGCGCTAT CGTGCGAACC AAGAACACCT TTTCAAGTCG GCGCGCGCCT GCATCTTTCT GCATACGGTA TGTATGGGTA CGCTTTGATG TCGTGTCTTT CACTCCGGGA CTACTTTAGA CTCGGAGTAA AATACCATTT ATTGGCTACC CCGACCCTTG CAATAGAGTG GAGAGAGTAC CCGAATGTGG CGGTGTGGAC GTTTCCTGAC GAATTTACAT TTGCTCCATC CAAGGAACTT CGGCAGTTCC TGATAGAACA GCAGTTTACA CAACAGGTCA CTCACCTGCA GGACGTTGCC GGCCGGAGTT GCCCACCGGC CAAGGCGCAT TTTTCGTACT CGGCGCCGGA ACATGCCGCT ATTTATGCCG ATTATCTTGG GTGTCCGTGT TTTTTTGAGC AAGAACATTG CGAGTTGCAC TACGATAGCG CCATTCTCGA ACAAAAGCCC CAGCTTGCGC ATCGGCTGAC GTCCGCTCTG CTTCAGGACG CGTGCGATAC TCTGATCGGA AAGGCTAATG CGTCGGCCGG TGTCGCCGGT GAGGTCTACC AGATCTTGAT GAGATCGCCC GGCGTGTTCC CTGATATGGA AGATGTAGCA CAGACCCTGC GTATGACATC TCGGACACTA CGGCGCCGCC TCGACGCCGA ACAGGTATCA TTTTCAGCAA TTATCGATGA CGTCCATCGT TCGCTGGCAA CGGAATATCT GCGAATGACA AGTATGAGCC TTGAGGACAT CGCGCTGCTT GTCGGTTTCA GCGATGCCGC GAACTTCCGG CGAGCCTTCA AACGGTGGAC CGGGAAAAAT CCAGGGGAGT TCCGTGGCGA GATGCCGCTA AGGGCGACGC ACCGGCATCA TGTCCCCCGT CACTCCGGTT CTTAA
|
Protein sequence | MLEGTVSKRL PLPSERIYAP YKIAALVEIL GEQGIPPEEA LRDTGVEASK IYDASALTSV RQYVAVCRNA LALSCEPRTP FQVGARLHLS AYGMYGYALM SCLSLRDYFR LGVKYHLLAT PTLAIEWREY PNVAVWTFPD EFTFAPSKEL RQFLIEQQFT QQVTHLQDVA GRSCPPAKAH FSYSAPEHAA IYADYLGCPC FFEQEHCELH YDSAILEQKP QLAHRLTSAL LQDACDTLIG KANASAGVAG EVYQILMRSP GVFPDMEDVA QTLRMTSRTL RRRLDAEQVS FSAIIDDVHR SLATEYLRMT SMSLEDIALL VGFSDAANFR RAFKRWTGKN PGEFRGEMPL RATHRHHVPR HSGS
|
| |