Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_1578 |
Symbol | |
ID | 6134661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 1759629 |
End bp | 1760759 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641641844 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001768513 |
Protein GI | 170739858 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.321807 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTCAG CCATCAGTTG GAGTTGCGAG CGATCGCGGC CGGAGGCGAG GCGTGCCGAG ACCAGCATCC CTCCGCGCCC GGCCGCCCCT GCCGCCGTTC ACGGCAGCGC CGTCGAGGCG ATCTGCGAGG TTCTGGTGGC GCTCGGGATC GAGCCGGCTC CGCTGCTGGT GCAGAGCGGC ATCGGCCCGC GATGCCTGGA GGGAGCCGGG GCGCTCTCGT TCGAGAGTCT CGGACGCTTG ATGGCGCTCG CCGCCCGCCG GAGCGCCTGT CCGCATTTCG GCCTCCTGGT CGGCCAGCGC ACCACGCTGG CCTCGCTCGG GCTGCTCGGC GTGCTGATGA GGAACTCGGA GACGGTCGGT GACGCCCTGC GGGTGCTGGA GACGCATCAC GGTCTTCTCA ACCGAGGGGC CGTGATCCGC GTCGCGGTGA ACGGCCCGCT CGCCATCGCC AGCTACTCGC CCTACCGGCC CGAGGCCGAG GGGATCGCGC TCCATTGCGA GCGGGCGCTC ACGGCGATGA CGAACGTGAT CAGATCCCTC TGCGGGGGCG ATTGGGCGCC CGAGGAGGTG CTGCTGCCGC GCCTGGGGCC GGACGATGCG ACGCCCTACG CGAATGTCTT TCGCGCTCCC GTCCGCTTCG GACAGGAGAT CGCGGCGCTG ACCTTCCCGG CGCGCCTCCT CGGGCGGCCG ATCGGGGACG CGAGCCCGAT CGTGCGCAAG CTTGCCGAGC AGCGCATCCG CCAGTTCGCG GCCAGCATGC CCGCGGACCT GACGGACGAG CTGCGCCGGC ACCTGCGTGC CACCTTGACG CAAGGAGAGC TGAGCGCGCG CCAGGCCGCG GAGGCGCTGG CGGTTCACCG GCGGACGCTG AGCCGGCGTC TGAGGGCCGA GGGAACGAGC TTCCGATCGG TCGCGAACGA GACGCGCCTC TCCGTCGCCA AGCAGCTGCT GGCCGACACC AACCTGAGCT TGGCGGAGAT CTCCGTCGCC CTGGAATTCT CGGAGCCCGC CGCCTTCACC CATGCCTTCC GGCGCTGGAC CGGGACGACG CCGAGCGCGT GGCGCAAGCA GCGGCGAGAT CCGAGCGGCG GCGAGATGAG CGACGGCCGT CGCGCTGCCG CGGGCGCGTG A
|
Protein sequence | MMSAISWSCE RSRPEARRAE TSIPPRPAAP AAVHGSAVEA ICEVLVALGI EPAPLLVQSG IGPRCLEGAG ALSFESLGRL MALAARRSAC PHFGLLVGQR TTLASLGLLG VLMRNSETVG DALRVLETHH GLLNRGAVIR VAVNGPLAIA SYSPYRPEAE GIALHCERAL TAMTNVIRSL CGGDWAPEEV LLPRLGPDDA TPYANVFRAP VRFGQEIAAL TFPARLLGRP IGDASPIVRK LAEQRIRQFA ASMPADLTDE LRRHLRATLT QGELSARQAA EALAVHRRTL SRRLRAEGTS FRSVANETRL SVAKQLLADT NLSLAEISVA LEFSEPAAFT HAFRRWTGTT PSAWRKQRRD PSGGEMSDGR RAAAGA
|
| |