Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_1365 |
Symbol | |
ID | 6131008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 1503343 |
End bp | 1504404 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641641644 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001768315 |
Protein GI | 170739660 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.122979 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.036458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTGCGTGT GCATCGAGGC GGCTCCCTGC GCCCCCGCGC ACTCCGTTGC CGGGATAGGT GATTCGGGTT ACGTTCAGGT AACCTGGGCG TTCTGGGCAG TGCTTGCGAT GGAGCTTCAG GGACACACCG CGTTCAAGTA CGTGACGGGC AGACGGTTGG GGACGTCGTC CGGTCGCGGC TGGACGAGCG TCCTGGCCGA GCGCTGGGAT CATGAGGCTG GTGCCCTGCC CTCGCTGCTT CCCCGGGAAA CCGAGGTTGC GGTTCTGCTC AGCGGCCGTT CCCTCGTGTA CCGCGAGGGG GCTGGGTTGC GGCAGAGGAC TCCCGGTCAT TCCGGGACCG TCTGGCTCTG CCCGGCCGGC ATCCGGGAAG AACGCATCGA CTTCGAGCAG CCGCTCCACG ATTGTCTGCA CATCTTCCTG CCGCCCGATC CGTTCGCGGA GTGCGTGCTG CAGGATCTCA ATATCGATCC TGCTCGTGCG GGGCTTCGCT ATGAAGCAAT CGCCTACGAT CCGTTCATCG AGCAGATTGC ATTCGCGATC AACCGCGAGC TGCAGGCAGA AACCTCCGCC GGACGCCTGC TGGTCGAGTC GCTCGCCCGG TCACTTTCGG CATATCTCGT TAACCGCTAT TCGGAACTTT CGACGCGGGC GATAGGATTT TCGTCCGAGG CTAAGCCGAT CGACAGCCGG CGAATGTCGC GCGTTTTAGA GTTCATCGGA GCCCGCCTTG ATCAGAACTT TACCGTAGCG GAACTGTCAT CAGTCGCCTG CATGAGTCAG GCCCATTTCG CGCGCGCATT CAAGGCAACG ACCGGGCACG CGCCGCACGC GTTCGTAAGC CGGATGCGTC TCGAATCAGC GAAGCGGATG CTGGCCGATG GCCTCCGGCC GATCGGCGAG ATCGCCCTGG CTGCCGGCTT CTCTTCGCAA TCCAACTTCT CGAGGGCGTT CCGCAGCGCC GTGGGCCTGC CGCCCGGTGA CTATCGCCGC AGCCAAGCCC GATCGCGAGC CGCATCGGGA GATCTCACGC AGGCAGGGCA GCGATCGTCA GTCGCGGAAT AG
|
Protein sequence | MCVCIEAAPC APAHSVAGIG DSGYVQVTWA FWAVLAMELQ GHTAFKYVTG RRLGTSSGRG WTSVLAERWD HEAGALPSLL PRETEVAVLL SGRSLVYREG AGLRQRTPGH SGTVWLCPAG IREERIDFEQ PLHDCLHIFL PPDPFAECVL QDLNIDPARA GLRYEAIAYD PFIEQIAFAI NRELQAETSA GRLLVESLAR SLSAYLVNRY SELSTRAIGF SSEAKPIDSR RMSRVLEFIG ARLDQNFTVA ELSSVACMSQ AHFARAFKAT TGHAPHAFVS RMRLESAKRM LADGLRPIGE IALAAGFSSQ SNFSRAFRSA VGLPPGDYRR SQARSRAASG DLTQAGQRSS VAE
|
| |