Gene Information |
Locus tag | Emin_0877 |
Symbol | |
ID | 6262825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 965255 |
End bp | 966151 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642611356 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001875769 |
Protein GI | 187251287 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
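
The length fields above can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates (an assumption about this record's convention, though it matches the listed values) and a CDS that ends in a single stop codon. The function names are hypothetical and for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(965255, 966151)
print(length, protein_length(length))  # 897 298
```

Under these assumptions, 966151 - 965255 + 1 = 897 bp, and 897 / 3 = 299 codons, of which one is the stop codon, leaving 298 aa.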
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000157744 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000000198967 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAACAAAA ACTTATCTGA ACTAAACAAA CAACTGCAAG AAAAAGTGCT TAATTTAGTC CCGAAACCGG GCAGCTTTGC CACTGATATA AAAGGTCTTA GAATATTTAG AAGAAACCAG CCCGAAGAGG CAAGAAAATG TTTTTATAAG CCTATCATTG CTTTAATGCT GCAAGGCAGC AAACAATGTG TTTTCAACGC TGAGAAAATA GAATATGTAG CAAACGAATG TCTGGTAACA AGTATTGATA TACCAAGCGC AAGCAGAATT ACGGACGCCT CCCCCGAAAA GCCGTGTATA GGAGTTACAC TGGGAATAGA CAGCTTTATA ATAAAACAGC TTATCATTGA AACCAATCTT TTAAAAACAG ATTATCCCGC CCAAAGAGCA GTTGGCGTAA CAAAGGCTAA CAGCGAAATA CTGGACGCGT TTTTACGTAT TGTAACGCTT TTAGAGCAGC CCAAAGAACA GCAAAATATT TTAGCTCCCA TGATTATACG CGAAATTTAC TACCGCCTGC TTATGACGCC TGTAGGCGAG CAGCTTAGAA TGGTAAATAC TGTCGGCACA AAAAGCAACC AAATTGCCAC GGCAATAGAA TGGCTTAAAG AAAACTTTAA AGAAGAATTG AATGTTGAGA AACTTGCCGC AAAGGTAAAT ATGTCCCCTT CCTCTTTTTA CCGTAATTTT AGAAAGGTAA CAGATGTAAG CCCTTTGCAA TACCAAAAAC AATTACGCTT ATATGAAGCG CAGCGTTTGA TGGTGTCTGA TAATTTGGAT GCTGCTAACG CCGGATACGC CGTAGGATAT GAAAGCCCCA CACAATTTAA CAGAGAATAT AAACGAATGT TCGGCAACCC GCCGAAAACA GATATAAAAA TACTACAAAC TCTTTAG
|
Protein sequence | MNKNLSELNK QLQEKVLNLV PKPGSFATDI KGLRIFRRNQ PEEARKCFYK PIIALMLQGS KQCVFNAEKI EYVANECLVT SIDIPSASRI TDASPEKPCI GVTLGIDSFI IKQLIIETNL LKTDYPAQRA VGVTKANSEI LDAFLRIVTL LEQPKEQQNI LAPMIIREIY YRLLMTPVGE QLRMVNTVGT KSNQIATAIE WLKENFKEEL NVEKLAAKVN MSPSSFYRNF RKVTDVSPLQ YQKQLRLYEA QRLMVSDNLD AANAGYAVGY ESPTQFNREY KRMFGNPPKT DIKILQTL
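
Both sequences are displayed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch, not part of the record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo uses only the first two blocks of the gene sequence.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence shown above.
first_blocks = "ATGAACAAAA ACTTATCTGA"
seq = clean_sequence(first_blocks)
print(len(seq), seq[:3])  # 20 ATG
```

Applied to the full gene sequence field, `len(clean_sequence(...))` can be compared with the listed gene length of 897 bp, and `gc_fraction` with the listed GC content.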