Gene Information |
Locus tag | Acid345_0533 |
Symbol | |
ID | 4069953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 657088 |
End bp | 658173 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982538 |
Product | LacI family transcription regulator |
Protein accession | YP_589612 |
Protein GI | 94967564 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
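The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(657088, 658173)
print(length, protein_length(length))  # 1086 361
```

Both values match the Gene Length and Protein Length fields of this record.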
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.985534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.47776 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCTCGGC GTAAAACGAT GCCCCCTCCG CGGACATCGT CGGACCATAA AAAACCGATC AGTCTGAAGC GTCTCGCCGA GCATCTCGGG CTCTCTTCTG CCACTGTTTC CATCGTCATT AACCGCAAGC CCCTCTCCGA CATGATCCCG GAAGAAACCA AGACCCGGAT ATGGGAAGCG GCGAGCCGTT TCAACTATCG GCCCAACATC ATCGCTCGCT CTTTGCGCCA GCAGCGGACC TACTCCATCG GGGTTTTGCT GCCGGAGTTT AGCGACGGTT ATTCCGCTTT GGTGCTAAGC GGCATTGAAG ACTACCTTTT AGGGAAGGGG TATGCATGGC TGGCGGCAAG CCATCGTCAC AAAGATGAAT TGATCCGCGA ATACCCGCAC CTGCTTTACA CCCGCGCGGT CGAGGGTTTG ATAACCATCG ACACCCCCTA TGACGAGCAT CTGCCGTTCC CCGTCGTGTC TGTTTCCGGA CACCAGACCA TCGAGGGCGT GACCAATATC GTGCTCAACC ACGATCGCTC CGCGGAGCTT GCAATCGGTC ATCTCCACGA GCTCGGCCAT CGACGCATCG CCTTCATCAA AGGACAATCC TTCAGCTCCG ATACCCAGGT CCGCTGGGAT TCGATCCGCA AGGCTTGCCG GAGCTTCGGC ATTACCGTTG ACCCGCAACT CGTGGCACAG CTCGAAGGTG TGTCTCCTTC GCCGGAGCCG GGATACCAAG CCGCGAAACG CATCCTCGCC AACAAAGTCG ACTTCACCGC GCTGTTCAGT TTTAACGACG TCTCCGCCAT CGGCGCCATC CGCGCATTGC AGGAAGCCGA CCTCCATGTT CCAGAAAGCG TATCCGTTGT CGGCTTCGAC GACATCGCCG TCGCGGCCTA CCACATCCCG GCATTGACCA CCATCCGCCA GCCACTGGGT CACATGGGTT CACTCGCCGC CGAAACGCTG GTCGAGCGCA TCGCTGCGCG CGGGAACGAA GGACCAGCAC TGCTCGAGGT CGAACCCGAA CTCGTCGTAC GCGAATCGAC CGCACCTCTT TCTACCGCCA AGGCCGTCCC TTCAGGCAAG GGATGA
Protein sequence | MPRRKTMPPP RTSSDHKKPI SLKRLAEHLG LSSATVSIVI NRKPLSDMIP EETKTRIWEA ASRFNYRPNI IARSLRQQRT YSIGVLLPEF SDGYSALVLS GIEDYLLGKG YAWLAASHRH KDELIREYPH LLYTRAVEGL ITIDTPYDEH LPFPVVSVSG HQTIEGVTNI VLNHDRSAEL AIGHLHELGH RRIAFIKGQS FSSDTQVRWD SIRKACRSFG ITVDPQLVAQ LEGVSPSPEP GYQAAKRILA NKVDFTALFS FNDVSAIGAI RALQEADLHV PESVSVVGFD DIAVAAYHIP ALTTIRQPLG HMGSLAAETL VERIAARGNE GPALLEVEPE LVVRESTAPL STAKAVPSGK G
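The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a hedged sketch, not the database's own pipeline: it assumes the sequence is read in frame from its first base, strips the spaces used to group bases in tens, and relies on the fact that translation table 11 assigns the same amino acid to every codon as the standard code (the tables differ in which codons may act as start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# codon-to-amino-acid assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence above.
print(translate("ATGCCTCGGC GTAAAACGAT G"))  # MPRRKTM
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field.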