Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0899 |
Symbol | |
ID | 4069110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1122129 |
End bp | 1123214 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637982906 |
Product | LacI family transcription regulator |
Protein accession | YP_589976 |
Protein GI | 94967928 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.290552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGATG TTGGCATGGC CGCGCGGAAA CCCGGCAAGA AAATACATGC GCAGCAAGAT GTCCCTCGCC GCGTGACATT GAAGTTCCTC GCCGAATATC TTCAGCTCTC TACTACAACA GTCTCGGTGG TGTTGAGCGA CTCTCCGCTC GCGATGACGA TCGCCCAGAA AACCAAGGAG AGAATCTGGG CCGCAGTTGA GAAGTTCCAG TACCGCCCCA ACATGTTTGC GCAGTATCTG CATTCCAAGC GCACCTTCAG CGTAGCCGTT CTCGTGCCGG ATATCGGAGA TGAATACTCG TCGTCACTCA TTAGCGGCAT CGAGCGGCGA CTGTCTGAAG CAGGGTACAA ATATATCGTT GCGAGCCATC GCGGTGCTCC GAAAGAAATC GAGACATCCC CGGAAACTCT CATGGATAGG GCAGTCGAGG GCATGATTTT CATCAATACC CCCCTCCAGA AGAGACTTCC AATTCCCGTT GTCGCTGTTT CTGACATCAC GACGGCACCG GGCGTGTCGA GGATTGTAAT CGACAATGAC CGTGCAATTT GGCTCGGGCT TTCACATCTC AAGCAGCTCG GTCACAAGCG GATCGCATTC TTCAAGGGGC CGGACCACAA CGGCGACACC GAAATGCGAT GGAAGGCCGT CCTCGAGAAT TCCGAGAAAT TCGGATTGGA AGTCGAGCGC GAATTGACCG TCCAACTCGG AACATATCCA GAAGTGAATG AATCAACAGT GTCCCACCAC GGGTACGCCG CTGCGATGAC GCTGCTCAAG CGAACGCGGA GCTTCACTGC TTTAATGGCG TTCAATGACG GTTCCGCAAT CGGAGCGATT CGCGCTTTCC AGGATGCAGG ACTCTCGGTG CCCAACGCCG TCTCGGTGAT CGGGATCGAT GACGTTCCTC TGGGTGAGTT TATCTACCCA CGCCTTACCA CAGTCCGACA GCCTCTTGAA CAAATGGGTC AGCTCGCCGC TTCAACACTC CTCGACCGGA TCAACGGAAT GACGGTGCTT GAGGAGACAA AGGTCCTTCC CGAGTTGATT GTGCGAGAAT CCACTGCTCC GCAGCGCTAC AGATAA
|
Protein sequence | MYDVGMAARK PGKKIHAQQD VPRRVTLKFL AEYLQLSTTT VSVVLSDSPL AMTIAQKTKE RIWAAVEKFQ YRPNMFAQYL HSKRTFSVAV LVPDIGDEYS SSLISGIERR LSEAGYKYIV ASHRGAPKEI ETSPETLMDR AVEGMIFINT PLQKRLPIPV VAVSDITTAP GVSRIVIDND RAIWLGLSHL KQLGHKRIAF FKGPDHNGDT EMRWKAVLEN SEKFGLEVER ELTVQLGTYP EVNESTVSHH GYAAAMTLLK RTRSFTALMA FNDGSAIGAI RAFQDAGLSV PNAVSVIGID DVPLGEFIYP RLTTVRQPLE QMGQLAASTL LDRINGMTVL EETKVLPELI VRESTAPQRY R
|
| |