Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1686 |
Symbol | |
ID | 4069354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2043950 |
End bp | 2045020 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637983694 |
Product | LacI family transcription regulator |
Protein accession | YP_590761 |
Protein GI | 94968713 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.155219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTAAAAC GGACGAAAGC GGCAGGCGAC AATCGCGGCA ACGGCACGGC TGAAAATCGA AAGCCGAACA TCACGATTGC TGATTTGGCT GCGCACCTGA AGCTGACGAA AGGTACCATC TCCGCGGTCC TCAACAACTC CCCGTATTCG AAGTCGATTC CGCAGCACAC CAAGGACCGC ATTCTCGCTG CTGCTGCTGA ACTCAACTAC CAGCCAAACT TCCTTGCTCG CTCGCTCCGA CAAAAGCGGA GCTACAGCAT CGGTGTTGTT GCCGAAGAGA TCGGCGATCC CTACAGCAGC GTCATTATTA GCGGTATTGA GTCTGTTCTC AGCCGGATGA AGTACATCTT TCTAACGGTC GCGCATCGAC ACGACCCGAA TCTACTTCAG CAGTACTTCG ATATCCTCCG CACCCGTGGT GTGGAAGGGA TCATCGCGAT CGACACCCGG ATCGAGTCCT CTCCTGAACT CCCGCTCGTC GCCGTGCCTG GCTATTCGAA ATTTGACGGC GTGCATAACA TTGTTCTGAA CCATCGAACT GCGGCCAAAG TGGCGCTTGA ACACTTGGTT GGTCACGGCC ATCGACGAAT CGCAATCTTG CGCGGCCAGA TCCTCAGTTC AGATTCGGCA GAGCGTTGGC ATTCGATCCA AAAGGTCGCA CAAGAGATGT CGATCAAAAT CGATCAGGAC CTTGTGGTGC AGTTGAGTGG CGACCACGCT TCGCCGCAAC CTGGTTTCGA GGCCATTCAC GAACTGCAAG CTCGCCACGC CAAGTACACG GCGGTATTCG CATACAACGA CATGGCCGCG ATCGGAGCGA TCCAGGCGCT GAAGAAATTC GGCCTGCAGG TTCCGAGCGA CGTATCAGTG GTCGGATTCG ACGATGTGCG TGAAGCGACT TTCTACTCGC CATCTCTTAC GACGGTACGC CAACCTTTGC GCAAAATGGG CGAGACGGCC GCGGAAACCC TCGTCGGTCG AATTGAAGGC AAGACGGATC TGCCGGCGCA CGTGGAAGTG GAGCCGGAAT TTGTGATTCG GCAGTCCACC GGTGCAGCAC GTTCCCTCTA G
|
Protein sequence | MVKRTKAAGD NRGNGTAENR KPNITIADLA AHLKLTKGTI SAVLNNSPYS KSIPQHTKDR ILAAAAELNY QPNFLARSLR QKRSYSIGVV AEEIGDPYSS VIISGIESVL SRMKYIFLTV AHRHDPNLLQ QYFDILRTRG VEGIIAIDTR IESSPELPLV AVPGYSKFDG VHNIVLNHRT AAKVALEHLV GHGHRRIAIL RGQILSSDSA ERWHSIQKVA QEMSIKIDQD LVVQLSGDHA SPQPGFEAIH ELQARHAKYT AVFAYNDMAA IGAIQALKKF GLQVPSDVSV VGFDDVREAT FYSPSLTTVR QPLRKMGETA AETLVGRIEG KTDLPAHVEV EPEFVIRQST GAARSL
|
| |