Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3711 |
Symbol | |
ID | 4070461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4384996 |
End bp | 4386909 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637985734 |
Product | LuxR family transcriptional regulator |
Protein accession | YP_592786 |
Protein GI | 94970738 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4566] Response regulator |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.731459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTCAC CTGATCTTTC CAGCAAAATT GATGCGTTTC CCGAGCCGAC GCTTTTAATC GACCGCACCG GAACGGTTGT CGCTTCTAAT CGGAGTGCGT CGCAGTTGCT GCGGTTATCA TCGGCGAGAA TCATCGGGAA ACCCCTTCGT AAGCTCGTTC GTGAGGATCC CGAGAAGCTG GCGGGATACA TTCGCGGCTG GCTGCGCAAT TCCAAACAGG CCTCGGACAA GCGGACCACG CTGACGGTCC TTCCCAGGGA TGGGCGAGAG ATTGTTTGCA GAGCTGTAGG AGCATTATTG CCGGCTCGCG TGGATCAACC CCGCCTGCTT TGCCTTAGGT TTCTAGCTCT TACAATGACC GCGACAGAGC AACGCGAACC GCATCTCCTT TCCGCGTGTA AAAAGCAAGC GCAACCGCAA GCCGATCAAA GATGGCGAAC TGCCTTCGAG AACGCGGCGA TCGGAATCGT CATGGCGGAC TTTAATGGTC GCTATTTCTC TAGCAATGCT GCGTTCCGCA GAATGTTGGG CTATACCGAG GCCGATCTCT ACCGATTAAC CTTCGATGAA GTCACGTTCG AGGGTGACCG TGAAGCTAAT CTGTTGTTGG TCCGCGAGCT CGTAGGCGGC AAACGACGAC ACTTCGAACT CGAGAAGCGC TACCGCCGCA AAGACGGCAC TTTGCTTTGG GTACGGCAAC ACGTTGCACT AGTTCCAGGA ATGCAAGGTG TTGCGCCATT TTGGCTAGGT GTTGTCGAGG ACATTACCGA CGGCAAGCGC GTTGAACATG AGCTCAGCGT GCAGCGACAG AAATTTCTTG AGAGCGAGGC GCGGCTGCAG GCGTTTTTTG AAAATAGCCC CAACCCGATT TTCATTAAAG ATCGAGAGGG CCGATATCTC CACGTTAATA GAGAATTCAA ACGAGTTCTC TGCTCCGCGC AAAAACGAGT CCTAGGAAAG AGAGACGACG AACTGTTCTC AGCCGAACAG GCCGCTGCTT TTCAGGCCAA TGACCGGCAG GTGCTCGAAG CTGGCGTTCC GATGGAATTT GAGGAGACCG CTTTCGAAGA AGATGGGCAG CACACCAGCA TCGTCCAGAA GTTTCCGCTA TTGAATGCGG AGGGAGAGAT TTACGGCATT GGCGGCATTG TTACCGATAT CACTGAGCGT AAGAAATCGG AATCCGCGCT TCGATTCAGT GAGGAGTCGT ATCGCGTTGT GGTCGAAACG GCCAACGATG CCGTTATCAG CACAGACGAA ACCGGCACGA TCCGTTTTGC CAATTCCACT ACTTCGAGAG TCTTTGGATA TGACTCAACT GAGCTGATCG GGAAGCCCTG GACTGTGCTG ATGACAGAAC ACCTGCGCCA GGTACATGAG GCCGGATTCA GACGCTACTT GGAAACTGGC GTGCGGCATA TGAACTGGCA AGGTACCGAG CTGGTTGGAC TGCACAAGAA TGGGAGAGAG TTCCCGGTAG AGATTTCAAT CGGAGAGCTT GCCCGGAGTG GCCGGCGGAT GTTCACCGGC TTTATCAGAG ATATCAGTGA AAGAAAGCAG GCAGAAGAAA TCCGAACCAC TGCATTTGAA TTCCTCACGA AGCCATTCAG CGATCAGGAT CTGCTGGAGG CGATTCGCAT CGCACTGGAG CGACATCGCC ATAAGCAGGG ACACGAGCAA GAGCTGGCGA AGCTTCGACG GCGCTTCGGG TTCTTGAGCT TCCGGGAGCG GGAAGTAACC TCTATGGTTG TTTCGGGCAT GGCCAACAAA CAGATCTCCG TTAAACTCGG GATATCGGAA AACACCGTTA AGGTTCACAG GAGTCGAGCA ATGAAGAAGA TGCAGGCCCG GTCTCTTCCT GAGCTCGTCA GAATGATGGA ACAGCTGAAA GACTTTTTTG AAAAGCCAGC GTAA
|
Protein sequence | MGSPDLSSKI DAFPEPTLLI DRTGTVVASN RSASQLLRLS SARIIGKPLR KLVREDPEKL AGYIRGWLRN SKQASDKRTT LTVLPRDGRE IVCRAVGALL PARVDQPRLL CLRFLALTMT ATEQREPHLL SACKKQAQPQ ADQRWRTAFE NAAIGIVMAD FNGRYFSSNA AFRRMLGYTE ADLYRLTFDE VTFEGDREAN LLLVRELVGG KRRHFELEKR YRRKDGTLLW VRQHVALVPG MQGVAPFWLG VVEDITDGKR VEHELSVQRQ KFLESEARLQ AFFENSPNPI FIKDREGRYL HVNREFKRVL CSAQKRVLGK RDDELFSAEQ AAAFQANDRQ VLEAGVPMEF EETAFEEDGQ HTSIVQKFPL LNAEGEIYGI GGIVTDITER KKSESALRFS EESYRVVVET ANDAVISTDE TGTIRFANST TSRVFGYDST ELIGKPWTVL MTEHLRQVHE AGFRRYLETG VRHMNWQGTE LVGLHKNGRE FPVEISIGEL ARSGRRMFTG FIRDISERKQ AEEIRTTAFE FLTKPFSDQD LLEAIRIALE RHRHKQGHEQ ELAKLRRRFG FLSFREREVT SMVVSGMANK QISVKLGISE NTVKVHRSRA MKKMQARSLP ELVRMMEQLK DFFEKPA
|
| |