Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4314 |
Symbol | |
ID | 4071887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5124565 |
End bp | 5125842 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637986347 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_593388 |
Protein GI | 94971340 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.241702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGATCCA CGACATGCAT CGTGGCTGAC ACTCAGCACC AAATCGCGGG GATGGTGGAC CATCTGTTCC GCACCAGTGC GGGTCAGATG GTCTCGCACC TTACCCGTGT GCTCGGCCCG GCCCACCTCG ATCTCGCGGA AGAAGCCGTA CAGGATGCGC TTGTGAAGGC CCTGCAAAGC TGGCCTTTTG GCGGCGTTCC CAACAATCCC GGCGGCTGGC TCATGCAGGT TGCGCGCAAT CGCGCGCTCG ACATCGTTCG TCACCGCGGA ATGGCCGCCG AGAAGACCGG AGAAATCGTC GCAGAACTTA CCCGCGCCAA TCCAACCGGT GACATCGAAG TGCGCGACCA GTTTCGTGAC GACGAGTTGC GCATGATCTT CCTTTGCTGC CATCCACTGA TTTCGCGCGA TGCCCGCGTT GCGCTCAGTC TTAAGACGGT CAGCGGTTTC TCGATCGAGG AGATTTCTCG CGCGTTCCTC GCCGATCCGC CAACCATTGC GCAGCGCCTG GTGCGCGCCA AGCGGCAGAT ACGCGACGCC AACATCCGCT TCGATCTTCC GCCGCGTAAA GAACTTTCCG AGCGGCTCGA TTCCGTTCTC GAAGTGATTT ACCTGCTCTT TAACGAAGGC TACACGGCAC ACGCCGGCGA TGACCTGGTG CGGCAAGACC TTTGCGTGGA AGCCCTACGT CTCGCCATGT TGGTGGCGGC ATCGCCGGTA TCGCAACCGC GCGCCAACGC GCTGGTCTCG TTGCTGGCGT TCCAGGCCGC TCGCCTTCCC GCCCGTGTCG ATGACAAAGG CGAGTTGGTT CTTCTCGAGG ACCAGGACCG CAGCAAGTGG GACCAGAACT TAATCGCATT CGGTTTCCAC GAGATCGTGA AGAGCGCGCA AGGACAGGCC GTGTCTACGT ATCACATGCA GGCCGCGATC GCATCCATTC ACGCCCAAGC CAAAGACACT GCCGGCACCG ACTGGCCGAA GATCCTCATC CTTTACGACG ACCTGATGGC ACTGAATCCC TCAGCGATCA TCGCGCTGAA TCGGGCGATC GCCGTGTGGC GGGTCCACGG CGTTGTGGCG GCAATGCGCG AAGTGGACCA AATCGCCCAC GAACCAGCAC TCGCCCACTA CTATCTTCTT CCCGCCACTC GCGGACGCCT CCTCCTCGAA ATCGGCGACC GTACGGCAGC AGCCGAGTGC TTCAGCGAAG CACTCAATCG CAAGTGCTCG GAACCGGAGC GCCGGTTCTT ACTGCGACAG CTGAAGGAAT GCGAATAG
|
Protein sequence | MRSTTCIVAD TQHQIAGMVD HLFRTSAGQM VSHLTRVLGP AHLDLAEEAV QDALVKALQS WPFGGVPNNP GGWLMQVARN RALDIVRHRG MAAEKTGEIV AELTRANPTG DIEVRDQFRD DELRMIFLCC HPLISRDARV ALSLKTVSGF SIEEISRAFL ADPPTIAQRL VRAKRQIRDA NIRFDLPPRK ELSERLDSVL EVIYLLFNEG YTAHAGDDLV RQDLCVEALR LAMLVAASPV SQPRANALVS LLAFQAARLP ARVDDKGELV LLEDQDRSKW DQNLIAFGFH EIVKSAQGQA VSTYHMQAAI ASIHAQAKDT AGTDWPKILI LYDDLMALNP SAIIALNRAI AVWRVHGVVA AMREVDQIAH EPALAHYYLL PATRGRLLLE IGDRTAAAEC FSEALNRKCS EPERRFLLRQ LKECE
|
| |