Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0600 |
Symbol | |
ID | 4069633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 730289 |
End bp | 731335 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982605 |
Product | ECF subfamily RNA polymerase sigma-24 factor |
Protein accession | YP_589679 |
Protein GI | 94967631 |
COG category | [K] Transcription |
COG ID | [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR00741] ribosomal subunit interface protein [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTCC ATATCAGTTA CAAGGGTCTG GAAAAGACCC CCGACGTCGA ATCCACCGTC AAACTACACC TGAAAAAACT GGAGCGCCGA CTGCAGGTAT TCCGGCCCGA GCTGGTCAGC CTGCATGGCA GTGTCCTGCA AAAGTCGCCA CGAGCGGGGT TTGTGGTCGC CTTGAACCTC AAGCTCCCGA CGACCGACAT TGCGGCGGAA CAGGCGAACC CAAATTCAGC AGTCGCGGTG AAGGCCGCGT TCGATTCCCT CATCGGCCAG ATCTCGCGCC ACAAAGGTGC GCTGCGCAAC GAGCACGCCT GGCCGCGACG ACAACAGGAA CATGGTCGCA ATGCGATCAG CGAAGTTCCC TTTGAAGACA CGGTCGCAGC GATCAAGCCA GAGAGCGTGA CCAACGAAGA TGTGAGCAGC TACATCAACG CCAACCTGCC TCGACTGCGC CGGTTCGTGC AGCGCGAACT GCGGCATCGC GAACAAGATG AGAAGATCGC GCGCGGCTCG ATCTCGGTTG ACGAAGTGAT TGACGAAGCC ATCGGCAACG CGTTGAGTGA GGCCTTTGAG CGGCCCGAGA AGATGCGCCT CGAGCCATGG CTCTATCGGC TGTCCACCGA TGCGATCGAT CGGCTCGCAG CCGGCGATTC CGGGGGCGGA AATATTCCAC TGGACCGGCC CGACCGTTCG CGCGATGGCG AAGGCAGCGA CGAAAACGTG CTCCAGTTCC ACCAACCCGA CGAAGATCTG TCGGCAATGA GCCTGACGTT CGACAAGAAC ATTTCCACGC CCGAAGACCT CGCGGCGAAG GACGAGATGA TTTCGCTCGT CGAACGCACG TTGCGCGATG CTGGGAGGAA CGAACGCGAG GCTTTCATTC TTTTCACCAT CGAGGGCTTC ACCGTCGAAG AGATTGCAGA CATCACCCAG AAACCGGAAG AGGAGGTAAG AAAGAGCGTG CATTCCGCCC GTGAGTACTT GAAAGAGTTT TTGCCCGTGC GGGATCCGCT GAGTGACCGT TTGATCGAAC ACTCCAAAAC GGCCTAA
|
Protein sequence | MNVHISYKGL EKTPDVESTV KLHLKKLERR LQVFRPELVS LHGSVLQKSP RAGFVVALNL KLPTTDIAAE QANPNSAVAV KAAFDSLIGQ ISRHKGALRN EHAWPRRQQE HGRNAISEVP FEDTVAAIKP ESVTNEDVSS YINANLPRLR RFVQRELRHR EQDEKIARGS ISVDEVIDEA IGNALSEAFE RPEKMRLEPW LYRLSTDAID RLAAGDSGGG NIPLDRPDRS RDGEGSDENV LQFHQPDEDL SAMSLTFDKN ISTPEDLAAK DEMISLVERT LRDAGRNERE AFILFTIEGF TVEEIADITQ KPEEEVRKSV HSAREYLKEF LPVRDPLSDR LIEHSKTA
|
| |