Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1255 |
Symbol | |
ID | 4069830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1527935 |
End bp | 1529041 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983264 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_590331 |
Protein GI | 94968283 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.180206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.312586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTGGA AGGGATTTCA AAAACCGAAG CGCCTCGCGT TTGACTCGGA GTCGTTGACC GACAAGTATG GTCACTTCTG GGCCCAGCCG TTTGAGCGCG GCTTCGGAAC CACGATTGGC AACGCGCTGC GCCGCGTGCT GCTTTCCTCG ATTGAGGGCG CTGCGATTAC CGCAGTGAAG ATTGAAGGCG TATTACACGA ATTCCAGTCA ATCCCTGGCG TCGTAGAGGA TGCGACGGAC ATCATCCTCA ACCTGAAGCA GATTCCGTTC CGCCTCAACG GAGACGCTCC CAAGGCGATC TACCTGCGCG CGGAACAGCC TGGCATTGTG ACCTCGGGCA TGATCGAGAC CGATGCCGAT GTCGAGATCC TCGACAAGGA CGTGTATATC GCCACCATCA GCGAAGGTGG CAAGCTCGAC ATGGAAATGC GGTTGAAGAA GGGCCGCGGC TACGTGTCAG CCGATAAGAA CTTCGACGAA GACCTTGGCC TCGGGTTCAT TCCGATCGAC TCGGTCCACT CGCCCGTCCG CAAGTGCAAC TACTCGGTGG AAGCAGCCCG TTTGGGTCAG ATCACCGACT ACGACAAGCT CTCGATTGAA TTGTGGACCA ATGGCTCCGT GAACCCGGCC GACGCGCTCG GCCTGGCCGC GAAGCTGCTC AAGGACCACA TGAACATCTT CATCAATTTC GAAGAAGAAA TCGAAGCTTC GCACGCGGAA GACCGCAAGC CGGAAATCCG CAACGAGAAC CTGAACCGCT CGGTGGAAGA GCTCGAGCTT TCGGTCCGCA GCTACAACTG CCTGAAGAAT GCCAATATCC AGACCATCGG AGAACTGGTG CAGAAGACCG AAGCAGAAAT GCTCAAGACC AAGAACTTCG GCCGCAAGTC GCTCAACGAG ATCAAGGAAA TTCTGGCCTC GATGGGACTG AGCCTGGGCA TGAAGATCGA CGAGCATGGC AACGCGGTGG CTCCGCCTCC GGGTTCGCAA CCTGCTCCGA GCTACGGCGG CTATCCGGGA AGCTACGGCA CCGGCGGAAC GTTCGGTGGC GGCGGCAACT ACGGTGGTGG CGGCGGCTTC GGCGGCGACA ACAACCCGGG CTTCTAG
|
Protein sequence | MLWKGFQKPK RLAFDSESLT DKYGHFWAQP FERGFGTTIG NALRRVLLSS IEGAAITAVK IEGVLHEFQS IPGVVEDATD IILNLKQIPF RLNGDAPKAI YLRAEQPGIV TSGMIETDAD VEILDKDVYI ATISEGGKLD MEMRLKKGRG YVSADKNFDE DLGLGFIPID SVHSPVRKCN YSVEAARLGQ ITDYDKLSIE LWTNGSVNPA DALGLAAKLL KDHMNIFINF EEEIEASHAE DRKPEIRNEN LNRSVEELEL SVRSYNCLKN ANIQTIGELV QKTEAEMLKT KNFGRKSLNE IKEILASMGL SLGMKIDEHG NAVAPPPGSQ PAPSYGGYPG SYGTGGTFGG GGNYGGGGGF GGDNNPGF
|
| |